Text Mining for Hypotheses and Results in Translational Medicine Studies

Tsai, Terry H.; Kasch, Niels; Pfeifer, Craig; Oates, Tim

Text Mining for Hypotheses and Results in Translational Medicine Studies

dc.contributor.author	Tsai, Terry H.
dc.contributor.author	Kasch, Niels
dc.contributor.author	Pfeifer, Craig
dc.contributor.author	Oates, Tim
dc.date.accessioned	2025-11-21T00:29:57Z
dc.date.issued	2014-12-14
dc.description	2014 IEEE International Conference on Data Mining Workshop, December 14, 2014, Shenzhen, China
dc.description.abstract	Most common and complex diseases, such as diabetes and cancer, are influenced at some level by variation in the genome. To truly address the goal of translational research, genetic variation must be taken into consideration. Research done in public health genetics, specifically in the area of single nucleotide polymorphisms (SNPs), is the first step to understanding human genetic variation. In addition, novel methods are needed to represent and to conduct text mining over textual genotypic data sources. In this paper, we describe the development and evaluation, in the context of a genetic study, of a translational-informatics method that supports both machine-learning text mining (e.g., Conditional random fields) and automated inference for identifying key concepts (e.g., Hypotheses and results). After scaling for inter-annotator agreement, our adjusted overall precision was 64%, with a range of 48% to 80%. While other biological text mining systems have focused on named-entity recognition, the development of tools for genetic studies focusing on hypotheses and results has been relatively rare.
dc.description.sponsorship	This work was supported with the following grant: NIH NLM T15LM007452. The authors would like to thank Harold Lehmann and Rui Zhang for their suggestions and advice. We would also like to thank Linda Kao, Nisa McArthur, and Lili Zhang for their help in annotating the documents.
dc.description.uri	https://ieeexplore.ieee.org/document/7022589
dc.format.extent	6 pages
dc.genre	conference papers and proceedings
dc.genre	preprints
dc.identifier	doi:10.13016/m282iv-mqwm
dc.identifier.citation	Tsai, Terry H., Niels Kasch, Craig Pfeifer, and Tim Oates. “Text Mining for Hypotheses and Results in Translational Medicine Studies.” 2014 IEEE International Conference on Data Mining Workshop, December 2014, 127–32. https://doi.org/10.1109/ICDMW.2014.39.
dc.identifier.uri	https://doi.org/10.1109/ICDMW.2014.39
dc.identifier.uri	http://hdl.handle.net/11603/40815
dc.language.iso	en
dc.publisher	IEEE
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department
dc.relation.ispartof	UMBC Student Collection
dc.rights	© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.subject	Genomics
dc.subject	Natural language processing
dc.subject	Diabetes
dc.subject	Medical diagnostic imaging
dc.subject	Translational informatics
dc.subject	Diseases
dc.subject	Biomedical informatics
dc.subject	Text mining
dc.subject	UMBC Accelerated Cognitive Cybersecurity Laboratory
dc.subject	UMBC Ebiquity Research Group
dc.subject	Bioinformatics
dc.subject	Gene-environment interaction studies
dc.subject	UMBC Cognition, Robotics, and Learning (CoRaL) Lab
dc.title	Text Mining for Hypotheses and Results in Translational Medicine Studies
dc.type	Text

Files

Original bundle

Now showing 1 - 1 of 1

Name:: PID3434049.pdf
Size:: 340.99 KB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Faculty Collection
UMBC Computer Science and Electrical Engineering Department
UMBC Student Collection