Natural language processing

Prototyping a precision oncology 3.0 rapid learning platform

We describe a prototype implementation of a platform that could underlie a Precision Oncology Rapid Learning system. We describe the prototype platform, and examine some important issues and details. In the Appendix we provide a complete walk-through …

Enriching PubMed related article search with sentence level co-citations.

PubMed related article links identify closely related articles and enhance our ability to navigate the biomedical literature. They are derived by calculating the word similarity between two articles, relating articles with overlapping word content. …

Yale Image Finder (YIF): a new search engine for retrieving biomedical images.

UNLABELLED: Yale Image Finder (YIF) is a publicly accessible search engine featuring a new way of retrieving biomedical images and associated papers based on the text carried inside the images. Image queries can also be issued against the image …

Leveraging the structure of the Semantic Web to enhance information retrieval for proteomics.

MOTIVATION: Proteomics researchers need to be able to quickly retrieve relevant information from the web and the biomedical literature. To improve information retrieval, we leverage the structure of the semantic web, developing an approach for …

Mapping terms to UMLS concepts of the same semantic type.

We are interested in mapping terms from the biomedical literature to controlled terminologies. For clinical and related terms, we rely on the MetaMap program for mapping terms to the UMLS Metathesaurus, accepting term assignments that have a …

Towards semantic role labeling & IE in the medical literature.

INTRODUCTION: In this work, we introduce the concept of semantic role labeling to the medical domain. We report first results of porting and adapting an existing resource, Propbank, to the medical field. Propbank is an adjunct to Penn Treebank that …

Term identification in the biomedical literature.

Sophisticated information technologies are needed for effective data acquisition and integration from a growing body of the biomedical literature. Successful term identification is key to getting access to the stored literature information, as it is …

GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data.

The immense growth in the volume of research literature and experimental data in the field of molecular biology calls for efficient automatic methods to capture and store information. In recent years, several groups have worked on specific problems …

Of truth and pathways: Chasing bits of information through myriads of articles

Knowledge on interactions between molecules in living cells is indispensable for theoretical analysis and practical applications in modern genomics and molecular biology. Building such networks relies on the assumption that the correct molecular …