- Biomedical text mining
Biomedical text mining (also known as BioNLP) refers to text mining applied to texts and literature of the biomedical and molecular biology domain. Itis a rather recent research field on the edge of
natural language processing, bioinformatics, medical informaticsand computational linguistics.
There is an increasing interest in
text miningand information extraction strategies applied to the biomedical and molecular biologyliterature due to the increasing number of electronically available publications stored in databases such as PubMed.
The main developments in this area have been related to the identification of biological entities (named entity recognition), such as
proteinand genenames in free text, the association of gene clusters obtained by microarray experiments with the biological context provided by the corresponding literature, automatic extraction of protein interactions and associations of proteins to functional concepts (e.g. gene ontologyterms). Even the extraction of kinetic parameters from text or the subcellular location of proteins have been addressed by information extraction and text mining systems.
* [http://www.xtractor.in/ XTractor] - Discovering Newer Scientific Relations Across
PubMedAbstracts. A tool to obtain manually annotated relationships for Proteins, Diseases, Drugs and Biological Processes as they get published in PubMed.
* [http://www.chilibot.net/ Chilibot] : A tool for finding relationships between genes or gene products.
* [http://www.ihop-net.org/UniPub/iHOP/ Information Hyperlinked Over Proteins (iHOP)] (ref.: [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16204114 Bioinformatics, 2005 Sep 1;21 Suppl 2:ii252-ii258.] ): "A network of concurring genes and proteins extends through the scientific literature touching on phenotypes, pathologies and gene function. iHOP provides this network as a natural way of accessing millions of PubMed abstracts. By using genes and proteins as hyperlinks between sentences and abstracts, the information in PubMed can be converted into one navigable resource, bringing all advantages of the internet to scientific literature research."
* [http://fable.chop.edu FABLE] : A gene-centric text-mining search engine for MEDLINE
* [http://www.gopubmed.org GoPubMed] : retrieves
PubMedabstracts for your search query, then detects ontology terms from the Gene Ontologyand Medical Subject Headingsin the abstracts and allows the user to browse the search results by exploring the ontologies and displaying only papers mentioning selected terms, their synonyms or descendants.
* [http://www.litinspector.org LitInspector] - Gene and signal transduction pathway data mining in
* [http://www.pubgene.org PubGene] -
Co-occurrence networksdisplay of gene and protein symbols as well as MeSH, GO, PubChemand interaction terms (such as "binds" or "induces") as these appear in MEDLINErecords (that is, PubMedtitles and abstracts).
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15998455 Text-mining and information-retrieval services for molecular biology]
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15886388 Text mining for metabolic pathways, signaling cascades, and protein networks]
* [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15808823 Text-mining approaches in molecular biology and biomedicine]
* [http://blimp.cs.queensu.ca/ Biomedical LIterature Mining Publications (BLIMP)] : A comprehensive and regularly updated index of publications on (bio)medical text mining
Conferences at which BioNLP research is presented
BioNLP is presented at a variety of meetings:
Pacific Symposium on Biocomputing: in plenary session
Intelligent Systems for Molecular Biology: in plenary session and also in the BioLINK and Bio-ontologies workshops
Association for Computational Linguisticsand North American Association for Computational Linguisticsannual meetings and associated workshops: in plenary session and as part of the BioNLP workshop (see below)
* [http://compbio.uchsc.edu/BioNLP2008 BioNLP 2008]
American Medical Informatics Associationannual meeting: in plenary session
* [https://lists.ccs.neu.edu/pipermail/bionlp/ The BioNLP mailing list archives]
* [http://compbio.uchsc.edu/ccp/corpora/index.shtml Corpora for biomedical text mining]
* [http://biocreative.sourceforge.net The BioCreative evaluations of biomedical text mining technologies]
* [http://compbio.uchsc.edu/Hunter_lab/Cohen/bioNlpPeople.html Directory of people involved in BioNLP]
Wikimedia Foundation. 2010.
Look at other dictionaries:
Text mining — Text mining, sometimes alternately referred to as text data mining , roughly equivalent to text analytics , refers generally to the process of deriving high quality information from text. High quality information is typically derived through the… … Wikipedia
National Centre for Text Mining — The National Centre for Text Mining (NaCTeM) was the world’s first publicly funded text mining (TM) centre. It was established to provide support, advice, and information on TM technologies and to disseminate information from the … Wikipedia
Biomedical engineering — For the Russian journal on the subject, see Meditsinskaya Tekhnika. Ultrasound representation of Urinary bladder (black butterfly like shape) and hyperplastic prostate. An example of engineering science and medical science working together … Wikipedia
Mining — This article is about the extraction of geological materials from the Earth. For the municipality in Austria, see Mining, Austria. For the siege tactic, see Mining (military). For name of the Chinese emperor, see Daoguang Emperor. Simplified… … Wikipedia
Data mining — Not to be confused with analytics, information extraction, or data analysis. Data mining (the analysis step of the knowledge discovery in databases process, or KDD), a relatively young and interdisciplinary field of computer science is… … Wikipedia
List mining — can be defined as the use, for purposes of scientific research, of messages sent to Internet based electronic mailing lists. List mining raises novel issues in Internet research ethics. These ethical issues are especially important for health… … Wikipedia
Biovista — Infobox Company company name = Biovista, Inc. company type = Private company slogan = To Seek, To know, To Act foundation = Charlottesville, VA, USA (2005) location = flagicon|USA Charlottesville, VA, USA industry = Biotechnology services = Drug… … Wikipedia
Natural language processing — (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages; it began as a branch of artificial intelligence. In theory, natural language processing is a very attractive… … Wikipedia
PubGene — Infobox Company name = PubGene Inc. type = Privately held genre = foundation = 2001 founder = location city = Boston location country = USA location = locations = area served = Global key people = Eirik Næss Ulseth (CEO) industry = Bio… … Wikipedia
Bioinformatics — For the journal, see Bioinformatics (journal). Map of the human X chromosome (from the NCBI website). Assembly of the human genome is one of the greatest achievements of bioinformatics. Bioinformatics … Wikipedia