Dragon exploratory system on Hepatitis C Virus (DESHCV)

S.K. Kwofie, A. Radovanovic, V.S. Sundararajan, M. Maqungo, A. Christoffels, V.B. Bajic
Infect Genet Evol, 11(4):734-9, (2011)

Dragon exploratory system on Hepatitis C Virus (DESHCV)

Keywords

Hepatitis C Virus, Text-mining, Dictionaries, Biomedical concepts, Database, Hypotheses generation

Abstract

​Even though Hepatitis C Virus (HCV) cDNA was characterized about 20 years ago, there is insufficient understanding of the molecular etiology underlying HCV infections. Current global rates of infection and its increasingly chronic character are causes of concern for health policy experts. Vast amount of data accumulated from biochemical, genomic, proteomic, and other biological analyses allows for novel insights into the HCV viral structure, life cycle and functions of its proteins. Biomedical text-mining is a useful approach for analyzing the increasing corpus of published scientific literature on HCV. We report here the first comprehensive HCV customized biomedical text-mining based online web resource, dragon exploratory system on Hepatitis C Virus (DESHCV), a biomedical text-mining and relationship exploring knowledgebase was developed by exploring literature on HCV. The pre-compiled dictionaries existing in the dragon exploratory system (DES) were enriched with biomedical concepts pertaining to HCV proteins, their name variants and symbols to make it suitable for targeted information exploration and knowledge extraction as focused on HCV. A list of 32,895 abstracts retrieved via PubMed database using specific keywords searches related to HCV were processed based on concept recognition of terms from several dictionaries. The web query interface enables retrieval of information using specified concepts, keywords and phrases, generating text-derived association networks and hypotheses, which could be tested to identify potentially novel relationship between different concepts. Such an approach could also augment efforts in the search for diagnostic or even therapeutic targets.

Code

DOI: 10.1016/j.meegid.2010.12.006

Sources

Website PDF

See all publications 2011