COLLEGE DIRECTORY       :      VISIT ELLER      :      LOG IN 
Eller College of Management
Eller College Home > MIS > Artificial Intelligence Laboratory > Research > Genomic Pathway Visualizer (previous research)
Artificial Intelligence Laboratory

Genomic Pathway Visualizer

Previous BioInformatics Research


The Artificial Intelligence Lab has been involved in medical text mining since the early 90s. Several text mining techniques have been designed, developed and tested, including:

  • Arizona Noun Phraser: originally a general English noun phraser, it was adjusted to extract only relevant medical terms from text.
  • Automatic Indexing: stop wording and algorithmic index phrase formation.
  • Concept Space: index phrase co-occurrence information used to generate an automatic thesaurus for search term suggestion.
  • Keyword Suggester: A keyword suggester for the medical domain; it dynamically suggest additional keywords for user queries based on a mapping algorithm that combines Concept Space with the UMLS.
  • Java-based Graphical Thesaurus: graphical display of concept space terms.
  • Java-based Visual Browser: dynamic and scalable self-organizing maps for visualization and categorization of large retrieval set. [demo]

Return to Top


HelpfulMed provides a powerful search vehicle focused on improving the availability of, and access to, medical information on the Internet and in medical databases for professional and advanced users. HelpfulMed offers users three state-of-the-art technologies to search and browse for intelligent and reliable medical information. Our cutting-edge search technology allows users to locate essential cancer information by extracting precise noun phrases and determining relationships with other fine-grained medical terminology through HelpfulMed's proprietary concept-based search support. HelpfulMed's three proprietary technologies are summarized below.

A Medical Spider based on the Hopfield Net spreading activation algorithm was designed to specifically retrieve medical web pages. It is equipped with a medical vocabulary knowledge base. An inlink analysis algorithm which compares the UMLS knowledge base to the text of the web page was developed in order for the spider to assess whether the web page in question is indeed a medical web page.

The Medical Concept Space is an automatically generated thesaurus designed to facilitate concept-based, cross-domain information retrieval. The concept space contains over 48.5 million unique terms and over 1.7 billion relationships. The system suggests highly relevant search terms exactly as they appear in the documents, enhancing information recall. Terms are presented in categories according to their source: author, noun phrase terms, MeSH terms. The top 40 terms are displayed in order based on a weighting/ranking algorithm. The Medical Concept Space was generated on a Silicon Graphics Origin 2000 8-node processor, using 2 weeks of cpu time.

MEDMap is a 2-D multi-layered graphical display of important medical concepts and a document server that supports guided browsing of concepts and documents. MEDMap is based on the Kohonen self-organizing map algorithm. Using as input the 48.5 million unique terms and 1.7 billion relationships from the Medical Concept Space, the system partitions MEDLINE documents into 132.700 categories on 4586 maps. Clicking on a map region will take you down a layer in the multi-layered map, or show you the documents associated with that category. Textual concept labels and colors are used to demarcate regions in the SOM, color has no specific meaning. HelpfulMed was generated on a Silicon Graphics Origin 2000 8-node processor, using 14 days of cpu time.

Return to Top


MedTextUs is an online search system designed to facilitate efficient and precise information retrieval for medical professionals and the general public. The system is built upon advanced technologies in areas of information retrieval, document characterization, and visualization. The core technologies used include meta search, noun phrasing, concept mapper, and SOM (Self-Organizing Map). MedTextus starts by querying the selected medical literature database based on the given keywords. The spider then fetches the documents returned from those databases. After collecting the required number of web pages, further analysis will be performed. Noun phrases will be extracted from the pages, which allows the user to know what key concepts are related to a given document. The concepts can also be visualized in a 2-D map, which categorizes the web pages by collecting them into regions, each of which represents a concept. All these features allow the user to automatically collect information more effectively and represent it in a more meaningful way. [demo]

Return to Top

BioMedical Text Mining Publications

1. H. Chen, A. Lally, B. Zhu, and M. Chau, "HelpfulMed: Intelligent Searching for Medical Information over the Internet,"  Journal of the American Society for Information Science and Technology, 54 (7), 683-694, 2003

2. G. Leroy and H. Chen, “Meeting Medical Terminology Needs-The Ontology-Enhanced Medical Concept Mapper,”  IEEE Transactions on Information Technology in Biomedicine, Volume 5(4), 261-270, 2001.

3. K. Tolle and H. Chen, “Comparing Noun Phrasing Techniques for Use with Medical Digital Library Tools,”
    Journal of the American Society for Information Science, Special Issue on Digital Libraries,     Volume 51(4), 352-370, 2000.

4. A.L. Houston, H. Chen, B. R. Schatz, R. R. Sewell, K. M. Tolle, T. E. Doszkocs, S. M. Hubbard and D. T. Ng, “Exploring the Use of Concept Spaces to Improve Medical Information Retrieval,”
Decision Support Systems, Volume 30 (2), 171-186, 2000.

5. H. Chen, J. Martinez, T. D. Ng and B. R. Schatz, “A Concept Space Approach to Addressing the Vocabulary Problem in Scientific Information Retrieval: An Experiment on the Worm Community System,”
Journal of the American Society for Information Science, Volume 48 (1), 17-31, 1997.

Return to Top

BioMedical Data Mining Publications

1. K. M. Tolle, H. Chen and H. Chow,
Estimating drug/plasma concentration levels by applying neural networks to pharmacokinetic data sets,”
Decision Support Systems, Volume 30 (2), 139-152, 2000.

2. A.L. Houston, H. Chen, S. M. Hubbard, B. R. Schatz, T. D. Ng, R. R. Sewell and K. M. Tolle,   “Medical Data Mining on the Internet: Research on a Cancer Information System,”
Artificial Intelligence Review, Volume 13, 437-466, 1999.

3. H. Chow, K. Tolle, D. Roe, V. Elsberry and H. Chen, “Application of Neural Networks to Population Pharmacokinetics Data Analysis,”
Journal of Pharmaceutical Sciences, Volume 86 (7), 840-845, 1997.

4. H. Chow, H. Chen, T. Ng, P. Myrdal and S. H. Yalkowsky, “Using Backpropagation Networks for the Estimation of Aqueous Activity Coefficients of Aromatic Organic Compounds,”
Journal of Chemical Information and Computer Sciences, American Chemical Society, Volume 3 (4), 723-728, 1995.

Return to Top

For additional information, please contact us.