Research overview

My research is in machine learning/data mining and natural language processing, with an emphasis on applications in health informatics.

For example, one of my major ongoing research aims concerns optimizing the processes of evidence-based medicine using novel natural language processing and machine learning methods. The aim is to reduce the (human) workload involved in conducting systematic reviews (i.e., making sense of the biomedical literature), so that we can realize the aim of evidence-based care in an era of information overload. An ongoing project in this direction is RobotReviewer.

More broadly, I am interested in core machine learning and natural language processing issues: e.g., structured and unstructured classiļ¬cation techniques; neural models; semi-supervised learning methods; learning from imbalanced data; and learning from alternative forms of supervision. I tend to be most excited by interdisciplinary research that motivates technical questions by way of interesting applications.

A random sample of semi-recent publications

Ye Zhang, Stephen Roller and Byron C. Wallace. MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for Sentence Classification North American Chapter of the Association for Computational Linguistics (NAACL); 2016.

Elisa Ferracane, Iain Marshall, Byron C. Wallace and Katrin Erk. Leveraging coreference to identify arms in medical abstracts: An experimental study The International Workshop on Health Text Mining and Information Analysis (Co-Located with EMNLP 2016); 2016.

Zhiguo Yu, Trevor Cohen, Todd R. Johnson, Byron C Wallace and Elmer Bernstam. Retrofitting Word Vectors of MeSH Terms to Improve Semantic Similarity Measures The International Workshop on Health Text Mining and Information Analysis (Co-Located with EMNLP 2016); 2016.


09/13/2016 AHRQ grant funded

My AHRQ R03 grant, Hybrid Approaches to Optimizing Evidence Synthesis via Machine Learning and Crowdsourcing, has been selected for funding!

07/30/2016 Panelist @ IEEE ICHI

I'll be sitting on a panel on computational methods for evidence synthesis at IEEE ICHI 2016.

06/01/2016 Talk @ U Lisbon/INESC-ID

I'll be giving a talk at the University of Lisbon this June.

05/19/2016 NIH grant funded

Our NIH "Big Data to Knowledge" proposal, Crowdsourcing Mark-up of the Medical Literature to Support Evidence-Based Medicine and Develop Automated Annotation Capabilities has been selected for funding! This is a collaborative effort with Ani Nenkova and Zachary Ives.


My work has been supported with grants from the National Institutes of Health, National Science Foundation, the Army Research Office, Seton hospital, Amazon and seed funds from Brown University.