×

Making Free Text Reports Available for Researchers via a De-Identification and Natural Language Processing Pipeline

Overview:

With the recent advancements in predictive modeling and Machine Learning (ML) in health care, availability and accessibility of biomedical data provides an exciting opportunity for researchers. However, given majority of the clinical data is being unstructured, reproducible and scalable informatics tools that complies with regulations are needed. Dr. Topaloglu will present a set of tools that enables processing, de-identification, extraction of concepts and named entities, and deploying a search tool to provide researcher access and search notes prior to an IRB approval. These tools have been approved by the Privacy Board and the Compliance Office and implemented at Wake Forest Baptist Medical Center. We will also discuss use cases for a newly developed specimen search tool for pathology and radiology reports. In addition, the seminar also provides on overview of an ML-based natural language processing implementations for specific data marts and potential improvements via federated learning paradigm.

Speaker:

Umit Topaloglu, PhD, FAMIA, is a biomedical informatician with more than 15 years of research experience in semantic research data frameworks that includes standard based data collection, Natural Language Processing and Machine Learning as well as data quality dimension of those endeavors. He has been working on implementation of an innovative scalable clinical research informatics systems for the last 10+ years. He has developed unified research informatics solutions and strategies across research spectrum which is successfully used by the National Children’s Study and many cancer and translational research studies. He is also co-leading the Oncology Domain Team at the National COVID Cohort Collaborative (N3C). He currently serves as the Associate Director for the Center for Biomedical Informatics, CTSI Informatics Program, and the Wake Forest Baptist Comprehensive Cancer Center.

Watch previous presentations and to find more information about future seminars.

Originally recorded on Wednesday, March 31, 2021, as part of CHOIR's 

Overview:

With the recent advancements in predictive modeling and Machine Learning (ML) in health care, availability and accessibility of biomedical data provides an exciting opportunity for researchers. However, given majority of the clinical data is being unstructured, reproducible and scalable informatics tools that complies with regulations are needed. Dr. Topaloglu will present a set of tools that enables processing, de-identification, extraction of concepts and named entities, and deploying a search tool to provide researcher access and search notes prior to an IRB approval. These tools have been approved by the Privacy Board and the Compliance Office and implemented at Wake Forest Baptist Medical Center. We will also discuss use cases for a newly developed specimen search tool for pathology and radiology reports. In addition, the seminar also provides on overview of an ML-based natural language processing implementations for specific data marts and potential improvements via federated learning paradigm.

Speaker:

Umit Topaloglu, PhD, FAMIA, is a biomedical informatician with more than 15 years of research experience in semantic research data frameworks that includes standard based data collection, Natural Language Processing and Machine Learning as well as data quality dimension of those endeavors. He has been working on implementation of an innovative scalable clinical research informatics systems for the last 10+ years. He has developed unified research informatics solutions and strategies across research spectrum which is successfully used by the National Children’s Study and many cancer and translational research studies. He is also co-leading the Oncology Domain Team at the National COVID Cohort Collaborative (N3C). He currently serves as the Associate Director for the Center for Biomedical Informatics, CTSI Informatics Program, and the Wake Forest Baptist Comprehensive Cancer Center.

Watch previous presentations and to find more information about future seminars.