Grant Details
Grant Number: |
5R21CA202130-02 Interpret this number |
Primary Investigator: |
Egleston, Brian |
Organization: |
Research Inst Of Fox Chase Can Ctr |
Project Title: |
Deep Learning for Representation of Codes Used for Seer-Medicare Claims Research |
Fiscal Year: |
2017 |
Abstract
DESCRIPTION (provided by applicant): We propose developing an algorithm and user-friendly software to better identify treatments using Medicare claims data. We will validate our approach using procedures listed in the Surveillance, Epidemiology, and End Results (SEER) database as a gold standard. In this way, we hope to better match procedures identified using Medicare claims data with SEER listed procedures. The focus of this research is observational (i.e. non-randomized) data. Well-run randomized clinical trials can provide the best level of evidence of treatment effects. However, randomized trials in the United States have suffered from poor accrual for many interventions. Despite the fact that well-designed randomized clinical trials should be the gold standard, well-designed observational studies might be the only method of obtaining inferences concerning comparative effectiveness for some cancer interventions. In cancer research, one of the most commonly used databases for observational research is the linked SEER-Medicare database. SEER-Medicare data has provided useful measurements of the effectiveness of a number of cancer therapies. Algorithms for identifying relevant treatment and diagnosis codes using Medicare data are often based on clinical reasoning and scientific evidence. One group of researchers, for example, developed an algorithm for identifying laparoscopic surgery among kidney cancer cases before claims codes for laparoscopic surgery were well developed. While such algorithms are useful for others pursuing similar investigations, there may still be substantial mismatch between treatment identified by the SEER cancer registry and treatment identified through Medicare claims. In this work, we propose developing a rigorous machine learning algorithm that can help researchers in better identifying treatments in Medicare claims data. Specifically, we will design a neural language modeling algorithm and implement a software system that finds vector representations of diagnosis and procedure codes. We plan on using the neural language modeling algorithm to learn vector representations from SEER- Medicare claims data where related procedure and diagnosis codes are "neighbors" (i.e. closely related). We will investigate whether the codes we identify within neighborhoods correspond to the procedure codes used for published SEER-Medicare studies. We will then design a software assistant interface that will allow an investigator to explore which codes are related to a given seed of diagnosis or procedure codes. Finally, we will investigate the sensitivity and specificity of the algorithm by comparing procedures identified using Medicare claims with procedures listed in the SEER database. We will replicate analyses from a published SEER-Medicare paper to investigate if estimated treatment effects differ when using our novel algorithm compared to using the algorithm in the published paper.
Publications
Benefits versus drawbacks of delaying surgery due to additional consultations in older patients with breast cancer.
Authors: Egleston B.L.
, Bleicher R.J.
, Fang C.Y.
, Galloway T.J.
, Vucetic S.
.
Source: Cancer Reports (hoboken, N.j.), 2023-03-21 00:00:00.0; , p. e1805.
EPub date: 2023-03-21 00:00:00.0.
PMID: 36943210
Related Citations
Using Pointwise Mutual Information for Breast Cancer Health Disparities Research With SEER-Medicare Claims.
Authors: Egleston B.L.
, Chanda A.K.
, Bai T.
, Fang C.Y.
, Bleicher R.J.
, Vucetic S.
.
Source: Methodology : European Journal Of Research Methods For The Behavioral & Social Sciences, 2023; 19(1), p. 43-59.
EPub date: 2023-03-31 00:00:00.0.
PMID: 37090814
Related Citations
MedCV: An Interactive Visualization System for Patient Cohort Identification from Medical Claim Data.
Authors: Chanda A.K.
, Bai T.
, Egleston B.L.
, Vucetic S.
.
Source: Proceedings Of The ... Acm International Conference On Information & Knowledge Management. Acm International Conference On Information And Knowledge Management, 2022 Oct; 2022, p. 4828-4832.
EPub date: 2022-11-04 00:00:00.0.
PMID: 36636516
Related Citations
Statistical inference for natural language processing algorithms with a demonstration using type 2 diabetes prediction from electronic health record notes.
Authors: Egleston B.L.
, Bai T.
, Bleicher R.J.
, Taylor S.J.
, Lutz M.H.
, Vucetic S.
.
Source: Biometrics, 2020-07-22 00:00:00.0; , .
EPub date: 2020-07-22 00:00:00.0.
PMID: 32700317
Related Citations
Medical Concept Representation Learning from Multi-source Data.
Authors: Bai T.
, Egleston B.L.
, Bleicher R.
, Vucetic S.
.
Source: Ijcai : Proceedings Of The Conference, 2019 Jul; 2019, p. 4897-4903.
PMID: 32116463
Related Citations
EHR phenotyping via jointly embedding medical concepts and words into a unified vector space.
Authors: Bai T.
, Chanda A.K.
, Egleston B.L.
, Vucetic S.
.
Source: Bmc Medical Informatics And Decision Making, 2018-12-12 00:00:00.0; 18(Suppl 4), p. 123.
EPub date: 2018-12-12 00:00:00.0.
PMID: 30537974
Related Citations
Interpretable Representation Learning for Healthcare via Capturing Disease Progression through Time.
Authors: Bai T.
, Egleston B.L.
, Zhang S.
, Vucetic S.
.
Source: Kdd : Proceedings. International Conference On Knowledge Discovery & Data Mining, 2018 Aug; 2018, p. 43-51.
PMID: 31037221
Related Citations
On the Use of Summary Comorbidity Measures for Prognosis and Survival Treatment Effect Estimation.
Authors: Gilbert E.A.
, Krafty R.T.
, Bleicher R.J.
, Egleston B.L.
.
Source: Health Services & Outcomes Research Methodology, 2017 Dec; 17(3-4), p. 237-255.
EPub date: 2017-06-21 00:00:00.0.
PMID: 29176931
Related Citations
Joint Learning of Representations of Medical Concepts and Words from EHR Data.
Authors: Bai T.
, Chanda A.K.
, Egleston B.L.
, Vucetic S.
.
Source: Proceedings. Ieee International Conference On Bioinformatics And Biomedicine, 2017 Nov; 2017, p. 764-769.
EPub date: 2017-12-18 00:00:00.0.
PMID: 29375929
Related Citations