Lavan Bathija

Data Scientist
Atlanta, Georgia

Summary

Experienced Data Scientist who builds machine learning models on large datasets to solve complex analytics and prediction problems. Proficient in machine learning, data mining, deep learning, computer vision, pattern recognition, and information retrieval. Specialized in natural language processing, with a particular interest in transformer language models such as BERT, attention-based algorithms, and social media data mining. Adaptable across diverse projects and committed to delivering high-quality, production-level code. Seeking roles in applied data science, machine learning, natural language understanding, and algorithmic problem-solving.

Overview

4 years of professional experience
4 years of post-secondary education

Work History

Data Scientist

International Consulting Associates
Arlington, WA
03.2022 - Current
  • Led development of an advanced ensemble NLP model for precise problem code assignment in FDA Medical Device Reports (MDRs), enhancing device performance monitoring and safety identification.
  • Engineered a biostatistical solution using the Likelihood Ratio Test (LRT) methodology to identify significant adverse event rates, improving real-time safety monitoring.
  • Spearheaded the International Data initiative, analyzing data from global regulatory bodies to expand surveillance scope and uncover unique issues.
  • Directed a Reddit web-scraping project to collect data on medical device malfunctions and side effects, providing insights for the FDA's approval criteria and manufacturer accountability.
  • Developed and deployed data processing pipelines and large-scale machine learning models for various projects, enabling discovery of novel issues.

Research Assistant

Intelligent Information Systems Laboratory, PSU
State College, PA
05.2021 - 12.2021
  • Worked on transformer-based language models such as BERT and RoBERTa to predict and analyze trends in COVID-19 vaccine hesitancy from Twitter data.
  • Published a research paper at the International Conference on Pattern Recognition 2022 (ICPR 2022) based on the results of this project.

Data Science Intern

Globus Housing Limited
, WY
03.2020 - 09.2020
  • Collaborated with the data science team to develop ensemble models (Random Forests, XGBoost, Regression) to predict changing trends in housing prices over a span of 10 years.

Education

Bachelor of Science - Computational Data Science

Pennsylvania State University
University Park, PA
08.2017 - 12.2021

Skills

Deep Learning

Publications

AAAI 2022 - Sentiment and Stance Detection in Tweets related to Covid-19 Vaccination.

  • The primary purpose of this paper was to develop a system that classifies the stance and sentiment of tweets related to Covid-19 vaccination.
  • The paper presents a detailed study of how the stance towards vaccination changed over the course of the coronavirus pandemic, using results generated by deep learning ensemble models consisting of BERT, RoBERTa, and COVID-Twitter-BERT.

Links

GitHub: https://github.com/lavanbth99

Kaggle: https://www.kaggle.com/lavanbth99

Timeline

Data Scientist

International Consulting Associates
03.2022 - Current

Research Assistant

Intelligent Information Systems Laboratory, PSU
05.2021 - 12.2021

Data Science Intern

Globus Housing Limited
03.2020 - 09.2020

Bachelor of Science - Computational Data Science

Pennsylvania State University
08.2017 - 12.2021