Summary
Overview
Work History
Education
Skills
Certification
Projects
Websites
Timeline
Generic

Habeeb Ajibola

Conyers

Summary

Dynamic data analyst with a proven track record at ICARE Clinic, leveraging advanced AI techniques and deep learning frameworks to drive actionable insights. Expert in data quality and cleaning strategies, I excel in synthesizing complex data into impactful visualizations, fostering collaboration, and enhancing decision-making processes. Passionate about innovation and continuous improvement.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Graduate Research Assistant

Georgia State University
Atlanta
01.2025 - 06.2025
  • Analyzed over 105,000 patent abstracts using advanced NLP techniques to extract and model topics.
  • Evaluated GloVe and BERT word embedding strategies for topic coherence in technical patent corpora.
  • Implemented hierarchical topic modeling with seed word guidance to refine domain-specific themes.
  • Generated actionable insights for innovation trend mapping and generative AI applications.

Data Analyst & IT Intern

ICARE Clinic & MedSpa LLC
Decatur
03.2022 - 06.2024
  • Extracted, cleaned, and analyzed patient data through Tebra’s reporting modules to identify trends in provider utilization and no-show rates.
  • Generated automated audit trail reports to support HIPAA compliance and system security audits.
  • Synthetized insights on ROI and digital engagement using Tebra’s Performance Dashboard for executive performance tracking.
  • Created condition-specific patient cohorts via SQL-like queries to enhance population health analysis and quality improvement initiatives.

Education

Master of Science - Computer Information Systems (Big Data Analytics)

Georgia State University
Atlanta, GA
07.2025

Bachelor of Science - Computer Science

Troy State University
Troy, AL
05.2024

Skills

  • Advanced AI techniques: Generative, Agentic, RAG systems, Transformer NLP
  • Deep learning frameworks
  • Synthetic data methodologies
  • Data quality and cleaning strategies
  • Analytics and machine learning architecture
  • Feature engineering with LLMs and GANs
  • Data storage methods: ETL, warehouses, lakes, lakehouses
  • Vector databases: Pinecone, Graph database
  • Cloud platforms: AWS, GCP
  • Containerization: Docker, Kubernetes
  • Infrastructure as code: Terraform
  • Data processing: Pyspark, SQLite, Redshift, MongoDB
  • Data visualization: Power BI, Tableau, Matplotlib
  • Machine learning libraries: PyTorch, TensorFlow, NumPy, Pandas
  • Database management: SQL and NoSQL

Certification

  • AWS Cloud Practitioner
  • AWS Machine Learning Associate
  • AWS Solution Architect Associate

Projects

  • Ai-Powered Multi-Modal Deepfake Detection System, Developed and trained modality-specific detection engines—Vision Transformer for image analysis, ResNeXt-LSTM pipeline for video classification, and fine-tuned BERT for text—achieving F1 scores above 0.90 across modalities., Architected a cloud-native data pipeline on AWS, leveraging S3 for scalable storage, AWS Glue crawlers for automated schema cataloging, and Athena for serverless querying of over 500K images and 1,000+ videos., Developed user-centric visualization tools including a Graido interface for real-time testing and Power BI dashboards with Grad-CAM and temporal saliency maps for model interpretability.
  • Gen Ai Symptom Checker Application, Designed and implemented a retrieval-augmented GenAI system using LangChain and Pinecone to ground LLM responses in a structured medical knowledge base., Built and fine-tuned urgency-classification and symptom-inference models on a synthetic SQLite database of 5K+ patient records and 15K+ symptom reports, incorporating PII scrubbing and prompt engineering., Developed a Streamlit chat-style UI enabling conversational triage, achieving 75–80% overall accuracy and 100% recall on critical emergency scenarios during evaluation.
  • Ai-Driven Analysis of U.S National Parks, Collected and processed 20,000+ data points from the National Park Service API using R, performing data wrangling and feature engineering for predictive analysis., Developed and optimized machine learning (GBM) and deep learning (H2O) models to forecast visitor engagement and park popularity., Conducted feature selection, correlation analysis, and model tuning, delivering data-driven insights on visitor trends and seasonal demand to enhance park resource allocation and operations.

Timeline

Graduate Research Assistant

Georgia State University
01.2025 - 06.2025

Data Analyst & IT Intern

ICARE Clinic & MedSpa LLC
03.2022 - 06.2024

Master of Science - Computer Information Systems (Big Data Analytics)

Georgia State University

Bachelor of Science - Computer Science

Troy State University
Habeeb Ajibola