Santosh Adabala — ML Engineer
Open to WorkCurrently Learning: RLHF (Reinforcement Learning with Human Feedback) & Reward Modeling

Santosh Adabala

ML Engineer • LLM Specialist • AI Systems Architect

0+
Years Experience
0M+
Records Processed
0x
Inference Speedup
$0M+
Fraud Flagged
Scroll
PyTorchHuggingFaceLangChainPySparkDockerKubernetesAWSMLflowONNX RuntimeFastAPITerraformPrometheusTensorFlowscikit-learnDatabricksOpenTelemetry
PyTorchHuggingFaceLangChainPySparkDockerKubernetesAWSMLflowONNX RuntimeFastAPITerraformPrometheusTensorFlowscikit-learnDatabricksOpenTelemetry

About Me

Building ML systems that make LLMs production-ready

IbuildMLsystemsthatmakelargelanguagemodelsfaster,smaller,andproduction-ready.With5+yearsofexperienceapplyingML,NLP,andpredictiveanalyticstohealthcareandinsurancedata,Ispecializeinclinicaltextanalysis,modelcompression,andproductionMLpipelines.

distillation_pipeline.py

Impact Metrics

110M→65M
Model Compression
3.5x
Inference Speed
93.2%
F1 Retained
4,350/s
Docs Processed

Education

M.S. Data Science — CU Boulder

B.S. ECE — JNTU Kakinada

View Degree

Location

Colorado, United States

🟢 Available for opportunities

Awards

2023 DataScience Hackathon

Extra Mile Award (Team & Individual)

🧠

Currently Learning

RLHF (Reinforcement Learning with Human Feedback) & Reward Modeling

System Studio

Choose a real problem, tune the priority, and see the architecture tradeoffs behind the work

Latency-sensitive healthcare ML

Clinical NLP Compression

Make a clinical entity model smaller and faster without giving up useful accuracy.

Clinical Text1Teacher Model2Distillation3Optimized Serve4Monitoring5
Clinical Text
Stage 1/5

Validate schema, source quality, and entity coverage before training.

Optimization Focus

Latency39ms11ms
Model Size411MB62.6MB
F1 Retainedbaseline93.2%

Decision

Serve the compact model first

Prioritize the distilled model and keep the larger model as a quality reference.

Result

39ms to 11ms inference with a much smaller artifact.

92% confidence

Production judgment: smaller models, measured tradeoffs, and deployment-aware evaluation.

Open Project

Skills & Expertise

An interactive constellation of technologies — drag, hover, and filter

Drag nodes to rearrange · Hover for details · Filter by category

Featured Projects

Recruiter mode emphasizes business impact, metrics, and project proof.

LLM Architecture
Details

Wisdom Vault — The Inheritable Agent

An AI-powered inheritance system that lets a parent's decision patterns be inherited through cryptographically scoped tokens via Auth0 Token Vault. Uses Claude for wisdom extraction and multi-generational delegation with 2-of-3 trustee multi-sig.

PythonFlaskAuth0Claude APIJWT+1
LLMRAG Pipeline
Details

Agentic AI Parenting Assistant

An AI-driven modular Parenting Agent built with Google's Agent Development Kit (ADK) and FatSecret integration. Features specialized sub-agents for parenting advice, nutrition meal planning, and basic medical guidance with stateful sessions.

PythonGoogle ADKFatSecret APILangChainAgentic AI
The patient tookAspirinforheadacheDRUGDISEASEGENENamed Entity Recognition
Details

Clinical NLP — Healthcare NER Pipeline

End-to-end ML engineering project for clinical Named Entity Recognition. Distilled Bio_ClinicalBERT (110M) into DistilClinicalBERT (65M) retaining 93.2% F1, achieving 3.5x faster inference (39ms→11ms) and 6.6x model compression (411MB→62.6MB). Built distributed processing on AWS EMR handling 900K records in 3.5 minutes at 4,350 docs/sec, with weak labeling over 7K PubMed abstracts generating 19.5K entities.

PyTorchHuggingFaceONNX RuntimePySparkAWS EMR+3
LLM Architecture
Details

Cold Email Generator

A cold email generator for service companies built with Groq, LangChain, and Streamlit. Extracts job listings from career pages and crafts personalized emails with relevant portfolio links from a vector database.

GroqLangChainStreamlitChromaDBPython
Q·KQ·KQ·KFeed ForwardSoftmax OutputTransformer Block
Details

SemEval-2024 Task 8a — AI Text Detection

Research project for SemEval-2024 on detecting machine-generated text. Experimented with DistilBERT, DeBERTa, RoBERTa, and ALBERT on 119K+ samples. DeBERTa tokenizer + RoBERTa model achieved best results.

PyTorchHuggingFaceDeBERTaRoBERTaNLP
DataTrainEvalDeploymonitoring & feedbackMLOps Pipeline
Details

Space Data Analytics Dashboard

Comprehensive analytics project covering 4,630+ space missions from 1957–2022. Features interactive Power BI dashboards analyzing launch trends, mission success rates, rocket usage, and global cooperation patterns.

Power BIDAXPythonData Modeling
DataTrainEvalDeploymonitoring & feedbackMLOps Pipeline
Details

PHI/PII Parser — FHIR Data Redaction

A service that reads HL7 FHIR Bundle JSON files from AWS S3, detects and redacts PII/PHI fields using key-name matching and regex patterns, and outputs cleaned CSVs. Supports both FastAPI local deployment and serverless AWS Lambda with automatic S3 triggers.

PythonFastAPIAWS LambdaS3Docker+1
DataTrainEvalDeploymonitoring & feedbackMLOps Pipeline
Details

Data-Center Scale Computing

Projects focused on distributed computing at data-center scale, implementing scalable data processing pipelines and cloud-native architectures for large-scale ML workloads.

PythonPySparkAWSDistributed Systems

Experience

My career journey in ML and AI

Aug 2024 – PresentWork

Machine Learning Engineer

Blue Cross Blue Shield of Colorado

Used ClinicalBERT, Python, and Scikit-learn to analyze clinical notes, claims narratives, and prior authorization text, helping medical review teams find relevant clinical patterns faster. Built Python and SQL-based models to prioritize high-risk claims, predict likely denials, and surface high-cost cases. Built HIPAA-compliant data pipelines with Azure Data Factory, Azure Synapse, and Python to process claims from 6+ payer systems, reducing manual data preparation by 30%. Developed Power BI dashboards to track denial rates, turnaround time, and provider performance.

May 2023 – Aug 2023Work

Data Science Intern

Parlay (Techstars '23)

Engineered PySpark data lake architecture on AWS S3 with Apache Hudi, achieving 40% reduction in batch processing time for 500K+ daily records. Automated data ingestion and transformation pipelines using Python multiprocessing and SQLAlchemy. Designed REST APIs to standardize internal data access, reducing integration time for new data sources by 60%.

Jul 2019 – Aug 2022Work

Machine Learning Engineer

Accenture – Sun Life Insurance Client

Built predictive models with Random Forest, XGBoost, and Scikit-learn to analyze insurance claims, policyholder behavior, and risk patterns. Developed anomaly detection logic to flag suspicious billing activity and high-risk policy behavior. Prepared ML-ready datasets using Azure Data Factory, Azure Synapse, PySpark, and Python, improving ingestion throughput by 50% and reducing reporting costs by 31%. Deployed Azure Databricks pipelines with data quality checks, logging, and SLA monitoring.

2023Award

2023 DataScience Hackathon

Hackathon Award

Recognized for building innovative data science solutions. Also received Extra Mile Awards in both team and individual categories at Accenture.

Certifications

Professional credentials and continuous learning

AWS

AWS Certified Machine Learning Engineer - Associate

Amazon Web Services· Mar 2026 · Expires Mar 2029

View Credential
AWS

AWS Certified Solutions Architect

Amazon Web Services

View Credential
N

Applications of AI for Anomaly Detection

NVIDIA· Nov 2024

View Credential

The Structured Query Language (SQL)

University of Colorado Boulder· May 2023

Data Analysis with R Programming

Google· May 2024

Share Data Through the Art of Visualization

Google· May 2024

Analyze Data to Answer Questions

Google· May 2024

Process Data from Dirty to Clean

Google· May 2024

Ask Questions to Make Data-Driven Decisions

Google

View Credential

Prepare Data for Exploration

Google

View Credential

Introduction to Programming and Tidyverse

University of Colorado Boulder· Aug 2021

R Programming and Tidyverse Capstone Project

University of Colorado Boulder· Sep 2022

Data Analysis with Tidyverse

University of Colorado Boulder· Aug 2022

U

The Ultimate MySQL Bootcamp

Udemy

View Credential

GitHub Activity

Live stats and contributions

12Stars
3Forks
0Repos

Currently Building

AlignLLM — LLM Alignment Pipeline on AWS

Building a 7B chat model alignment pipeline using SFT, DPO, RLHF, and LoRA/QLoRA on AWS SageMaker with Terraform infrastructure.

Progress45%
Follow progress

Top Languages

Python
92%
SQL
85%
HTML
70%
Java
65%
Jupyter Notebook
55%

Contribution Graph

JanFebMarAprMayJunJulAugSepOctNovDec
LessMore

Get in Touch

Let's connect and discuss opportunities

Send a Message

What People Say

Santosh is great at taking a messy ML problem and turning it into something that actually works in production. He helped us rethink our model compression approach and was always willing to dig into the details when things did not go as expected. Solid engineer, easy to work with.

DC

David Chen

ML Engineering Lead, Blue Cross Blue Shield Association