Open to Research & Full-time ML/AI Roles

Siddharth
Raj.

ML/AI Researcher  ·  MS CS @ University of Chicago  ·  Ex-Walmart Global Tech

I build ML systems that are both theoretically grounded and production-ready. My work spans RLHF alignment pipelines, geometric computer vision with transformers, and high-throughput NLP systems.

3.8/4.0
MS GPA · UChicago
3+ yrs
Industry ML/SWE
6+
ML Research Projects
Top 5%
Codeforces Globally
12th
ACM ICPC NA Regionals

Who I Am

I'm a Master's student in Computer Science (AI Specialization) at the University of Chicago, with 3+ years of industry experience as a Software Engineer III at Walmart Global Tech building production LLM systems and distributed backends.

My research interests live at the intersection of geometric computer vision (injecting epipolar priors into transformer matchers), LLM alignment (RLHF pipelines, reward modeling, PPO), and efficient ML systems (LoRA, ONNX deployment, MLOps at scale).

I hold a B.Tech from IIT BHU and am a Candidate Master on Codeforces — strong algorithmic foundations shape how I think about every problem I tackle.

I care about work that bridges the gap between research insight and real-world impact. Whether that's getting an RLHF pipeline to run on a single consumer GPU, or diagnosing a confidence distribution shift in a fine-tuned vision transformer, I like to understand systems deeply.

University of Chicago
MS Computer Science — AI Specialization
GPA 3.8 / 4.0  ·  Sep 2025 – Dec 2026
Algorithms  ·  Machine Learning
Distributed Systems  ·  Parallel Programming
Indian Institute of Technology, BHU
B.Tech
GPA 8.7 / 10.0  ·  Jun 2018 – May 2022

What I've Built
RLHF Fine-Tuning
▶ View Project
LLM Alignment · Reinforcement Learning
RLHF Fine-tuning Pipeline for Llama-2-7B

End-to-end 3-stage alignment pipeline: SFT → Reward Model (Bradley-Terry pairwise ranking, 82% human agreement) → PPO optimization with KL-divergence penalty to prevent reward hacking. LoRA adapters cut trainable params from 6.7B to 33M (0.5%), enabling single-GPU training. W&B tracking throughout.

+35% safety score −70% GPU memory 82% human agreement +20% output diversity
PyTorch HuggingFace TRL LoRA / PEFT PPO Llama-2-7B W&B
Epipolar match output
▶ View Report
Computer Vision · Transformers · Geometry
Epipolar Geometry in MatchFormer

Injected differentiable epipolar constraints into a transformer-based feature matcher via a soft fundamental-matrix mask on the confidence matrix. Fine-tuned with focal loss + LoRA adapters on ScanNet; diagnosed confidence distribution shift post-training — threshold recalibration to 0.10 recovered 436 matches/pair while preserving geometric precision.

−94% reprojection error +18% Precision@3px 546× over baseline
PyTorch LoRA ScanNet PyTorch Lightning Fundamental Matrix Focal Loss
NashCab NYC idle driver heatmap
▶ Live Demo
Game Theory · Data Science · NYC Data
NashCab — Nash Equilibrium in Ride-Sharing

Modeled Uber/Lyft as a Mean-Field Game over 259 NYC TLC zones using real HVFHV trip data. Backward induction + fictitious play finds the Nash equilibrium in 166 iterations — drivers at JFK queue for high-fare Manhattan runs. Interactive map shows driver redistribution and hourly demand across the city.

PoA = 1.52 589K rides 166 iters 10,979 drivers
Python D3.js Mean-Field Game Backward Induction NYC TLC Data Fictitious Play
CNN Visualizer
▶ Live Demo
Computer Vision · MLOps · Deployment
Image Classification MLOps Pipeline

Custom PyTorch CNN on CIFAR-10 (BatchNorm + residual connections, Adam + StepLR). Exported to ONNX Runtime via FastAPI for production inference; deployed on AWS ECS via ECR. Grad-CAM explainability surfaced systematic animal misclassifications — targeted augmentation boosted accuracy 2.5%. Model and weights published to HuggingFace.

93% accuracy −40% inference latency HuggingFace published
PyTorch ONNX FastAPI Grad-CAM AWS ECS Docker
Feedback analyzer dashboard
▶ Live Demo
NLP · Topic Modeling · Analytics
NLP Customer Feedback Analyzer

4-stage pipeline over 10,000+ reviews at ~9,100 reviews/sec. Combines VADER lexical sentiment (69.3% star-rating agreement) with BERTopic — sentence transformers + UMAP dimensionality reduction + HDBSCAN clustering — to auto-discover 44 latent topics and extract 10 composite priority pain points. Served via a Flask analytics dashboard.

9,100 reviews/sec 44 auto-clusters 69.3% agreement
Python VADER BERTopic UMAP HDBSCAN Flask
Ray traced render
▶ Live Demo
Systems · Parallel Computing · Graphics
Parallel Ray Tracing Simulator

Multi-threaded ray tracer in Go implementing global illumination, reflections, and shadows. Work-stealing scheduler with lock-free deques distributes tile partitions across 15 cores for load balancing under variable render costs. Profiled on UChicago Peanut cluster with Go's pprof — 12× speedup over sequential baseline.

12× speedup 15-core scaling Lock-free deque
Go Work-Stealing Lock-free pprof Concurrency
Mesh UV Parameterization
▶ Live Demo
Computational Math · Computer Graphics
Procedural Physics Simulation & Mesh Parameterization

Built LSCM UV parameterization from scratch using sparse Cauchy-Riemann equations over bmesh topology — mathematically optimal angle-preserving mapping for curved surfaces. Modeled multi-body collision dynamics via quadratic B-Splines with C0 knot multiplicity at impact points. All geometry, shader nodes, and particle systems generated programmatically via the bpy API.

LSCM from scratch B-Spline physics 100% procedural
Python Blender (bpy) LSCM B-Splines Procedural Shaders Sparse Matrices

Where I've Worked
University of Chicago
Software Engineer — Computing Services
Oct 2025 – Present
Chicago, IL
  • Shipped a speech-to-text research platform on AWS Amplify (Next.js, Python) enabling researchers to train and deploy custom language models; NIST 800-171 compliance; GraphQL APIs over S3 + DynamoDB; Okta / Grouper / Cognito SSO
  • Built multi-level auth and authorization workflows via AWS Cognito + Okta securing 200+ university staff across role-based permission tiers (Inventory Management Platform, DynamoDB)
  • Built Python middleware syncing JIRA with SolarWinds via webhooks and CrowdStrike via APIs, automating IT operations event routing
Walmart Global Tech
Software Engineer III
Jun 2022 – Aug 2025
Bengaluru, India
  • Developed an internal AI chatbot using GPT-4o, LangGraph, and Milvus (RAG) — reduced seller support tickets by 30%; Python backends + React UI, end-to-end
  • Built an Elasticsearch Translation Memory System using KNN + fuzzy search — cut translation costs 30% and enabled 80% localization coverage across seller apps
  • Designed a Redis caching layer over MongoDB — dropped P95 API latency from 800ms to under 200ms across 10+ microservices
  • Parallelized seller onboarding Java workflows — 36× faster (3 days → 2 hours)
  • Designed 10+ Spring Boot microservices with Kafka pipelines (DLQs, zero message loss); mentored 3 junior engineers on production debugging and RAG design
  • Presented "WICR: Smarter and Contextual Language Translation Reviews" at Walmart's internal research summit — top 25 of 800 submissions

Technical Stack
ML / DL Frameworks
PyTorch TensorFlow HuggingFace TRL PEFT / LoRA Transformers Scikit-learn XGBoost PySpark Grad-CAM ONNX Runtime
LLMs & AI Systems
LangChain LangGraph RAG GPT-4o Llama-2 Milvus Pinecone Weights & Biases
Languages
Python C++ Go Java JavaScript C
Cloud & Infrastructure
AWS GCP Azure Docker Kubernetes Kafka Elasticsearch Redis
Databases
MongoDB PostgreSQL DynamoDB Milvus Azure Cosmos DB CassandraDB
Research Methods
RLHF / PPO Fine-tuning Epipolar Geometry Topic Modeling Explainability (XAI) Parallel Computing

Achievements
🏆
Candidate Master — Codeforces
Top 5% of competitive programmers globally. Strong foundations in algorithms, combinatorics, graph theory, and mathematical problem-solving that inform how I approach ML research problems.
🥇
Ranked 12th — ACM ICPC North America Regionals
Top 12 finish among hundreds of teams at one of the most prestigious collegiate programming competitions in North America.
📄
Research Paper — Walmart Global Tech Innovation Summit
"WICR: Smarter and Contextual Language Translation Reviews" — selected among top 25 of 800 submissions for innovation and real-world impact on localization at scale.

Let's Talk

I'm open to research collaborations, full-time ML/AI roles, and interesting conversations about alignment, geometric vision, and building systems that actually work.