A little about me
Hi, I'm Bhumish. I'm a CS student at UT Dallas, focused on machine learning and fullstack applications. My time so far has been split across research, side projects, and a bit of open source.
This past year I worked with Dr. Vincent Ng on multimodal misinformation, building the annotation tooling and a 10,000-pair image-text benchmark for evaluating vision-language models on misleading content.
Right now I'm a software engineering intern at FloCareer, working on generative face-replacement and a data-anonymization pipeline. On the side I build products like Cursorly and TryEZ. Outside school, I'm probably playing badminton or practicing guitar.
Where I study
The University of Texas at Dallas
B.S. Computer Science, GPA 3.5/4.0 expected May 2027
coursework
Advanced data structures & algorithms, machine learning, linear algebra, probability & statistics, operating systems, databases, discrete math.
around campus
Machine Learning Researcher at ACM, AI Engineer at AI Society, VP Project Development at GDSC, and president of the UTD badminton club.
certifications
Building Language Models on AWS (AWS), Google AI Essentials (Google), and Build an AI Agent (IBM SkillsBuild).
What I've worked on
-
FFloCareer · Remote
Software Engineering Intern
Jun 2026 to presentBuilding a generative face-replacement stage that swaps identity while preserving expression, gaze, and head pose across a 1M-recording, 750K-hour corpus, keeping the behavioral signal intact for training. Architecting the 8-stage anonymization pipeline across Whisper, pyannote, Presidio NER, and ARX k-anonymity, modeling compute down to a $25K to $165K core from a $0.9M+ baseline.
-
PPfizer
AI Engineering Extern
Jan 2026 to May 2026Built a retrieval-augmented generation system over 5,000+ clinical-supply documents, chunking and embedding them into a vector database with LlamaIndex for natural-language search across pharmaceutical PDFs. Added an ensemble OCR pipeline on Tesseract, PaddleOCR, and EasyOCR that pulls structured fields from messy labels and logistics forms, served through a Gradio chatbot for real-time document Q&A.
-
SSocial Summer of Code (SSoC)
Open Source Contributor
Jun 2026 to presentContributing to a production open-source codebase through Social Summer of Code, one of the world's biggest open-source programs. 15+ pull requests merged so far, shipping features, bug fixes, and refactors reviewed by maintainers, plus issue triage and code review across the repo.
-
UT Dallas · Dr. Vincent NgArtificial Intelligence Researcher
Aug 2025 to Dec 2025Built the Python annotation tooling and labeling workflow for a multimodal image-text misinformation benchmark, enforcing schema-validated labels and consistency across annotators. Engineered end-to-end dataset collection, validation, and versioning over 10,000+ image-text pairs into a reusable benchmark for evaluating vision-language models.
-
Nebula LabsFull Stack Engineer
May 2025 to Dec 2025Shipped the React and Next.js front-end for the Trends platform, which reached 12,000+ unique users and 1,600+ active during peak course registration. Split heavy UI into route-level chunks and optimized asset delivery, cutting page load times by 35%.
-
ACM UTDMachine Learning Researcher
Jan 2025 to May 2025Pushed Galaxy Zoo morphology accuracy +20% by fine-tuning VGG16/19 in TensorFlow with transfer learning and staged unfreezing. Built a repeatable ablation harness that ran 20+ controlled experiments across 6 unfreezing schedules and 10+ hyperparameter variants. Published in IEEE AIIoT with 15+ auto-generated plots.
-
AI Society at UTDAI Engineer
Aug 2024 to Dec 2024Built an Instagram Q&A chatbot with Llama and Flask that answered event questions from posts, cutting manual lookups by 55%. Shipped the ingestion stack that ETL'd 2,000+ posts with BeautifulSoup, and mentored 100+ users on LangChain and Flask service patterns.
-
GDSCVP Project Development
Aug 2024 to Dec 2024Ran AI/ML and cloud computing workshops for 40+ people. Helped 100+ club members get unstuck on Google Cloud, G Suite, and Android SDK, and mentored peers on LangChain, Flask microservices, and NLP integration.
What I work with
languages
- Java
- Python
- Rust
- C#
- JavaScript
- TypeScript
- C
- C++
- SQL
- HTML
- CSS
frameworks
- React
- Next.js
- Node.js
- FastAPI
- PyTorch
- TensorFlow
- Scikit-learn
- NumPy
- Pandas
tools & infra
- Git
- Docker
- Kubernetes
- AWS
- Azure
- Linux
- PostgreSQL
- MongoDB
- HuggingFace
Things I've built
Velix
A visual-first retrieval pipeline that indexes document pages directly with ColQwen2 multi-vector embeddings in a Qdrant MaxSim index, skipping OCR so search survives handwriting, stamps, and 1800s manuscript scans. A confidence-gated 4-tier OCR fallback and schema-typed extraction with Qwen2.5-VL enforce domain rules as field-level constraints over a 3,844-page corpus, backed by 56 passing tests.
Cursorly
A Windows desktop AI agent that guides you with a live ghost cursor read off the UI Automation tree, not screenshots, driven by an always-replan loop of one typed Pydantic action per step over an OpenRouter LLM while holding under 0.1% idle CPU. FastAPI and Supabase backend with row-level security, JWT, SSRF guards, and 350+ Python and Rust tests.
-
03
TryEZ
Diffusion · LoRA · IP-Adapter · Modal B2B virtual try-on engine handling 10K+ try-ons. Renders garments onto user photos at 768x1024 through a Stable Diffusion + IP-Adapter inpainting pipeline driven by a from-scratch LoRA trained on Kaggle T4x2 GPUs, served on Modal GPUs behind a FastAPI and Neon backend that processes photos in memory with zero storage. -
04
FrameSeek
FastAPI · React · Whisper · CLIP Search inside a video for the exact moment you want. Scene-aware indexing plus hybrid text and visual retrieval over Whisper transcripts and CLIP keyframes returns precise timestamps with preview clips. -
05
Windows Caps Shortcut
AutoHotkey · Python Global hotkeys to fix the case of selected text on Windows. Smart Repair preserves brand names and acronyms; also covers Title Case, snake_case, camelCase, and toggle. Works in any app that supports copy/paste. -
06
StegnoKit
Python · AES-GCM · LSB Hides encrypted payloads inside PNG and WAV files using keyed, PRNG-permuted LSB placement plus AES-256-GCM for integrity. -
07
Subnautix
C++ · Rust · Python AUV navigation stack with weighted A* path planning, online replanning, and a Kalman filter for state estimation. -
08
ArticulateAI
Whisper · MFCC · FastAPI Speech-scoring service that transcribes you and grades your grammar and pronunciation with per-phoneme feedback. -
09
ReelForge
Python · ffmpeg · TTS Takes a topic and outputs a finished short-form video. Script, voiceover, captions, and b-roll, all in one run. -
10
TradeSage
Python · LLM Stock research assistant. Pulls filings, news, and price data, then uses an LLM to draft a quick fundamentals brief so I can skim before a deeper dive. -
11
Reactify
Python · NLP Reddit sentiment analysis on a topic of your choice. Surfaces what people actually think instead of just what gets upvoted to the top. -
12
OutreachMate
n8n · Python Lead sourcing, enrichment, and outreach automation in one workflow. Built after I got tired of doing it by hand. -
13
CatalogIQ
Python · Search Product catalog search that understands what you actually mean. Embeddings over product copy plus a small reranker. -
14
Gotta Catch 'em All
Python · CV A small computer-vision project that classifies Pokémon from images. Mostly an excuse to play with image augmentation. -
15
QuizifyAI
Python · LLM Drop in any text or PDF, get clean MCQs out. Built it the night before a study session and kept improving it after. -
16
AskDoc
Python · RAG Ask questions over your own documents. Vector search plus an LLM, with citations so you can actually trust the answer.
Nothing here matches that filter. Try another.
Want to chat?
I read everything that lands in my inbox. The fastest way to reach me is email. If it's about an internship, recruiting, or just an interesting project, please send it over.
email bhumish.dayal3@gmail.com linkedin bhumish-dayal github BhumishDayal