01 about

A little about me

Bhumish, off the clock

Hi, I'm Bhumish. I'm a CS student at UT Dallas, focused on machine learning and full-stack applications. My time so far has been split across research, side projects, and a bit of open source.

This past year I worked with Dr. Vincent Ng on multimodal misinformation, building the annotation tooling and a 10,000-pair image-text benchmark for evaluating vision-language models on misleading content.

Right now I'm a software engineering intern at FloCareer, working on generative face-replacement and a data-anonymization pipeline. On the side I build products like Cursorly and TryEZ. Outside school, I'm probably playing badminton or practicing guitar.

16+ projects built
8 roles held
1 IEEE paper

02 education

Where I study

UT Dallas

The University of Texas at Dallas

B.S. Computer Science, GPA 3.5/4.0, expected May 2027

coursework

Advanced data structures & algorithms, machine learning, linear algebra, probability & statistics, operating systems, databases, discrete math.

around campus

Machine Learning Researcher at ACM, AI Engineer at AI Society, VP Project Development at GDSC, and president of the UTD badminton club.

certifications

Building Language Models on AWS (AWS), Google AI Essentials (Google), and Build an AI Agent (IBM SkillsBuild).

03 experience

What I've worked on

  1. Software Engineering Intern

    Jun 2026 to present
    FloCareer · Remote

    Building a generative face-replacement stage that swaps identity while preserving expression, gaze, and head pose across a 1M-recording, 750K-hour corpus, keeping the behavioral signal intact for training. Architecting the 8-stage anonymization pipeline across Whisper, pyannote, Presidio NER, and ARX k-anonymity, with modeled compute cost of $25K to $165K for the core pipeline, down from a $0.9M+ baseline.

  2. AI Engineering Extern

    Jan 2026 to May 2026
    Pfizer

    Built a retrieval-augmented generation system over 5,000+ clinical-supply documents, chunking and embedding them into a vector database with LlamaIndex for natural-language search across pharmaceutical PDFs. Added an ensemble OCR pipeline on Tesseract, PaddleOCR, and EasyOCR that pulls structured fields from messy labels and logistics forms, served through a Gradio chatbot for real-time document Q&A.

  3. Open Source Contributor

    Jun 2026 to present
    Social Summer of Code (SSoC)

    Contributing to a production open-source codebase through Social Summer of Code, one of the world's biggest open-source programs. 15+ pull requests merged so far, shipping features, bug fixes, and refactors reviewed by maintainers, plus issue triage and code review across the repo.

  4. Artificial Intelligence Researcher

    Aug 2025 to Dec 2025
    UT Dallas · Dr. Vincent Ng

    Built the Python annotation tooling and labeling workflow for a multimodal image-text misinformation benchmark, enforcing schema-validated labels and consistency across annotators. Engineered end-to-end dataset collection, validation, and versioning over 10,000+ image-text pairs into a reusable benchmark for evaluating vision-language models.

  5. Full Stack Engineer

    May 2025 to Dec 2025
    Nebula Labs

    Shipped the React and Next.js front-end for the Trends platform, which reached 12,000+ unique users, with 1,600+ active users during peak course registration. Split heavy UI into route-level chunks and optimized asset delivery, cutting page load times by 35%.

  6. Machine Learning Researcher

    Jan 2025 to May 2025
    ACM UTD

    Pushed Galaxy Zoo morphology accuracy +20% by fine-tuning VGG16/19 in TensorFlow with transfer learning and staged unfreezing. Built a repeatable ablation harness that ran 20+ controlled experiments across 6 unfreezing schedules and 10+ hyperparameter variants. Published in IEEE AIIoT with 15+ auto-generated plots.

  7. AI Engineer

    Aug 2024 to Dec 2024
    AI Society at UTD

    Built an Instagram Q&A chatbot with Llama and Flask that answered event questions from posts, cutting manual lookups by 55%. Shipped the ingestion stack that ETL'd 2,000+ posts with BeautifulSoup, and mentored 100+ users on LangChain and Flask service patterns.

  8. VP Project Development

    Aug 2024 to Dec 2024
    GDSC

    Ran AI/ML and cloud computing workshops for 40+ people. Helped 100+ club members get unstuck on Google Cloud, G Suite, and Android SDK, and mentored peers on LangChain, Flask microservices, and NLP integration.
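
The FloCareer entry above mentions k-anonymity as one stage of the anonymization pipeline. As a rough illustration of the idea only: a dataset is k-anonymous when every combination of quasi-identifier values is shared by at least k records. The sketch below is plain Python, not ARX and not the pipeline's actual code, and the field names and records are hypothetical.

```python
from collections import Counter


def smallest_group_size(records, quasi_identifiers):
    """Size of the smallest group of records sharing the same quasi-identifier values."""
    groups = Counter(
        tuple(record[q] for q in quasi_identifiers) for record in records
    )
    # An empty dataset has no groups; report 0 rather than raising.
    return min(groups.values()) if groups else 0


def is_k_anonymous(records, quasi_identifiers, k):
    """True when every quasi-identifier combination appears in at least k records."""
    return smallest_group_size(records, quasi_identifiers) >= k


# Hypothetical, already-generalized records (ages bucketed into bands).
records = [
    {"age_band": "20-29", "region": "TX", "role": "engineer"},
    {"age_band": "20-29", "region": "TX", "role": "analyst"},
    {"age_band": "30-39", "region": "CA", "role": "engineer"},
    {"age_band": "30-39", "region": "CA", "role": "designer"},
    {"age_band": "30-39", "region": "CA", "role": "analyst"},
]

quasi_identifiers = ["age_band", "region"]
print(smallest_group_size(records, quasi_identifiers))
print(is_k_anonymous(records, quasi_identifiers, k=2))
```

Here the smallest group (the two 20-29/TX records) sets the achievable k, so the sample passes for k=2 and fails for k=3. Tools like ARX go further by searching for the generalization that reaches a target k with the least information loss.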

04 stack

What I work with

languages

  • Java
  • Python
  • Rust
  • C#
  • JavaScript
  • TypeScript
  • C
  • C++
  • SQL
  • HTML
  • CSS

frameworks

  • React
  • Next.js
  • Node.js
  • FastAPI
  • PyTorch
  • TensorFlow
  • Scikit-learn
  • NumPy
  • Pandas

tools & infra

  • Git
  • Docker
  • Kubernetes
  • AWS
  • Azure
  • Linux
  • PostgreSQL
  • MongoDB
  • HuggingFace

05 projects

Things I've built

  1. 03

    TryEZ

    Diffusion · LoRA · IP-Adapter · Modal
    B2B virtual try-on engine handling 10K+ try-ons. Renders garments onto user photos at 768x1024 through a Stable Diffusion + IP-Adapter inpainting pipeline driven by a from-scratch LoRA trained on Kaggle T4x2 GPUs, served on Modal GPUs behind a FastAPI and Neon backend that processes photos in memory with zero storage.
    diffusion · lora · try-on · live demo
  2. 04

    FrameSeek

    FastAPI · React · Whisper · CLIP
    Search inside a video for the exact moment you want. Scene-aware indexing plus hybrid text and visual retrieval over Whisper transcripts and CLIP keyframes returns precise timestamps with preview clips.
    vector-search · asr · video · live demo
  3. 05

    Windows Caps Shortcut

    AutoHotkey · Python
    Global hotkeys to fix the case of selected text on Windows. Smart Repair preserves brand names and acronyms; also covers Title Case, snake_case, camelCase, and toggle. Works in any app that supports copy/paste.
    autohotkey · hotkeys · windows · github
  4. 06

    StegnoKit

    Python · AES-GCM · LSB
    Hides encrypted payloads inside PNG and WAV files using keyed, PRNG-permuted LSB placement plus AES-256-GCM for integrity.
    steganography · aes-gcm · png · wav · live demo
  5. 07

    Subnautix

    C++ · Rust · Python
    AUV navigation stack with weighted A* path planning, online replanning, and a Kalman filter for state estimation.
    kalman · a-star · controls · github
  6. 08

    ArticulateAI

    Whisper · MFCC · FastAPI
    Speech-scoring service that transcribes you and grades your grammar and pronunciation with per-phoneme feedback.
    asr · speech · nlp · github
  7. 09

    ReelForge

    Python · ffmpeg · TTS
    Takes a topic and outputs a finished short-form video. Script, voiceover, captions, and b-roll, all in one run.
    video · tts · ffmpeg · github
  8. 10

    TradeSage

    Python · LLM
    Stock research assistant. Pulls filings, news, and price data, then uses an LLM to draft a quick fundamentals brief so I can skim before a deeper dive.
    finance · rag · llm · github
  9. 11

    Reactify

    Python · NLP
    Reddit sentiment analysis on a topic of your choice. Surfaces what people actually think instead of just what gets upvoted to the top.
    nlp · reddit · sentiment · github
  10. 12

    OutreachMate

    n8n · Python
    Lead sourcing, enrichment, and outreach automation in one workflow. Built after I got tired of doing it by hand.
    automation · scraping · email · github
  11. 13

    CatalogIQ

    Python · Search
    Product catalog search that understands what you actually mean. Embeddings over product copy plus a small reranker.
    search · embeddings · e-commerce · github
  12. 14

    Gotta Catch 'em All

    Python · CV
    A small computer-vision project that classifies Pokémon from images. Mostly an excuse to play with image augmentation.
    cnn · classification · fun · github
  13. 15

    QuizifyAI

    Python · LLM
    Drop in any text or PDF, get clean MCQs out. Built it the night before a study session and kept improving it after.
    education · llm · pdf · github
  14. 16

    AskDoc

    Python · RAG
    Ask questions over your own documents. Vector search plus an LLM, with citations so you can actually trust the answer.
    rag · embeddings · qna · github
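
The StegnoKit entry above describes keyed, PRNG-permuted LSB placement. A minimal sketch of that idea follows, under stated assumptions: the carrier is a plain byte string standing in for decoded pixel or sample data, the key is an integer seed, and the function names are hypothetical. It uses Python's `random` module, which is not cryptographically secure, and it skips the AES-256-GCM step entirely, so it illustrates the placement technique rather than StegnoKit's implementation.

```python
import random


def _positions(key, carrier_len, n_bits):
    """Pick n_bits distinct carrier indices in an order determined by the key."""
    if n_bits > carrier_len:
        raise ValueError("payload too large for carrier")
    rng = random.Random(key)  # illustration only: not a cryptographic PRNG
    return rng.sample(range(carrier_len), n_bits)


def embed(carrier: bytes, payload: bytes, key: int) -> bytes:
    """Write payload bits (most significant first) into key-selected LSBs."""
    bits = [(byte >> shift) & 1 for byte in payload for shift in range(7, -1, -1)]
    out = bytearray(carrier)
    for pos, bit in zip(_positions(key, len(out), len(bits)), bits):
        out[pos] = (out[pos] & 0xFE) | bit  # clear the LSB, then set it to the bit
    return bytes(out)


def extract(carrier: bytes, n_bytes: int, key: int) -> bytes:
    """Read n_bytes back by visiting the same key-selected positions."""
    positions = _positions(key, len(carrier), n_bytes * 8)
    out = bytearray()
    for i in range(0, len(positions), 8):
        value = 0
        for pos in positions[i:i + 8]:
            value = (value << 1) | (carrier[pos] & 1)
        out.append(value)
    return bytes(out)


carrier = bytes(range(256))  # stand-in for raw pixel or audio sample bytes
secret = b"hi there"
stego = embed(carrier, secret, key=1234)
recovered = extract(stego, len(secret), key=1234)
print(recovered == secret)
```

Because only the least significant bit of each selected byte changes, every carrier byte moves by at most 1, and without the key a reader does not know which bytes carry the payload or in what order. A real tool would also store the payload length and would encrypt the payload before embedding.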
06 contact

Want to chat?

I read everything that lands in my inbox. The fastest way to reach me is email. If it's about an internship, recruiting, or just an interesting project, please send it over.

email: bhumish.dayal3@gmail.com · linkedin: bhumish-dayal · github: BhumishDayal