// research engineer

HARSH KHANDELWAL

focus: NLP & Generative AI · domain: Clinical AI Systems

Fine-tuning Large Language Models, knowledge distillation, and building efficient real-world medical AI systems that matter.

scroll
01

About

I'm an ML researcher pushing the boundaries of Natural Language Processing and Generative AI. My work lives at the intersection of cutting-edge research and real-world clinical applications.

Currently focused on fine-tuning Large Language Models, knowledge distillation techniques, and building AI systems that help medical professionals make better decisions. I believe the most impactful AI is the kind that works quietly in the background, augmenting human expertise rather than replacing it.

Beyond the terminal: history enthusiast, hiker, and traveler. The best ideas often come from stepping away from the screen.

education
2024—26 M.Sc. Autonomy Technologies FAU Erlangen-Nuremberg
2016—20 B.Tech. Electrical & Electronics NIT Delhi
core competencies
LLM Fine-tuning Knowledge Distillation RAG Systems Medical Imaging Vector Embeddings PyTorch
02

Experience

Oct 2025 — Present industry

Data Science Working Student

Siemens Energy, Nuremberg
  • Automated Snowflake ETL pipeline ingesting SAP data, enabling real-time Tableau dashboards
  • Reduced manual data retrieval by 1 hour per analysis for project managers
  • Created Power BI visualization dashboards, streamlining team workflows
Jul 2025 — Present research

AI Research Collaborator

ScaDS.AI Lab, TU Dresden
  • Contributed to research on bilingual language conditioning in LLMs published at IJCNLP-AACL 2025
  • Analyzed model interpretability in Llama3-70B and Qwen-72B
  • Developing universal pipeline for evidence-based retrieval-generation system
Feb — Sept 2025 industry

AI Workflow Intern

National Instruments, Emerson — Dresden
  • Awarded performance bonus for 2 invention disclosures submitted to patent committee
  • Developed manufacturing debug assistant with GenAI and tool-calling agents
  • Achieved 80%+ accuracy on 90% of test failures
  • Implemented data drift detection pipeline using statistical tests
Jun — Dec 2024 research

Student Assistant ML/Radar

Inst. of Microwaves and Photonics, FAU
  • Developed ML-enabled radar-based human pose estimation system
  • Conducted infrared and radar motion capture studies
  • Optimized RNN inference speed by evaluating lightweight architectures
2021 — 2023 industry

Consultant → Associate Consultant

Adnate IT Solutions Ltd, Jaipur
03

Projects

PRJ-001 Medical AI

RAG Pipeline for XNAT-based PACS

Developed a RAG prototype for an XNAT-based PACS system targeting deployment at University Klinikum Erlangen. Processed DICOM and NIfTI CT files to populate PostgreSQL with pgvector, enabling vector similarity search on medical image embeddings.

Embeddings Medical Imaging PostgreSQL pgvector
PRJ-002 VLM

Medical Report Generation Model

Dockerized a Vision-Language Model for CT scan report generation at University Klinikum Erlangen. Built preprocessing pipeline for CT-Scans and translation layer for English to German medical reports.

Docker VLM Medical Imaging NMT
PRJ-003 NLP

GRITLM Fine-tuning for Evidence Retrieval

Fine-tuned GRITLM embedding model for evidence-based retrieval generation. Created datasets for CheckThat!2025 challenge in preparation for CLEF 2026.

Fine-tuning RAG Multi-language
PRJ-005 RecSys

Content-Based Podcast Recommendation

Reduced data processing time by 50% with Python pipeline processing 10K+ RSS feeds using parallel operations. Achieved 85% accuracy with sentence-transformer embeddings and cosine similarity matching.

Vector Embeddings Semantic Search Python
04

Contact

Ready to collaborate on cutting-edge AI projects?
Let's build something that matters.