About

About Me

I'm a Machine Learning Engineer at Chubb Business Services India, where I develop and deploy cutting-edge ML pipelines and AI solutions. I completed my undergraduate from BITS Pilani, Pilani Campus majoring in Electrical and Electronics Engineering.

With expertise in Large Language Models, NLP, and scalable ML systems, I've contributed to reducing model inference latency by 40% and improving throughput by 50% through advanced optimization techniques. My work spans from fine-tuning LLaMA-based models to developing production-grade REST APIs and implementing robust CI/CD processes for ML deployments.

I'm passionate about leveraging AI to solve real-world problems and have published research in top-tier conferences including ACL, AACL-IJCNLP, and ACIIDS. My recent projects include AI-driven recipe generation and resume parsing systems that demonstrate the practical applications of modern AI technologies.

Experience

Professional Experience & Education

July 2023 - Present

Machine Learning Engineer

Chubb Business Services India, Hyderabad

• Developed and deployed an end-to-end ML pipeline that fine-tuned a LLaMA-based large language model on domain-specific data, resulting in 25% improvement in inference accuracy and a 15% reduction in model drift
• Engineered a high-performance inference service using vLLM, achieving a 40% reduction in latency and a 50% increase in throughput by leveraging dynamic batching and GPU memory optimizations
• Developed and deployed a Spring Boot-based RESTful API for data mastering, enrichment, and certification, leveraging Maven for streamlined dependency management and advanced caching to improve performance
• Integrated robust CI/CD processes, monitoring, and automated scaling strategies to ensure continuous model improvement and reliable production deployments across cloud-based environments

June 2022 - June 2023

NLP Research Intern

Speech Lab
Nanyang Technological University, Singapore

• Developed a machine learning language model tailored for English-Malay code-switched data, achieving a 20% improvement in accuracy over baseline models by implementing advanced statistical and neural augmentation techniques
• Integrated linguistically informed algorithms—including part-of-speech tagging and grammatical coherence—to enhance multilingual NLP robustness and advance the state-of-the-art in code-switching language processing

Summer' 21

App Developer Intern

Army Base Workshop, Pithoragarh

Built a web app that keeps track of inventory of Army Camp in a Digital Ledger through an intuitive UI

2019-2023

Bachelors of Engineering

Electrical and Electronics Engineering
BITS Pilani, Pilani Campus

Resume

Publications

WhoDunit: Evaluation benchmark for culprit detection in mystery stories

ACL ARR December 2024

Arxiv

MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation

NLP4DH, AACL-IJCNLP 2022

Taiwan

Adapting Code-Switching Language Models with Statistical-Based Text Augmentation

ACIIDS 2023

Thailand

Singaporean Conversational English-Malay Code-Switching Points

IALP 2023

Singapore

Data Augmentation for Automated Essay Scoring using Transformer Models

AISC 2023

India

Assemble AI

Open Source LLM Tools Initiative

Assemble AI is my open-source initiative dedicated to leveraging AI for real-world problems. Through this platform, I create innovative LLM-powered tools that demonstrate the practical applications of modern AI technologies, from personalized content generation to intelligent data processing.

SmartChef

AI-Driven Personalized Recipe Generator

SmartChef is an intelligent recipe generation system that utilizes GPT and advanced NLP techniques to craft custom recipes based on available ingredients, cuisine preferences, dietary restrictions, and nutritional needs.

Key Features:

Personalized recipe recommendations using advanced AI algorithms
Dietary restriction and nutritional optimization
Multi-cuisine support with cultural authenticity
Ingredient substitution suggestions
Scalable deployment via Docker and cloud infrastructure

Technologies: GPT, OpenAI API, Python, NLP, Docker, AWS

View Project

Insight

GPT-Powered Resume Parser

Insight is an advanced resume parsing system that leverages GPT technology to extract structured insights from diverse resume formats, significantly improving recruitment efficiency and candidate recommendations.

Key Achievements:

50% improvement in shortlisting efficiency
Enhanced candidate recommendations through multi-modal data integration
Robust parsing across multiple resume formats and languages
Intelligent skill matching and gap analysis
Seamless integration with existing HR workflows

Technologies: GPT, OpenAI API, Python, Multi-modal AI, Data Processing

View Project

Interested in collaborating on LLM tools or learning more about these projects?

Explore on GitHub

Skills

Technical Expertise

Programming Languages

Python, Java, C, C++, C#, JavaScript

AI/ML Technologies

PyTorch, TensorFlow, Transformers, Hugging Face, vLLM, Keras

Cloud & Infrastructure

AWS, Azure, Docker, Kubernetes, Git, GitHub

Frameworks & Tools

Flask, Spring Boot, Databricks, Maven, LaTeX

Databases

SQL, RDS

Specializations

Large Language Models, NLP, Computer Vision, MLOps

I'm Kshitij Gupta

Machine Learning Engineer

About

About Me

Experience

Professional Experience & Education

Machine Learning Engineer

NLP Research Intern

App Developer Intern

Bachelors of Engineering

Publications

Publications

ACL ARR December 2024

NLP4DH, AACL-IJCNLP 2022

ACIIDS 2023

IALP 2023

AISC 2023

Assemble AI

Open Source LLM Tools Initiative

SmartChef

Insight

Skills

Technical Expertise

Programming Languages

AI/ML Technologies

Cloud & Infrastructure

Frameworks & Tools

Databases

Specializations

Interests

My Interests

Natural Language Processing

Computer Vision

Big Data Analysis

Probabilistic Machine Learning

Transfer Learning

Deep Learning

Supervised Learning

Unsupervised Learning

Reinforcement Learning

Deductive Inference

Projects

My Projects