Siddharth Ranjeet Umathe | AI & Data Science Portfolio

🏆 Best Software Engineering Project Award

GenAI System

AI-Powered Software Engineering System

AI Engineer · Team of 7

Developed and integrated a multi-module GenAI-powered academic and software engineering system designed to improve student learning, productivity, debugging, and academic workflow automation.

Python · Flask · SQLAlchemy · Gemini API · LangChain · ChromaDB · RAG · Prompt Engineering

Problem

Students often spend significant time navigating lecture content, generating notes, revising weekly material, debugging code, and preparing for assessments. The goal was to build a unified AI-powered system that could assist across these workflows in a structured and usable way.

Approach

Worked as AI Engineer in a 7-member team, contributing to the design and integration of multiple GenAI modules. Built workflows around LLM APIs, prompt engineering, retrieval-based context grounding, backend orchestration, and modular AI pipelines.

Modules Built

AI chatbot with contextual assistance
Video / lecture summarisation
Week-level summarisation
Topic notes generation
Practice question generation
Mock quiz generation
Coding assistant for error explanation
Topic recommendation support

Technical Depth

The system involved transcript processing, content chunking, LLM prompting, vector database retrieval using ChromaDB, structured outputs, backend API integration, and response reliability handling. The focus was on building usable AI workflows that fit into an academic software platform, not just generating responses.
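
The content-chunking step that precedes vector retrieval can be sketched as follows. This is a minimal illustration, not the project's actual code: the function name `chunk_transcript`, the word-based windowing, and the `chunk_size` and `overlap` defaults are all assumptions, since the chunking strategy is not stated here.

```python
def chunk_transcript(text, chunk_size=50, overlap=10):
    """Split a transcript into overlapping windows of words.

    chunk_size and overlap are counted in words; both defaults are
    illustrative assumptions, not values taken from the project.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once a window has reached the end of the transcript.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(12))
    for chunk in chunk_transcript(sample, chunk_size=5, overlap=2):
        print(chunk)
```

The overlap means a sentence that straddles a boundary still appears whole in at least one chunk, which tends to help retrieval; each chunk would then be embedded and stored in the vector database.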

Key Learning

Learned how to move from isolated GenAI prompts to integrated AI systems involving data flow, context management, reliability, modularity, and user-facing workflows.

Speech AI

Automatic Speech Recognition Systems

ASR Pipelines · Self-supervised Models

Built and optimised end-to-end ASR pipelines using pretrained speech models, including the self-supervised wav2vec 2.0 and HuBERT and the weakly supervised Whisper, with a focus on audio preprocessing, CTC decoding, and GPU-efficient training.

PyTorch · wav2vec 2.0 · HuBERT · Whisper · Hugging Face · CTC Decoding

Problem

Speech recognition systems must handle variable-length audio, noisy samples, inconsistent sampling rates, and alignment between audio signals and text tokens. The goal was to build stable ASR pipelines capable of preprocessing speech data, training/using models, decoding outputs, and analysing transcription quality.

Approach

Worked on audio preprocessing, waveform normalisation, tokenisation, batching, padding, alignment strategies, and GPU-efficient training workflows. Implemented CTC-based decoding and encoder-based ASR architectures while experimenting with representation learning strategies for transcription robustness.
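
Greedy CTC decoding, the simplest form of the CTC-based decoding mentioned above, can be sketched as below. The vocabulary, the `blank_id` of 0, and the function name are illustrative assumptions; the input is taken to be the per-frame argmax token ids from a CTC model.

```python
def ctc_greedy_decode(frame_ids, id_to_char, blank_id=0):
    """Collapse per-frame predictions into text.

    CTC rule: merge consecutive repeats first, then drop blank tokens.
    frame_ids holds the most likely token id for each audio frame.
    """
    decoded = []
    previous = None
    for token_id in frame_ids:
        # Emit a token only when it differs from the previous frame
        # and is not the blank symbol.
        if token_id != previous and token_id != blank_id:
            decoded.append(id_to_char[token_id])
        previous = token_id
    return "".join(decoded)


if __name__ == "__main__":
    vocab = {0: "", 1: "c", 2: "a", 3: "t"}  # hypothetical vocabulary
    print(ctc_greedy_decode([0, 1, 1, 0, 2, 2, 2, 0, 3, 3], vocab))
```

A blank between two identical ids is what lets CTC emit a doubled letter: `[2, 0, 2]` decodes to "aa" while `[2, 2]` decodes to "a". This collapse step applies to CTC-headed models such as fine-tuned wav2vec 2.0 and HuBERT; Whisper is an encoder-decoder model that generates text autoregressively instead.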

Technical Depth

Designed preprocessing pipelines for variable-length audio, sampling-rate normalisation, memory-conscious batching, tokenisation, dynamic padding, model loading, and decoding. Used Hugging Face Transformers and PyTorch-based pipelines for model experimentation.
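
Dynamic padding for variable-length audio can be illustrated with a small dependency-free sketch. Plain Python lists stand in for tensors here, and `pad_batch` is a hypothetical helper name; a real pipeline would typically do this inside a PyTorch collate function.

```python
def pad_batch(waveforms, pad_value=0.0):
    """Pad every waveform to the longest one in this batch only.

    Returns the padded batch plus an attention mask
    (1 = real sample, 0 = padding) so the model can ignore the padding.
    """
    if not waveforms:
        return [], []

    max_len = max(len(w) for w in waveforms)
    padded, mask = [], []
    for w in waveforms:
        n_pad = max_len - len(w)
        padded.append(list(w) + [pad_value] * n_pad)
        mask.append([1] * len(w) + [0] * n_pad)
    return padded, mask


if __name__ == "__main__":
    batch, attention = pad_batch([[0.1, 0.2, 0.3], [0.5]])
    print(batch)
    print(attention)
```

Padding to the batch maximum rather than a global maximum is what keeps memory use proportional to the clips actually in the batch; sorting clips by length before batching reduces the padding further.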

Key Learning

Gained practical exposure to speech representation learning, sequence modeling, CTC alignment, encoder-based architectures, inference stability, and ASR system design.

NLP Fine-Tuning

Google Gemma Fine-Tuning with LoRA / PEFT

LLM Adaptation · Parameter-Efficient Methods

Fine-tuned Google Gemma large language models for domain-specific NLP tasks using parameter-efficient fine-tuning (PEFT), specifically LoRA, within the Hugging Face ecosystem.

PyTorch · Hugging Face · PEFT · LoRA · Gemma · Instruction Tuning

Problem

Large language models often need adaptation for specific tasks, domains, or response styles. Full fine-tuning can be computationally expensive, so parameter-efficient techniques are useful for improving model behavior while reducing resource requirements.

Approach

Designed training pipelines involving dataset preprocessing, prompt formatting, tokenisation, batching, LoRA configuration, training configuration tuning, and inference testing. Focused on instruction-following behavior, domain adaptation, response consistency, and efficient deployment.

Technical Depth

Worked with Hugging Face Transformers, PEFT, PyTorch, tokenizer pipelines, prompt engineering, instruction-tuning workflows, and GPU-accelerated training environments.
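
To make the idea behind LoRA concrete, here is a dependency-free sketch of the low-rank update y = Wx + (alpha / r) · B(Ax), together with the parameter saving it gives. The matrix sizes, the `alpha` value, and the helper names are illustrative assumptions, not the configuration used in this project; in practice the Hugging Face PEFT library handles this.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def lora_forward(W, A, B, x, alpha):
    """Frozen weight W plus a scaled low-rank update B @ A.

    Shapes: W is d_out x d_in, A is r x d_in, B is d_out x r.
    Only A and B would be trained; W stays frozen.
    """
    rank = len(A)
    scale = alpha / rank
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    return [b + scale * u for b, u in zip(base, update)]


def lora_param_counts(d_in, d_out, rank):
    """Trainable parameters: full fine-tuning vs. LoRA for one layer."""
    return d_in * d_out, rank * (d_in + d_out)


if __name__ == "__main__":
    W = [[1.0, 0.0], [0.0, 1.0]]
    A = [[1.0, 1.0]]          # rank 1
    B = [[1.0], [2.0]]
    print(lora_forward(W, A, B, [1.0, 2.0], alpha=2.0))
    print(lora_param_counts(4096, 4096, rank=8))
```

With B initialised to zeros, the adapted layer starts out identical to the frozen base layer, which is why LoRA training begins from the pretrained model's behaviour.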

Key Learning

Developed practical understanding of LLM fine-tuning, parameter-efficient adaptation, transformer behavior, prompt formatting, inference optimisation, and model robustness evaluation.

Computer Vision

4× Image Super Resolution

Computer Vision Competition · CNN Architectures

Developed deep-learning-based image super-resolution systems that reconstruct high-quality images from low-resolution inputs using convolutional neural network architectures and residual learning.

PyTorch · CNN · Residual Learning · PSNR/SSIM · Perceptual Loss

Problem

Image super-resolution requires recovering fine-grained spatial detail and improving perceptual quality from low-resolution inputs. The challenge is to improve sharpness and texture consistency without introducing artifacts.

Approach

Built 4× image super-resolution pipelines using CNN-based architectures, residual learning concepts, feature extraction, perceptual loss optimisation, and adversarial training concepts.

Technical Depth

Worked on image resizing, normalisation, patch-based training, augmentation, GPU-efficient workflows, training stability, and inference optimisation. Evaluated results using PSNR, SSIM, and perceptual quality analysis.
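
Of the metrics above, PSNR is the simplest to write down: PSNR = 10 · log10(MAX² / MSE). The sketch below computes it over flattened pixel sequences in plain Python; the function name and the 8-bit default `max_value` are assumptions for illustration.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two flattened images.

    max_value is the largest possible pixel value (255 for 8-bit images,
    1.0 for images normalised to [0, 1]).
    """
    if len(reference) != len(restored) or not reference:
        raise ValueError("images must be non-empty and the same size")

    mse = sum((r - s) ** 2 for r, s in zip(reference, restored)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    print(psnr([100, 100], [110, 90]))
```

Higher is better, but PSNR only measures pixel-wise error, which is why it is usually paired with SSIM and a perceptual check: a blurry output can score well on PSNR while looking worse than a sharper one.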

Key Learning

Gained exposure to image restoration, CNN architectures, perceptual learning, adversarial optimisation concepts, and computer vision experimentation.

Generative AI · Vision

GAN-Style Architecture for Generative Image Modeling

Generative AI Competition · Adversarial Training

Developed and trained GAN-style architectures for generative image modeling tasks, focused on producing realistic image outputs from encoded datasets, with careful attention to training stability and mode diversity.

PyTorch · GANs · Adversarial Training · Latent Space · FID Evaluation

Problem

Generative image modeling requires stable adversarial training and maintaining diversity while improving realism. GANs are difficult to train due to mode collapse, generator-discriminator imbalance, and unstable loss behavior.

Approach

Built end-to-end generative pipelines involving dataset preprocessing, image decoding workflows, training data organisation, augmentation, generator-discriminator optimisation, and GPU-accelerated training.

Technical Depth

Worked on encoded image shard datasets, efficient loading, normalisation, batching, latent-space behavior, generator capacity tuning, discriminator balancing, regularisation strategies, and FID-style evaluation concepts.
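
The generator-discriminator balance mentioned above is easier to reason about with the losses written out. The sketch below uses the standard non-saturating GAN formulation as an assumption; the loss actually used in this project is not stated. Inputs are discriminator outputs interpreted as probabilities that a sample is real.

```python
import math


def discriminator_loss(d_real, d_fake, eps=1e-7):
    """-mean(log D(x)) - mean(log(1 - D(G(z)))).

    d_real: discriminator probabilities on real images.
    d_fake: discriminator probabilities on generated images.
    eps guards against log(0).
    """
    real_term = -sum(math.log(max(p, eps)) for p in d_real) / len(d_real)
    fake_term = -sum(math.log(max(1.0 - p, eps)) for p in d_fake) / len(d_fake)
    return real_term + fake_term


def generator_loss(d_fake, eps=1e-7):
    """Non-saturating generator loss: -mean(log D(G(z)))."""
    return -sum(math.log(max(p, eps)) for p in d_fake) / len(d_fake)


if __name__ == "__main__":
    # A discriminator that cannot tell real from fake outputs 0.5 everywhere.
    print(discriminator_loss([0.5], [0.5]), generator_loss([0.5]))
```

One rough signal of imbalance under this formulation is a discriminator loss that falls towards zero while the generator loss keeps growing, which suggests the discriminator is winning too easily.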

Key Learning

Gained practical exposure to adversarial learning, latent representation learning, training stabilisation, GPU-intensive experimentation, and generative modeling workflows.

Data Science

Business Data Management · Native Chefs

Applied Data Analysis · B2C Business Insights

Analysed real-world business data from Native Chefs, a B2C home-cooked food delivery business, to identify operational and revenue-related insights around unpaid orders, dish performance, and customer behavior.

Python · Pandas · Matplotlib · EDA · Business Analytics

Problem

The business needed better visibility into unpaid orders, dish-level performance, customer ordering behavior, and revenue leakage.

Approach

Performed data cleaning, preprocessing, descriptive statistics, exploratory analysis, pivot-based summaries, dashboarding, and business interpretation.
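
A pivot-style summary of unpaid orders by dish can be sketched as below. The record fields (`dish`, `amount`, `paid`) and the sample dish names are hypothetical, since the real dataset's schema is not shown here, and the sketch uses plain Python rather than Pandas so that it runs without dependencies.

```python
from collections import defaultdict


def unpaid_summary(orders):
    """Aggregate billed and unpaid amounts per dish.

    Each order is a dict with hypothetical keys:
    'dish' (str), 'amount' (float), 'paid' (bool).
    """
    totals = defaultdict(lambda: {"billed": 0.0, "unpaid": 0.0})
    for order in orders:
        row = totals[order["dish"]]
        row["billed"] += order["amount"]
        if not order["paid"]:
            row["unpaid"] += order["amount"]

    summary = {}
    for dish, row in totals.items():
        # Share of this dish's billed revenue that was never collected.
        share = row["unpaid"] / row["billed"] if row["billed"] else 0.0
        summary[dish] = {**row, "unpaid_share": share}
    return summary


if __name__ == "__main__":
    sample_orders = [
        {"dish": "Thali", "amount": 200.0, "paid": True},
        {"dish": "Thali", "amount": 100.0, "paid": False},
        {"dish": "Biryani", "amount": 150.0, "paid": True},
    ]
    for dish, row in unpaid_summary(sample_orders).items():
        print(dish, row)
```

Ranking dishes by `unpaid_share` rather than by raw unpaid amount separates dishes that are simply popular from dishes where collection is actually the problem.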

Focus Areas

Revenue leakage through unpaid orders
Dish-level performance analysis
Customer ordering behavior patterns
Revenue trend identification
Operational improvement recommendations

Key Learning

Gained practical experience in turning raw business data into decision-support insights, bridging the gap between data exploration and actionable business recommendation.
