Abhishek Jain

Data Scientist with 7+ years of experience applying Python, SQL, statistical modelling, machine learning, to solve large-scale business problems in data-intensive environments. Experienced in building end-to-end explainable models, translating business problems into analytics solutions, and deploying scalable data products. A strong communicator with a proven track record of driving data-driven decision-making in Agile teams.

📄 View My Resume
Abhishek Jain

Featured Work

⚡ Smart PDF RAG Chatbot

A lightning-fast Streamlit application powered by Groq (Llama-3.3-70B), LangChain, and local FAISS vector storage. Allows users to query complex PDFs with zero API costs.

Python LangChain Groq GenAI Streamlit

🔍 Why Your RAG Pipeline is Failing

Moving beyond basic LangChain tutorials. A deep dive into semantic chunking, hybrid search (BM25 + Vector), and cross-encoder re-ranking.

Generative AI Information Retrieval RAG NLP

📈 Energy Demand Forecasting

Built and evaluated custom transformer model for time series prediction. Comparison with baselines like ARIMA, XGboost and advanced models like LSTMs, Bi-LSTMs, GRU, TSmixer to validate the efficiency.

PyTorch Transformers ARIMA XGboost LSTM Bi-LSTM GRU TSmixer TFT Time-Series

📊 Anatomy of AI Evaluation

A deep dive into model evaluation. Moving beyond basic accuracy to explore Precision, Recall, Cross-Entropy Loss, and modern LLM/RAG metrics.

Machine Learning NLP LLMs Statistics

🧠 Math Behind Transformers

Transformers look scary, but they are mostly matrix multiplication! A breakdown of the linear algebra and non-linear functions powering LLMs.

Deep Learning Mathematics LLMs

🚀 Serving ML Models at Scale

Beyond the Jupyter notebook. A practical guide to wrapping models in FastAPI, containerizing with Docker, and establishing MLOps monitoring.

MLOps FastAPI Docker System Design

💼 ML Metrics to Business ROI

Bridging the gap between engineering and the boardroom. A guide to converting Precision, Recall, and Model Thresholds into actual dollar amounts.

Business Strategy ROI Modeling Stakeholder Comm. Product Analytics

Technical Arsenal

Programming & Cloud

Python SQL R AWS Azure

Machine Learning & Stats

Supervised & Unsupervised Learning Time Series (ARIMA, SARIMAX, VAR) XGBoost & CatBoost A/B Testing PCA & Clustering

Deep Learning & GenAI

PyTorch Transformer Models (LLaMA, BERT, ViT) RAG & Vector DBs (Pinecone, Milvus) LangChain & Hugging Face PEFT/LoRA Fine-tuning NLP & Neural Networks

MLOps & Visualization

Databricks Docker & CI/CD Git Flask & FastAPI Tableau & Power BI QuickSight