Currently · Associate AI Engineer @ Fx31 Labs

Building neural
systems that think,
reason, and ship.

I'm Yog Prajapati — an AI Engineer focused on building practical AI products using LLMs, RAG systems, and modern AI workflows. I work on transforming AI ideas into real-world applications by combining backend engineering, intelligent retrieval systems, and scalable product architecture. I have experience building production-ready AI platforms, full-stack applications, and AI-powered user experiences.

5+
AI & Full-Stack Projects Built
15+
Production-Ready Features & Integrations
AI Systems
Built with LLMs, RAG & Modern AI Workflows
~/yog/model.pt
forward · t=0.142s
loss · 0.0271
online · Ahmedabad, IN
UTC+5:30
role
AI
focus
LLMs · RAG · Eval
stack
Python · React · Next.js · LangChain · FastAPI
builds
AI Products · Full-Stack Platforms · Production Systems
PyTorch TensorFlow JAX Hugging Face LangChain Weights & Biases Ray vLLM Pinecone Weaviate Triton ONNX CUDA AWS GCP Modal PyTorch TensorFlow JAX Hugging Face LangChain Weights & Biases Ray vLLM Pinecone Weaviate Triton ONNX CUDA AWS GCP Modal
About

Engineer who treats models like software.

I started in research, drifted into systems, and now I sit happily at the seam. I care about evaluation as much as architecture and shipping as much as state-of-the-art.

Open to collabs
default activationGELU

“A model that ships on Tuesday beats a model that benchmarks on Friday.”

— a mentor, c. 2022
Modeling
/modeling
  • LLM fine-tuning (SFT, DPO)
  • Transformers
  • Diffusion
  • Multi-modal
Systems
/systems
  • Distributed training
  • vLLM / TGI serving
  • Vector retrieval
  • Quantization
MLOps
/mlops
  • Eval harnesses
  • Telemetry
  • CI for models
  • Cost / latency tuning
Backend
/backend
  • Python · Go
  • FastAPI
  • Postgres
  • Kafka · gRPC
Beyond the keyboard
Climber, slow-coffee enthusiast, occasional speaker.
Coffee/Day · 2.5Bouldering · V6
Selected work

Projects that earned their place in prod.

A small slice — open-source experiments, internal platforms, and one or two papers that became products.

All projects
PROJECT_01

MultiModal RAG System

AI-powered multimodal retrieval and reasoning platform

RAG
multimodal retrieval
LangGraph · LangChain · Vector Retrieval
live

Built a production-ready multimodal RAG system using LangGraph and LangChain capable of processing programming languages, Github PRs, documentation, and structured data with intelligent retrieval, contextual reasoning, semantic search, conversation memory, and AI-powered response generation workflows.

PythonFastAPILangChainLangGraphFAISSMongoDBOpenAI APIsTailwind CSS
PROJECT_02

HireSight

AI-powered resume ATS analyzer and interview preparation platform

ATS
resume analysis
LLM · Resume Intelligence
live

Built an interview preparation platform featuring mock tests, AI-powered ATS resume analysis, skill assessments, and personalized feedback to help candidates improve job readiness and resume quality.

ReactNode.jsExpressMongoDBTailwind CSSAI/LLM APIs
PROJECT_03

TalentPilot

AI-powered HR automation and candidate screening platform

AI
candidate screening
LLM · Hiring Pipeline
live

Built an intelligent hiring workflow platform that analyzes resumes using AI, categorizes candidates, automates screening, and helps recruiters make faster hiring decisions using Azure OpenAI and Gmail integrations.

ReactNode.jsMongoDBAzure OpenAIRedisGmail API
PROJECT_04

iPharm

AI-powered pharmacy and medicine recommendation platform

5000+
SKU support
ML · Recommendation Engine
live

Developed a scalable multi-vendor pharmacy platform with pincode-based medicine discovery, AI-powered recommendation engine, real-time user tracking, secure payments, and Socket.IO integration.

ReactNode.jsMongoDBPython MLRedisSocket.IO
PROJECT_05

Leaf AI

Deep learning-based potato leaf disease detection system

CNN
disease classification
CNN · Image Classification
live

Created a real-time plant disease detection platform using Deep Learning and computer vision. Integrated image preprocessing, intelligent validation, and CNN-based classification for accurate disease prediction.

TensorFlowFastAPIPythonNext.jsOpenCV
PROJECT_06

Intellixor

AI tools discovery and review platform

Full
stack platform
AI · Discovery Platform
live

Built a production-ready AI tools directory with authentication, bookmarking, reviews, admin dashboard, SEO optimization, and category-based AI tool discovery using modern full-stack architecture.

React.jsTypeScriptNode.jsSupabaseTailwind CSS
Neural Studio

The architectures I think in.

A visual index of the model families I work with daily — from the humble MLP to the transformer block that ate the world.

01 · SELF-ATTENTION
softmax(QKᵀ / √dₖ) · V

Scaled dot-product attention

Every token gets to query every other token. Heavy upfront — O(n²) — but parallelizable, and the inductive bias that unlocked LLMs.

queries
Q→ “the
keys
K→ “model
values
V→ “learns
02 · MLP
y = σ(Wx + b)

Feed-forward, the workhorse

Stacked linear + nonlinearity. Old, simple, and 60% of the FLOPs in every transformer block.

03 · CNN
conv(x, W) + b, stride=1, pad=same

Convolutional feature maps

Local receptive fields, weight sharing, hierarchy. Still the right answer for pixels.

04 · RNN
hₜ = tanh(W·xₜ + U·hₜ₋₁ + b)

Recurrence over time

A single cell, reused. Beautiful in theory; you'll meet vanishing gradients on the way home.

05 · LSTM
C̃ₜ = tanh(W_c · [hₜ₋₁, xₜ])

Gates that remember

Forget, input, output. Add a carry line and suddenly long-range dependencies are tractable.

06 · TRANSFORMER
Vaswani et al., 2017

Block × N

Attention + FFN + residual + norm. Repeat until you have a personality.

07 · OPTIMIZATION
θ ← θ − η · ∇L(θ)

Gradient field

Backprop, one step at a time. Every arrow is a parameter that just got slightly less wrong.

08 · DECODING
temperature=0.7 · top_p=0.9

Token stream

What the model is doing right now, one piece at a time.

tokenize
embed
▁the
▁model
▁learns
logits[0]
0.124
softmax
▁to
▁predict
▁next
▁token
argmax
logits[1]
0.872
▁sample
temperature=0.7
▁top_p=0.9
▁repeat
EOS
kv_cache::hit
attn[0,12]=0.42
layer_norm
ffn
residual
→ output
tokenize
embed
▁the
▁model
▁learns
logits[0]
0.124
softmax
▁to
▁predict
▁next
▁token
argmax
logits[1]
0.872
▁sample
temperature=0.7
▁top_p=0.9
▁repeat
EOS
kv_cache::hit
attn[0,12]=0.42
layer_norm
ffn
residual
→ output
Experience · Education

A timeline, mostly linear.

Six years of work and learning. The interesting parts live in the bullets.

  1. Work· Jun 2026 — Present

    Associate AI Engineer

    Fx31 Labs
    • developing and deploying RAG systems, retrieval pipelines, LLM powered applications, and scalable backend services for real world AI solutions
    • Collaborating on production grade client projects and contributing to AI workflow automation and intelligent systems development.
    • Handling client interactions in AI solution design and implementation.
    PythonFastAPIMicroservicesDockerRedisAgentic Workflows(Langchain/Langgraph/n8n)LLMsRAG
  2. Work· Dec 2025 — May 2026

    AI Intern

    Fx31 Labs
    • Contributed to the development of RAG systems, retrieval pipelines, and LLM powered applications.
    • Gained hands-on experience in building scalable backend services and AI workflow automation.
    RAGLLMsPythonFastAPIDocker
  3. Education· 2023 — 2026

    B.Tech Computer Science and Engineering (Specialization in AI/ML)

    GLS University
    • Graduated with distinction. CGPA: 8.2/10.
    Data StructuresDeep LearningNLPComputer VisionReinforcement Learning
  4. Education· 2020 — 2023

    Diploma in Computer Engineering

    Gujarat Technological University
    • Completed with distinction. CGPA: 7.92/10.
    Operating SystemsNetworkingDBMSTroubleshootingCompetitive Programming
Certifications · Achievements

Receipts.

Selected credentials and recognitions, in case those matter to you.

Cert· 2025
Create a Dashboard with Power BI
GLS University | Koenig Solutions | Microsoft
Cert· 2025
Green Skills & Artificial Intelligence Foundation
GLS University - Skills4Future Program
Cert· 2024
What is Generative AI?
LinkedIn
Cert· 2024
Programming with JavaScript
Meta via Coursera
Cert· 2023
Python for Data Science, AI & Development
IBM via Coursera
Cert· 2023
Fundamentals of Java Programming
Board Infinity via Coursera
Tech stack

Tools I reach for first.

Pragmatic over fashionable. I'll happily swap any of these out if the problem says so.

AI / ML
12
  • Python
  • TensorFlow
  • PyTorch
  • Scikit-learn
  • Pandas
  • NumPy
  • OpenCV
  • Deep Learning
  • Machine Learning
  • NLP
  • Recommendation Systems
  • Computer Vision
LLM & GenAI
11
  • LangChain
  • LangGraph
  • RAG
  • LLM Integration
  • Prompt Engineering
  • Gemini API
  • Azure OpenAI
  • AI Agents
  • Vector Search
  • Embeddings
  • AI Workflow Automation
Frontend
8
  • React.js
  • Next.js
  • TypeScript
  • JavaScript
  • Tailwind CSS
  • HTML
  • CSS
  • Socket.IO
Backend
9
  • Node.js
  • Express.js
  • FastAPI
  • REST APIs
  • Microservices
  • WebSockets
  • Authentication
  • API Integration
  • Real-time Systems
Database & Storage
7
  • MongoDB
  • PostgreSQL
  • MySQL
  • Redis
  • Supabase
  • Firebase
  • Cloudinary
Cloud / DevOps
9
  • Docker
  • Git
  • GitHub
  • Postman
  • Render
  • Vercel
  • Azure
  • CI/CD
  • Deployment
Tools & Platforms
8
  • VS Code
  • Cursor AI
  • Figma
  • Jupyter Notebook
  • Swagger
  • Linux
  • npm
  • GitHub Actions
Contact

Let's build something useful.

Open to AI Engineering roles, applied AI collaborations, and building scalable AI-powered platforms using LLMs, RAG, and modern AI workflows. I usually respond within 48 hours.

Encrypted in transit · Never shared