👋 Hello, I’m

Rohan Patil

Building AI Systems That Scale 🚀

AI/ML Engineer with experience at Perplexity and Amazon, building production-grade LLM pipelines, RAG systems, and distributed ML infrastructure for real-world high-scale environments.

Resume

How I build systems

AI / ML

Python

PyTorch

TensorFlow

Scikit-learn

NumPy

Pandas

LLM / GenAI

OpenAI

LangChain

LangGraph

RAG Systems

Embeddings

Data / Streaming

Kafka

Spark

Redis

Backend

FastAPI

APIs

Microservices

Infra

Docker

Kubernetes

Databases

PostgreSQL

MongoDB

Redis Cache

Selected Projects

Adaptive RAG Chatbot

Built an adaptive Retrieval-Augmented Generation (RAG) system that improves LLM reliability by dynamically selecting retrieval strategies based on query complexity. The system balances latency and accuracy by routing simple queries through lightweight retrieval while applying deeper contextual search and re-ranking for complex queries.
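The routing idea can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the cue-word list, the `score_complexity` heuristic, the `threshold`, and the `top_k` values are all hypothetical stand-ins for whatever complexity signal the real system uses.

```python
from dataclasses import dataclass

# Hypothetical cue words; the project description does not specify its heuristic.
COMPLEX_CUES = {"compare", "why", "explain", "versus", "difference", "tradeoff"}


@dataclass
class RetrievalPlan:
    strategy: str   # "light" or "deep"
    top_k: int      # number of chunks to fetch
    rerank: bool    # whether to apply a re-ranking pass


def score_complexity(query: str) -> float:
    """Crude complexity score in [0, 1] from query length and cue words."""
    words = query.lower().split()
    length_score = min(len(words) / 20.0, 1.0)
    cue_score = 1.0 if any(w.strip("?,.") in COMPLEX_CUES for w in words) else 0.0
    return 0.5 * length_score + 0.5 * cue_score


def plan_retrieval(query: str, threshold: float = 0.4) -> RetrievalPlan:
    """Route simple queries to a cheap path and complex ones to a deeper path."""
    if score_complexity(query) < threshold:
        return RetrievalPlan(strategy="light", top_k=3, rerank=False)
    return RetrievalPlan(strategy="deep", top_k=20, rerank=True)
```

With these assumed settings, a short factual question takes the light path, while a comparison question is sent through deeper search with re-ranking.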

LangGraph · Qdrant · FAISS · MongoDB · FastAPI · GPT-4o


LENS — AI Image Intelligence

Developed a real-time multimodal AI system that processes images and generates contextual outputs across multiple modes including storytelling, humor, and analytical reasoning. The system leverages GPT-4o Vision with streaming responses to deliver an interactive and engaging user experience.
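As a rough sketch of the streaming side, the snippet below wraps model output chunks in Server-Sent Events frames. The mode names and prompts in `MODE_PROMPTS`, the JSON payload shape, and the final `done` event are assumptions for illustration; the model call itself is left out.

```python
import json
from typing import Iterable, Iterator

# Hypothetical mode prompts; the description names only the three modes.
MODE_PROMPTS = {
    "story": "Tell a short story inspired by this image.",
    "humor": "Write a playful caption for this image.",
    "analysis": "Describe and reason about what this image shows.",
}


def sse_events(tokens: Iterable[str], mode: str) -> Iterator[str]:
    """Wrap model output chunks as Server-Sent Events frames."""
    if mode not in MODE_PROMPTS:
        raise ValueError(f"unknown mode: {mode}")
    for token in tokens:
        payload = json.dumps({"mode": mode, "delta": token})
        # Each SSE frame is a "data:" line terminated by a blank line.
        yield f"data: {payload}\n\n"
    # Assumed convention: a named event tells the client the stream is finished.
    yield "event: done\ndata: {}\n\n"
```

A web handler could return this generator as a `text/event-stream` response so the browser renders text as it arrives.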

GPT-4o Vision · Next.js · Streaming (SSE) · Multimodal AI


Second Brain — Knowledge Graph

Built an AI-powered system that converts raw notes and text into structured knowledge graphs. The system extracts entities and relationships using LLMs and visualizes them as an interactive force-directed graph, enabling intuitive exploration of complex information.
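One way the extracted relationships could be turned into graph data is sketched below. It assumes the LLM step has already produced `(subject, relation, object)` triples; the triple format and the `triples_to_graph` helper are hypothetical, not taken from the project.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def triples_to_graph(triples: List[Triple]) -> Dict[str, list]:
    """Convert extracted triples into a node-link dict for a force layout."""
    seen: Dict[str, None] = {}  # dict used as an insertion-ordered set
    links = []
    for subj, rel, obj in triples:
        subj, obj = subj.strip(), obj.strip()
        seen.setdefault(subj)
        seen.setdefault(obj)
        links.append({"source": subj, "target": obj, "label": rel})
    nodes = [{"id": name} for name in seen]
    return {"nodes": nodes, "links": links}
```

The `nodes`/`links` shape matches what a D3 force simulation can consume when links are resolved by node id, for example via `d3.forceLink(links).id(d => d.id)`.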

GPT-4o · D3.js · Next.js · Knowledge Graphs


Human Activity Recognition System

Built a lightweight human activity recognition system using 2D pose keypoints instead of raw video, enabling efficient sequence modeling with high accuracy. The system leverages temporal patterns using LSTM networks while significantly reducing computational overhead compared to RGB-based approaches.
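To illustrate why keypoints are cheaper than raw video, here is a minimal sketch of preparing pose sequences for a sequence model. The centering on the first keypoint, the window length, and the stride are assumptions for illustration; the project's actual preprocessing is not described.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[float]]  # one frame: [(x, y), ...], one pair per keypoint


def flatten_frame(frame: Frame) -> List[float]:
    """Flatten (x, y) keypoints into one vector, centered on keypoint 0."""
    cx, cy = frame[0]
    out: List[float] = []
    for x, y in frame:
        out.extend((x - cx, y - cy))
    return out


def make_windows(frames: Sequence[Frame], window: int, stride: int) -> List[List[List[float]]]:
    """Slice a keypoint sequence into overlapping fixed-length windows."""
    feats = [flatten_frame(f) for f in frames]
    return [feats[i:i + window] for i in range(0, len(feats) - window + 1, stride)]
```

The result has the layout (windows, timesteps, features) that recurrent layers such as an LSTM expect, with only two numbers per keypoint per frame instead of a full RGB image.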

TensorFlow · LSTM · OpenPose · Flask


Driver Drowsiness Detection System

Developed a real-time driver drowsiness detection system that monitors eye states using computer vision and deep learning. The system processes webcam input, detects facial regions, and classifies eye states to trigger alerts when fatigue is detected.
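The alert step can be sketched as a simple streak counter over per-frame classifier results. The `DrowsinessMonitor` class and the default of 15 consecutive closed-eye frames are hypothetical; the project's real threshold and alert logic are not stated.

```python
class DrowsinessMonitor:
    """Raise an alert after eyes stay closed for N consecutive frames."""

    def __init__(self, closed_frames_to_alert: int = 15):
        self.limit = closed_frames_to_alert
        self.closed_streak = 0

    def update(self, eyes_closed: bool) -> bool:
        """Feed one per-frame classifier result; return True while alerting."""
        # Any open-eye frame resets the streak, so normal blinks do not trigger.
        self.closed_streak = self.closed_streak + 1 if eyes_closed else 0
        return self.closed_streak >= self.limit
```

Requiring a run of closed-eye frames, rather than a single one, is what separates a blink from sustained eye closure.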

OpenCV · CNN · Keras · Pygame


Location Intelligence & Clustering System

Built a location intelligence system that analyzes geospatial and venue data to identify optimal neighborhoods. The system applies clustering techniques to group similar areas and provide insights for decision-making based on data patterns.
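Before clustering, venue data has to become a numeric profile per neighborhood. The sketch below shows one plausible feature step, assuming venues arrive as category labels grouped by neighborhood; the `venue_profiles` helper and the share-of-venues features are illustrative assumptions, not the project's documented pipeline.

```python
from collections import Counter
from typing import Dict, List


def venue_profiles(venues: Dict[str, List[str]]) -> Dict[str, Dict[str, float]]:
    """Map each neighborhood to the share of its venues in each category."""
    categories = sorted({c for cats in venues.values() for c in cats})
    profiles = {}
    for hood, cats in venues.items():
        counts = Counter(cats)
        total = len(cats) or 1  # avoid division by zero for empty neighborhoods
        profiles[hood] = {c: counts[c] / total for c in categories}
    return profiles
```

Each profile is a fixed-length vector over the same categories, so the rows can be passed to a clustering algorithm such as KMeans to group similar neighborhoods.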

KMeans · Pandas · Scikit-learn · Geospatial Analysis


Experience

AI/ML Engineer — Perplexity

June 2024 – Present · San Francisco, CA

• Architected RAG pipelines integrating vector search + web indexing.
• Built FAISS + BM25 hybrid retrieval with re-ranking, improving precision.
• Optimized Triton GPU inference → +25% throughput.
• Designed LLM routing (on-device + cloud) for sub-second latency.
• Improved factual consistency via ranking + citation pipelines.
• Built evaluation systems tracking latency, accuracy, and UX metrics.
• Led 0→1 agentic AI features → +18% engagement.
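The hybrid retrieval bullet above combines a dense (FAISS) ranking with a sparse (BM25) ranking. How the two lists were merged is not stated; reciprocal rank fusion is one common choice and is used here purely as an assumed example, with the conventional smoothing constant `k = 60`.

```python
from collections import defaultdict
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Fuse ranked doc-id lists: score(d) = sum over lists of 1 / (k + rank)."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by fused score, breaking ties by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))
```

Because it uses only ranks, this fusion does not need the dense and sparse scores to be on the same scale; a re-ranker could then be applied to the top of the fused list.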

AI/ML Engineer — Amazon

Oct 2019 – June 2023 · India

• Built batch + streaming pipelines using AWS, Spark, Kafka.
• Designed feature systems → +30% faster data access.
• Prevented training-serving skew in real-time ML systems.
• Built Kafka + Spark streaming pipelines for low-latency updates.
• Orchestrated ML workflows with Airflow + SageMaker.
• Built drift detection + monitoring datasets.
• Reduced infra cost by ~15% via optimization.
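As an illustration of the drift and skew checks mentioned in the bullets above, the sketch below computes a Population Stability Index between a training sample and a serving sample of one feature. PSI is an assumed choice of metric, and the bin count and the `1e-6` floor are illustrative; the bullets do not say which statistic was used.

```python
import math
from typing import List, Sequence


def psi(expected: Sequence[float], actual: Sequence[float], bins: int = 10) -> float:
    """Population Stability Index between a training and a serving sample."""
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 if all values are equal

    def shares(values: Sequence[float]) -> List[float]:
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            counts[min(max(idx, 0), bins - 1)] += 1  # clamp out-of-range values
        # Small floor keeps the log defined for empty bins.
        return [max(c / len(values), 1e-6) for c in counts]

    e, a = shares(expected), shares(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))
```

Identical distributions give a PSI of zero, and the value grows as the serving distribution moves away from the training one, so a monitoring job can alert when it crosses a chosen threshold.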

Education

MS in Computer Science — Binghamton University

2023 – 2025 · New York, USA

GPA: 3.85

Tools & Technologies

AI / ML

PyTorch · TensorFlow · Scikit-learn · NumPy · Pandas · Time Series · Model Evaluation

LLM / GenAI

RAG · LangChain · LangGraph · OpenAI APIs · Embeddings · Prompt Engineering · Semantic Search · LLM Evaluation

Data Engineering

Apache Spark · Kafka · Airflow · ETL Pipelines · Parquet · Streaming Systems · Feature Engineering

Infrastructure & Cloud

AWS · Kubernetes · Docker · SageMaker · Redshift · GPU Inference · Triton Server · CI/CD · Prometheus · Grafana

Let’s Build Something Great 🚀

I’m open to AI/ML Engineering roles, collaborations, and interesting problems. Feel free to reach out — I’d love to connect.

Email Me · LinkedIn · GitHub
