PG
← Back to Projects
liveAI

PDF Sidekick

AI-Powered PDF Assistant — Chat, Extract, Summarize

Updated: Dec 10, 2025Created: Sep 1, 2024
PythonFastAPIPostgreSQLSentence TransformersGoogle Flan-T5-LargeNext.js 15TypeScriptWebSocketsCloudinary

What It Does

An AI-powered web app enabling users to chat with uploaded PDFs, leveraging embeddings + LLMs. Full RAG pipeline from PDF upload to context-aware answer synthesis.

Key Features

  • RAG pipeline — PDF upload → text extraction → recursive chunking → embeddings → vector search → LLM synthesis
  • OCR fallback — Tesseract OCR for scanned documents when PyPDF2 fails
  • Real-time WebSocket Q&A — Structured JSON messaging with processing status tracking
  • Context-aware answers — Semantic search with graceful fallback when no relevant context found
  • Full-stack — Next.js 15 frontend, FastAPI backend, NeonDB PostgreSQL, Cloudinary storage