semantic-qa-gen
Visit WebsiteGitHub RepoAI / Developer Tools (LLM toolingIdea / Pre-seed (early open-source project; minimal traction indicated by stars)Unknown (not specified in provided repository metadata)
Description
A Python library for generating high-quality question-answer pairs from PDF, DOCX, MD, and TXT files (supports workflows around RAG and multiple LLM backends such as OpenAI, Ollama, GPT4All, and LM Studio).
Founders
Bazinga23451 (GitHub owner; individual founder/maintainer inferred)
Discovered
April 26, 2025
Added to Database
January 25, 2026
Notes
Useful infrastructure for teams building RAG and fine-tuning pipelines by automating high-quality QA dataset creation from common document formats. Multi-backend LLM compatibility (OpenAI + local runtimes) makes it attractive for cost-sensitive and privacy-conscious deployments.