semantic-qa-gen

Visit Website
GitHub RepoAI / Developer Tools (LLM toolingIdea / Pre-seed (early open-source project; minimal traction indicated by stars)Unknown (not specified in provided repository metadata)

Description

A Python library for generating high-quality question-answer pairs from PDF, DOCX, MD, and TXT files (supports workflows around RAG and multiple LLM backends such as OpenAI, Ollama, GPT4All, and LM Studio).

Founders

Bazinga23451 (GitHub owner; individual founder/maintainer inferred)

Discovered

April 26, 2025

Added to Database

January 25, 2026

Notes

Useful infrastructure for teams building RAG and fine-tuning pipelines by automating high-quality QA dataset creation from common document formats. Multi-backend LLM compatibility (OpenAI + local runtimes) makes it attractive for cost-sensitive and privacy-conscious deployments.

Related Links