web-augmented-generation
This Node.js application performs web-augmented generation using various LLM providers and web search results from SearXNG.
Rephrases user queries for web search, fetches results from a SearXNG instance, summarizes the content, and generates a final response via an OpenAI-compatible API (supporting Ollama, together.ai, llama.cpp). Streams responses for real-time output and implements content similarity checks with repetition detection. Ships with an interactive CLI and detailed logging.