queue-tha-llama

This is a web-based chat application that integrates Large Language Model (LLM) capabilities with Bull Queue, Redis, and Chroma.

It handles concurrent chat sessions through queue management and keeps client-server communication alive with heartbeat signals. It uses retrieval-augmented generation (RAG) for chat memory, and it manages inactive clients and job cleanup.
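
The heartbeat and inactive-client handling described above can be illustrated with a minimal in-memory sketch. This is not the project's actual code: the `HeartbeatRegistry` class, its method names, and the 30-second timeout are all hypothetical, and the cleanup callback is only a placeholder for whatever job removal the real application performs against its queue.

```typescript
// Hypothetical sketch; none of these names come from the project itself.
type CleanupFn = (clientId: string) => void;

class HeartbeatRegistry {
  // Maps each client ID to the timestamp (ms) of its most recent heartbeat.
  private lastSeen: Record<string, number> = {};
  private timeoutMs: number;
  private onInactive: CleanupFn;

  constructor(timeoutMs: number, onInactive: CleanupFn) {
    this.timeoutMs = timeoutMs;
    this.onInactive = onInactive;
  }

  // Record a heartbeat. `now` is injectable so the logic can be tested
  // without waiting on a real clock.
  beat(clientId: string, now: number = Date.now()): void {
    this.lastSeen[clientId] = now;
  }

  // Remove every client whose last heartbeat is older than the timeout,
  // run the cleanup callback for each, and return the removed IDs.
  sweep(now: number = Date.now()): string[] {
    const removed: string[] = [];
    for (const clientId of Object.keys(this.lastSeen)) {
      if (now - this.lastSeen[clientId] > this.timeoutMs) {
        delete this.lastSeen[clientId];
        this.onInactive(clientId);
        removed.push(clientId);
      }
    }
    return removed;
  }

  // A client counts as active while it still has a recorded heartbeat.
  isActive(clientId: string): boolean {
    return this.lastSeen[clientId] !== undefined;
  }
}

// Usage: in a real deployment the callback would presumably discard the
// client's queued jobs (an assumption); here it only records the ID.
const cleanedUp: string[] = [];
const registry = new HeartbeatRegistry(30000, (id) => {
  cleanedUp.push(id);
});
registry.beat("client-a", 0);
registry.beat("client-b", 20000);
registry.sweep(40000); // client-a is 40 s stale and is removed; client-b stays
```

In a server, `sweep` would typically run on a timer (for example via `setInterval`), with `beat` called whenever a heartbeat message arrives from a client.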