sprintstart-ai

The AI and RAG pipeline service for SprintStart, an AI-assisted onboarding and knowledge-retrieval platform for software development teams.

Prerequisites

Python 3.12+
uv
Ollama running locally with the required models pulled:

ollama pull llama3.2
ollama pull nomic-embed-text

# Optional — required only for image captioning
ollama pull llava:7b        # lightweight, good for development
# ollama pull qwen2-vl:7b  # recommended for production (better on diagrams/text-heavy images)

Getting Started

Local

# 1. Install dependencies
uv sync

# 2. Configure environment
cp .env.example .env
# Edit .env and fill in the values

# 3. Run the service
uv run python -m src.main

The service runs on port 8000. Interactive docs are available at /docs.

Docker

# 1. Configure environment
cp .env.example .env
# Edit .env and fill in the values

# 2. Start the service
docker-compose up --build

The service runs on port 8000.

OLLAMA_BASE_URL is automatically overridden to http://host.docker.internal:11434 inside the container, so no manual change is needed.

Environment Variables

Variable	Example value	Description
`LLM_BACKEND`	`ollama`	LLM backend to use. Currently only `ollama` is supported.
`OLLAMA_BASE_URL`	`http://localhost:11434`	Base URL of the Ollama instance. Use `http://host.docker.internal:11434` when running via Docker with Ollama on the host.
`OLLAMA_MODEL`	`gemma4:e4b`	Chat model to use for generation.
`OLLAMA_EMBED_MODEL`	`nomic-embed-text:latest`	Embedding model to use for ingestion and retrieval.
`OLLAMA_VISION_MODEL`	`llava:7b` (dev) / `qwen2-vl:7b` (prod)	Vision model for image captioning. Optional — if unset, image files are accepted but produce no chunks (`chunk_count=0`).
`CHROMA_PATH`	`./data/chroma`	Path for ChromaDB persistent storage. If unset, an in-memory store is used and data will not persist.
`AGENT_DEBUG`	`0`	When set to a truthy value, logs each agent's reasoning step (LLM text and tool calls) to stderr. Disabled by default and for `0`/`false`/`no`/`off`/empty.
`CHUNK_SIZE`	`512`	Maximum number of characters per chunk
`CHUNK_OVERLAP`	`64`	Number of characters reused between consecutive chunks to preserve context when splitting large chunks

API Endpoints

Method	Path	Description
`GET`	`/api/v1/health`	Reports service health including LLM backend status. Returns `503` if Ollama is unreachable.
`POST`	`/api/v1/ingest`	Parses, chunks, and embeds a document and stores it in the vector store. Re-ingesting the same `artifact_id` replaces existing chunks. Supports text files and images (send image content as base64; requires `OLLAMA_VISION_MODEL`).
`POST`	`/api/v1/chat`	Retrieves relevant chunks and streams a generated answer as Server-Sent Events (SSE).
`POST`	`/api/v1/title`	Generates a short descriptive title from a user prompt using an LLM and respecting the given max character length.

Chat SSE stream

The /api/v1/chat endpoint streams newline-delimited JSON events:

Event type	Description
`tool_use`	A capability the orchestrator invoked, in order. Has `name` and `kind` (`agent` for a sub-agent, `tool` for a leaf tool)
`token`	A single token fragment of the answer
`citation`	A source chunk used to generate the answer
`done`	Signals the end of the stream
`error`	Emitted on failure instead of the above

Running Tests

uv run pytest

With coverage:

uv run pytest --cov=src --cov-report=term-missing

Name		Name	Last commit message	Last commit date
Latest commit History 185 Commits
.github/workflows		.github/workflows
scripts		scripts
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

sprintstart-ai

Prerequisites

Getting Started

Local

Docker

Environment Variables

API Endpoints

Chat SSE stream

Running Tests

About

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

sprintstart-ai

Prerequisites

Getting Started

Local

Docker

Environment Variables

API Endpoints

Chat SSE stream

Running Tests

About

Resources

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages