AI Architecture

The Lexigram AI subsystem is organized into six layers. Each layer builds on the one below it, and all AI packages follow a strict dependency rule: they import only from lexigram and lexigram-contracts, never from each other.

graph TD
    A["Integration Layer<br/>lexigram-ai-mcp"] --> B["Infrastructure Layer<br/>lexigram-ai-workers, -observability, -feedback"]
    B --> C["Memory Layer<br/>lexigram-ai-memory, -session"]
    C --> D["Reasoning Layer<br/>lexigram-ai-agents, -skills"]
    D --> E["Knowledge Layer<br/>lexigram-ai-rag, lexigram-vector"]
    E --> F["Base Layer<br/>lexigram-ai-llm"]

1. Base Layer — `lexigram-ai-llm`

Defines the LLM client protocol and provides:

Provider routing: Route requests to OpenAI, Anthropic, Google, and local models via a unified interface
Thinking suppression: Configurable control over chain-of-thought output from reasoning models
Token tracking: Usage accounting and cost estimation

All higher layers depend on this package for model access.
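The three capabilities listed above can be sketched together in a few lines. This is a minimal illustration, not the `lexigram-ai-llm` API: `LLMProvider`, `Router`, `EchoProvider`, the `<think>` tag convention, the `prefix/model` naming scheme, and the price table are all assumptions made for the example, and token counts are approximated by whitespace-split word counts.

```python
import re
from dataclasses import dataclass, field
from typing import Protocol


@dataclass
class Completion:
    text: str
    input_tokens: int
    output_tokens: int


class LLMProvider(Protocol):
    """Hypothetical unified interface that every provider adapter implements."""

    def complete(self, model: str, prompt: str) -> Completion: ...


class EchoProvider:
    """Stand-in provider for the sketch; a real adapter would call a vendor API."""

    def complete(self, model: str, prompt: str) -> Completion:
        text = f"<think>planning</think>echo: {prompt}"
        # Word counts stand in for real tokenizer counts.
        return Completion(text, len(prompt.split()), len(text.split()))


# Matches one <think>...</think> span; the tag name is an assumption.
THINK_BLOCK = re.compile(r"<think>.*?</think>", re.DOTALL)


@dataclass
class Router:
    providers: dict[str, LLMProvider]   # keyed by model-name prefix
    price_per_1k: dict[str, float]      # illustrative price per 1,000 tokens
    suppress_thinking: bool = True
    usage: dict[str, int] = field(default_factory=dict)

    def complete(self, model: str, prompt: str) -> Completion:
        # Provider routing: "local/tiny" is handled by the "local" adapter.
        prefix = model.split("/", 1)[0]
        if prefix not in self.providers:
            raise KeyError(f"no provider registered for {prefix!r}")
        result = self.providers[prefix].complete(model, prompt)
        # Thinking suppression: drop chain-of-thought spans when configured.
        if self.suppress_thinking:
            result.text = THINK_BLOCK.sub("", result.text)
        # Token tracking: accumulate total tokens per model.
        total = result.input_tokens + result.output_tokens
        self.usage[model] = self.usage.get(model, 0) + total
        return result

    def estimated_cost(self, model: str) -> float:
        prefix = model.split("/", 1)[0]
        return self.usage.get(model, 0) / 1000 * self.price_per_1k[prefix]
```

Keeping routing, suppression, and accounting behind a single `complete` call is one way to let higher layers stay unaware of which provider served a request.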

2. Knowledge Layer — `lexigram-ai-rag` + `lexigram-vector`

Retrieval-Augmented Generation and vector storage.

Document ingestion: Chunking, embedding, and indexing pipelines
Retrieval: Hybrid search (semantic + keyword), re-ranking, contextual compression
Vector storage: Abstraction over pgvector, Qdrant, Pinecone, and in-memory
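To make chunking and hybrid search concrete, here is a small sketch. The page does not say how Lexigram splits documents or combines semantic and keyword results; the example uses overlapping word windows and reciprocal rank fusion (RRF), a common fusion technique, purely as an illustration. The function names `chunk` and `reciprocal_rank_fusion` and the constant `k = 60` are assumptions, not part of `lexigram-ai-rag`.

```python
from collections import defaultdict


def chunk(text: str, size: int = 5, overlap: int = 1) -> list[str]:
    """Split text into windows of `size` words that share `overlap` words."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    words = text.split()
    if not words:
        return []
    step = size - overlap
    last_start = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, last_start, step)]


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge several ranked lists of document ids into one ranking.

    Each document scores 1 / (k + rank) in every list it appears in;
    the scores are summed and documents are sorted by the total.
    """
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending score; break ties by id so the output is stable.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Usage: fuse a semantic ranking with a keyword ranking.
semantic_hits = ["a", "b", "c"]
keyword_hits = ["b", "d", "a"]
fused = reciprocal_rank_fusion([semantic_hits, keyword_hits])
```

Documents that appear near the top of both lists (`b` and `a` here) end up ahead of documents that only one retriever found, which is the behavior hybrid search is after. A re-ranking step would then reorder the fused list.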

3. Reasoning Layer — `lexigram-ai-agents` + `lexigram-ai-skills`

Multi-step reasoning and tool use.

Agents: Loop-based reasoning with tool selection, error recovery, and structured output
Skills: Reusable tool definitions that agents can discover and invoke at runtime
Orchestration: Parallel tool execution, conditional branching, sub-agent delegation
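The loop-based reasoning described above can be sketched as follows. This is not the `lexigram-ai-agents` implementation: `ToolCall`, `FinalAnswer`, `run_agent`, and the scripted policy are hypothetical names, and the policy function stands in for an LLM call so the example runs without a model.

```python
from dataclasses import dataclass
from typing import Callable, Union


@dataclass
class ToolCall:
    name: str
    argument: str


@dataclass
class FinalAnswer:
    text: str


Step = Union[ToolCall, FinalAnswer]
Policy = Callable[[list[str]], Step]   # stands in for the model's next-step decision
Tool = Callable[[str], str]


def run_agent(policy: Policy, tools: dict[str, Tool], task: str, max_steps: int = 5) -> str:
    """Ask the policy for a step, run the chosen tool, feed the result back, repeat."""
    transcript = [f"task: {task}"]
    for _ in range(max_steps):
        step = policy(transcript)
        if isinstance(step, FinalAnswer):
            return step.text
        tool = tools.get(step.name)
        if tool is None:
            transcript.append(f"error: unknown tool {step.name}")
            continue
        try:
            transcript.append(f"{step.name} -> {tool(step.argument)}")
        except Exception as exc:
            # Error recovery: record the failure so the policy can try again.
            transcript.append(f"error: {step.name} failed: {exc}")
    raise RuntimeError("agent exceeded max_steps without a final answer")


def divide(expression: str) -> str:
    left, right = expression.split("/")
    return str(int(left) / int(right))


def scripted_policy(transcript: list[str]) -> Step:
    """Deterministic stand-in for a model: fail once, retry, then answer."""
    last = transcript[-1]
    if last.startswith("task:"):
        return ToolCall("divide", "10/0")
    if last.startswith("error:"):
        return ToolCall("divide", "10/4")
    return FinalAnswer(last.split(" -> ")[1])


answer = run_agent(scripted_policy, {"divide": divide}, "divide ten by four")
```

The first tool call raises a division error, the loop records it in the transcript, and the policy recovers on the next iteration. The `max_steps` bound keeps a confused policy from looping forever.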

4. Memory Layer — `lexigram-ai-memory` + `lexigram-ai-session`

Conversation history and persistent knowledge.

Episodic memory: Per-conversation message history with summarization
Semantic memory: Cross-session facts, user preferences, learned knowledge
Session management: Conversation lifecycle, state persistence, expiry
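As an illustration of episodic memory with summarization and session expiry, consider the sketch below. The `Session` class, its fields, and `naive_summarize` are hypothetical and do not reflect the `lexigram-ai-memory` or `lexigram-ai-session` APIs; a real summarizer would most likely call a model rather than truncate strings.

```python
from dataclasses import dataclass, field
from typing import Callable

Summarizer = Callable[[str, list[str]], str]


@dataclass
class Session:
    session_id: str
    created_at: float          # seconds, e.g. from time.time()
    ttl_seconds: float
    max_messages: int = 4
    summary: str = ""
    messages: list[str] = field(default_factory=list)

    def is_expired(self, now: float) -> bool:
        """A session expires once its time-to-live has fully elapsed."""
        return now - self.created_at >= self.ttl_seconds

    def add(self, message: str, summarize: Summarizer) -> None:
        """Append a message; fold the oldest half into the summary when full."""
        self.messages.append(message)
        if len(self.messages) > self.max_messages:
            cut = len(self.messages) // 2
            self.summary = summarize(self.summary, self.messages[:cut])
            self.messages = self.messages[cut:]


def naive_summarize(previous: str, dropped: list[str]) -> str:
    """Placeholder summarizer: keep the first 20 characters of each message."""
    parts = ([previous] if previous else []) + [m[:20] for m in dropped]
    return " | ".join(parts)


session = Session("s1", created_at=0.0, ttl_seconds=60.0)
for text in ["m1", "m2", "m3", "m4", "m5"]:
    session.add(text, naive_summarize)
```

After the fifth message the two oldest entries are folded into `session.summary`, so the prompt context stays bounded while earlier turns remain available in compressed form.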

5. Integration Layer — `lexigram-ai-mcp`

Exposes AI capabilities as Model Context Protocol (MCP) tools, resources, and prompts.

MCP server: Wrap agents, RAG pipelines, and skills as MCP tools
MCP client: Connect to external MCP servers from within agents
Discovery: Dynamic tool registration and capability advertisement
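Dynamic registration and capability advertisement can be pictured with a plain in-process registry. The sketch deliberately avoids any MCP SDK: `ToolSpec`, `ToolRegistry`, and their methods are hypothetical, and the advertised dictionary only mirrors the general shape of an MCP tool descriptor (a name, a description, and a JSON Schema for inputs). It is not the `lexigram-ai-mcp` API.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class ToolSpec:
    name: str
    description: str
    input_schema: dict[str, Any]      # JSON Schema describing the arguments
    handler: Callable[..., Any]


class ToolRegistry:
    """Holds the tools a server currently offers; tools can come and go at runtime."""

    def __init__(self) -> None:
        self._tools: dict[str, ToolSpec] = {}

    def register(self, spec: ToolSpec) -> None:
        if spec.name in self._tools:
            raise ValueError(f"tool {spec.name!r} is already registered")
        self._tools[spec.name] = spec

    def unregister(self, name: str) -> None:
        self._tools.pop(name, None)

    def advertise(self) -> list[dict[str, Any]]:
        """Return descriptors for every registered tool, sorted by name."""
        return [
            {
                "name": spec.name,
                "description": spec.description,
                "inputSchema": spec.input_schema,
            }
            for spec in sorted(self._tools.values(), key=lambda s: s.name)
        ]

    def call(self, name: str, **arguments: Any) -> Any:
        return self._tools[name].handler(**arguments)


registry = ToolRegistry()
registry.register(
    ToolSpec(
        name="search",
        description="Search indexed documents",
        input_schema={"type": "object", "properties": {"query": {"type": "string"}}},
        handler=lambda query: [query.upper()],   # placeholder for a RAG pipeline
    )
)
```

A client that calls `registry.advertise()` again after a later `register` or `unregister` sees the updated tool list, which is the essence of dynamic discovery.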

6. Infrastructure Layer — `lexigram-ai-workers` + `lexigram-ai-observability` + `lexigram-ai-feedback`

Production-grade AI infrastructure.

Background processing: Async task execution for embedding, indexing, and batch inference
Observability: Token usage logging, latency tracking, and cost attribution per user and per conversation
Feedback loops: User feedback collection and data pipelines for preference fine-tuning
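Background processing of embedding work might look like the following `asyncio` sketch. It is an assumption-laden illustration rather than the `lexigram-ai-workers` design: `fake_embed` is a placeholder for a real embedding call, and `worker` and `embed_in_background` are hypothetical names.

```python
import asyncio


async def fake_embed(batch: list[str]) -> list[list[float]]:
    """Placeholder embedding call: one-dimensional vectors based on text length."""
    await asyncio.sleep(0)   # yield control, as a real network call would
    return [[float(len(text))] for text in batch]


async def worker(queue: asyncio.Queue, results: dict[str, list[float]], batch_size: int) -> None:
    """Pull texts off the queue, embed them in small batches, store the vectors."""
    while True:
        batch = [await queue.get()]
        # Opportunistically fill the batch with whatever is already waiting.
        while len(batch) < batch_size and not queue.empty():
            batch.append(queue.get_nowait())
        vectors = await fake_embed(batch)
        for text, vector in zip(batch, vectors):
            results[text] = vector
        for _ in batch:
            queue.task_done()


async def embed_in_background(
    texts: list[str], workers: int = 2, batch_size: int = 2
) -> dict[str, list[float]]:
    queue: asyncio.Queue = asyncio.Queue()
    for text in texts:
        queue.put_nowait(text)
    results: dict[str, list[float]] = {}
    tasks = [asyncio.create_task(worker(queue, results, batch_size)) for _ in range(workers)]
    await queue.join()            # wait until every queued text is processed
    for task in tasks:
        task.cancel()             # workers loop forever, so stop them explicitly
    await asyncio.gather(*tasks, return_exceptions=True)
    return results


vectors = asyncio.run(embed_in_background(["a", "bb", "ccc"]))
```

Batching amortizes per-request overhead, and the queue decouples request handling from slow embedding calls. A production worker would also need retries and persistence, which this sketch omits.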

Dependency Direction

Each layer depends only on the layers below it. The base layer depends only on lexigram and lexigram-contracts. The integration layer can optionally consume any layer below it, but never introduces upward dependencies.
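The direction rule lends itself to an automated check. The sketch below is hypothetical: the layer order is taken from the diagram at the top of this page (base at the bottom, integration at the top), while the `LAYERS` list, the `upward_dependencies` function, and the sample dependency maps are invented for illustration.

```python
# Layers ordered from lowest to highest, following the diagram on this page.
LAYERS = ["base", "knowledge", "reasoning", "memory", "infrastructure", "integration"]


def upward_dependencies(deps: dict[str, set[str]]) -> list[tuple[str, str]]:
    """Return every (layer, dependency) pair that points sideways or upward.

    `deps` maps a layer name to the set of layers it depends on. A valid
    architecture yields an empty list.
    """
    rank = {name: index for index, name in enumerate(LAYERS)}
    return sorted(
        (source, target)
        for source, targets in deps.items()
        for target in targets
        if rank[target] >= rank[source]
    )


valid = {
    "knowledge": {"base"},
    "reasoning": {"knowledge", "base"},
    "integration": {"memory", "reasoning", "base"},
}
invalid = {"base": {"reasoning"}, "memory": {"integration"}}

violations = upward_dependencies(invalid)
```

Running a check like this in continuous integration is one way to catch an accidental upward dependency before it lands.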

See the Packages reference for individual package APIs and the Choosing Backends guide for vector store comparisons.

For version requirements and known constraints, refer to Compatibility & Dependencies.
