Spring AI Chat Bot CLI

This Spring Boot, CLI, application demonstrates how to create an AI-powered chatbot with domain-specific knowledge (in this case, about Hurricane Milton) using Spring AI, Retrieval-Augmented Generation RAG and Conversational Memory.

Application uses the Hurricane_Milton wikipage saved as rag/wikipedia-hurricane-milton.pdf.

Requirements

Java 21
Spring Boot 4.0.1
Spring AI 2.0.0-SNAPSHOT

ChatBot Application

quick build run the app.

./mvnw clean install
./mvnw spring-boot:run

Auto-configurations

AI Model

By default, this project uses Anthropic's Spring Boot starter (spring-ai-starter-model-anthropic) with the Claude model. However, you can easily switch to any other supported AI model. The pom.xml file provides few alternative AI model dependencies (OpenAI, Ollama). (Note: Most models, except Ollama/Llama3.2, require an API key for access.) Configure your API key and other model properties in the application.properties file. The Chat Model API lists all supported models.

Embedding Model

The project uses the Transformers (ONNX) embedding model (spring-ai-starter-model-transformers) for generating embeddings. This works well in combination with the Anthropic Claude model.

Vector Store

The project is configured to use Chroma (spring-ai-starter-vector-store-chroma) as a vector store, running locally via Docker (Chroma version 1.0.0). A docker-compose.yaml file is provided to start a local Chroma instance. The project is configured with Spring Boot Docker Compose integration for easy setup. (e.g. you don't have to start the docker-compose manually). Find more about Vector Stores

RAG Advisor

The project uses the spring-ai-advisors-vector-store dependency to provide RAG capabilities through the QuestionAnswerAdvisor.

PDF Document Processing

PDF document reading capability is enabled through the spring-ai-pdf-document-reader dependency. The PDF documents are located in the src/main/resources/rag/ directory. Find more about the Spring AI document indexing support

CommandLineRunner

CommandLineRunner created by the cli Bean, is a Spring Boot interface for running code after the application context is loaded. This is the entry point of our chatbot the application.

Vector Store Loading

vectorStore.add(new TokenTextSplitter().split(new PagePdfDocumentReader(hurricaneMilton).read()));

This line reads a PDF document about Hurricane Milton, splits it into tokens, and adds it to a vector store. This is part of the RAG setup, allowing the chatbot to retrieve relevant information.

ChatClient Configuration

SearchRequest searchRequest = SearchRequest.builder()
    .topK(3)
    .build();

var chatClient = chatClientBuilder
    .defaultSystem("You are useful assistant, expert in hurricanes. Be friendly")
    .defaultAdvisors(MessageChatMemoryAdvisor.builder(MessageWindowChatMemory.builder().maxMessages(500).build()).build())
    .defaultAdvisors(QuestionAnswerAdvisor.builder(this.vectorStore).searchRequest(searchRequest).build())
    .build();

Here, a ChatClient is built with the following configurations:

A system prompt defining the assistant's role
A MessageChatMemoryAdvisor with MessageWindowChatMemory for maintaining conversation history (up to 500 messages)
A QuestionAnswerAdvisor that uses the vector store for RAG capabilities with configurable search parameters

Chat Loop

try (Scanner scanner = new Scanner(System.in)) {
    while (true) {
        System.out.print("\nUSER: ");
        System.out.println("\nASSISTANT: " + 
            chatClient.prompt(scanner.nextLine())
                .call()
                .content());
    }
}

This creates an infinite loop that:

Prompts the user for input
Sends the user's input to the chatbot
Prints the chatbot's response

The chatbot uses the configured ChatClient, which incorporates the conversation history and RAG capabilities to generate responses.

Key Features

RAG Implementation: The application uses a vector store to implement RAG, allowing the chatbot to retrieve relevant information from the loaded document.
Conversation Memory: The MessageChatMemoryAdvisor with MessageWindowChatMemory enables the chatbot to remember previous interactions within the conversation.
PDF Document Processing: The application can read and process PDF documents, making the information available to the chatbot.
Interactive Console Interface: The application provides a simple console-based interface for interacting with the chatbot.
Configurable Search: The RAG advisor supports configurable search parameters through SearchRequest.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.mvn/wrapper		.mvn/wrapper
src/main		src/main
.gitignore		.gitignore
README.md		README.md
docker-compose.yaml		docker-compose.yaml
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Spring AI Chat Bot CLI

Requirements

ChatBot Application

Auto-configurations

AI Model

Embedding Model

Vector Store

RAG Advisor

PDF Document Processing

CommandLineRunner

Vector Store Loading

ChatClient Configuration

Chat Loop

Key Features

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

tzolov/spring-ai-cli-chatbot

Folders and files

Latest commit

History

Repository files navigation

Spring AI Chat Bot CLI

Requirements

ChatBot Application

Auto-configurations

AI Model

Embedding Model

Vector Store

RAG Advisor

PDF Document Processing

CommandLineRunner

Vector Store Loading

ChatClient Configuration

Chat Loop

Key Features

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages