Chat

node-llm provides a unified chat interface across all supported providers (OpenAI, Gemini, Anthropic). It normalizes the differences between their APIs, so you can use a single set of methods no matter which model you target.

Starting a Conversation

The core entry point is LLM.chat(model_id, options?).

import { LLM } from "@node-llm/core";

// Create a chat instance
const chat = LLM.chat("gpt-4o-mini");

// Ask a question
const response = await chat.ask("What is the capital of France?");

console.log(response.content); // "The capital of France is Paris."

Continuing the Conversation

The chat object maintains a history of the conversation, so you can ask follow-up questions naturally.

await chat.ask("What is the capital of France?");
// => "Paris"

await chat.ask("What is the population there?");
// => "The population of Paris is approximately..."

System Prompts (Instructions)

Use system prompts to guide the AI’s behavior, personality, or constraints. You can set one when creating the chat or update it later.

// Option 1: Set at initialization
const chat = LLM.chat("gpt-4o", {
  systemPrompt: "You are a helpful assistant that answers in rhyming couplets."
});

// Option 2: Set or update later
chat.withInstructions("Now speak like a pirate.");

await chat.ask("Hello");
// => "Ahoy matey! The seas are calm today."

Custom HTTP Headers

Some providers gate beta features behind specific headers, and some setups (such as observability proxies) require custom headers of their own.

// Enable Anthropic's beta features
const chat = LLM.chat("claude-3-5-sonnet")
  .withRequestOptions({
    headers: {
      "anthropic-beta": "max-tokens-3-5-sonnet-2024-07-15"
    }
  });

await chat.ask("Tell me about the weather");

Raw Content Blocks (Advanced)

For advanced use cases like Anthropic Prompt Caching, you can pass provider-specific content blocks directly. node-llm attempts to pass array content through to the provider.

// Example: Anthropic Prompt Caching
const systemBlock = {
  type: "text",
  text: "You are a coding assistant. (Cached context...)",
  cache_control: { type: "ephemeral" }
};

const chat = LLM.chat("claude-3-5-sonnet", {
  systemPrompt: systemBlock as any // Cast if strict types complain
});
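
Raw blocks can also appear in regular messages. Assuming ask() forwards array content through unchanged (as described above), a cached document plus a follow-up question might look like the sketch below; the exact accepted shape may vary, so check your provider's content-block format:

const LARGE_DOC = "..."; // large, frequently reused context worth caching

await chat.ask([
  {
    type: "text",
    text: LARGE_DOC,
    cache_control: { type: "ephemeral" }
  },
  { type: "text", text: "Summarize the document above." }
] as any);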

Working with Different Models

Switching providers is as simple as changing the model ID string. node-llm automatically detects which provider to use based on the model ID.

// OpenAI
const gpt = LLM.chat("gpt-4o");

// Anthropic
const claude = LLM.chat("claude-3-5-sonnet-20241022");

// Gemini
const gemini = LLM.chat("gemini-1.5-pro");
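
Because the interface is identical everywhere, provider-agnostic helpers fall out for free. A quick sketch that runs one prompt against several models:

async function compare(prompt: string, models: string[]) {
  for (const model of models) {
    // Same call path regardless of provider.
    const res = await LLM.chat(model).ask(prompt);
    console.log(`${model}: ${res.content}`);
  }
}

await compare("Summarize HTTP/3 in one sentence.", [
  "gpt-4o",
  "claude-3-5-sonnet-20241022",
  "gemini-1.5-pro"
]);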

Temperature & Creativity

Adjust the randomness of the model’s responses with .withTemperature(value), where value ranges from 0.0 (deterministic) to 1.0 (more random).

// Deterministic / Factual (Low Temperature)
const factual = LLM.chat("gpt-4o").withTemperature(0.0);

// Creative / Random (High Temperature)
const creative = LLM.chat("gpt-4o").withTemperature(0.9);
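
To see the effect, send the same prompt through both instances. The low-temperature chat should answer almost identically across runs, while the high-temperature one will vary:

const prompt = "Suggest a name for a coffee shop.";

console.log((await factual.ask(prompt)).content);  // stable between runs
console.log((await creative.ask(prompt)).content); // differs between runs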

Lifecycle Events

Hook into the chat lifecycle for logging, UI updates, audit trails, or debugging.

chat
  .onNewMessage(() => console.log("AI started typing..."))
  .onToolCall((tool) => console.log(`Calling tool: ${tool.function.name}`))
  .onToolResult((result) => console.log(`Tool result: ${result}`))
  .onEndMessage((response) => {
    console.log(`Finished. Total tokens: ${response.total_tokens}`);
  });

await chat.ask("What's the weather?");

Next Steps