DeepCitation + LangChain

Add verifiable citations to any LangChain RAG pipeline. DeepCitation fits as a pre/post step around your existing chain — no restructuring required.

Target use case: Backend RAG pipelines (legal, medical, financial AI) where you need deterministic citation verification, not just retrieval similarity scores.

How DC Fits Into a LangChain Pipeline

In a typical LangChain pipeline you load documents, retrieve relevant chunks, and pass context to an LLM. DeepCitation replaces the context-injection step with its own document processing, then verifies the LLM’s citations after generation:

Standard LangChain:        Load → Retrieve → Prompt → LLM → Output
With DeepCitation:   Load ──────────────────────── DC prepares → Prompt → LLM → Output → DC verifies

Key difference from retrieval: LangChain retrieval returns chunks by similarity. DeepCitation processes the entire source document with internal line IDs, then verifies that the LLM’s citations point to real text at that exact location. You get a proof image, not just a similarity score.

Prerequisites

npm install deepcitation langchain @langchain/openai

# .env
DEEPCITATION_API_KEY=dc_live_YOUR_API_KEY
OPENAI_API_KEY=sk-your-key

Complete Pipeline Example

This is a self-contained, runnable pipeline. It loads a PDF, prepares it for citation, runs a LangChain chat model, and verifies the output.

import { readFileSync } from "node:fs";
import { ChatOpenAI } from "@langchain/openai";
import { HumanMessage, SystemMessage } from "@langchain/core/messages";
import {
  DeepCitation,
  wrapCitationPrompt,
  getAllCitationsFromLlmOutput,
} from "deepcitation";

const dc = new DeepCitation({ apiKey: process.env.DEEPCITATION_API_KEY! });

async function answerWithCitations(pdfPath: string, question: string) {
  // 1. Read the source document as a Buffer
  //    DC needs the raw file to extract text with internal line IDs.
  //    LangChain Document objects (from loaders) don't contain this —
  //    you must pass the original file.
  const fileBuffer = readFileSync(pdfPath);

  // 2. Upload to DeepCitation
  //    Returns deepTextPages: the raw document pages for citation-aware
  //    prompting, and fileDataParts for verification.
  const { fileDataParts, deepTextPages } = await dc.prepareAttachments([
    { file: fileBuffer, filename: pdfPath.split("/").pop()! },
  ]);

  // 3. Wrap your prompts with citation instructions
  //    This injects the document content + citation format rules into the prompt.
  const { enhancedSystemPrompt, enhancedUserPrompt } = wrapCitationPrompt({
    systemPrompt:
      "You are a precise research assistant. Answer questions based only on the provided documents.",
    userPrompt: question,
    deepTextPages,
  });

  // 4. Call your LangChain model — no special DC integration needed here
  const model = new ChatOpenAI({ model: "gpt-4o-mini", temperature: 0 });

  const response = await model.invoke([
    new SystemMessage(enhancedSystemPrompt),
    new HumanMessage(enhancedUserPrompt),
  ]);

  const llmOutput = response.content as string;

  // 5. Extract and verify citations
  //    getAllCitationsFromLlmOutput parses numeric [N] markers from the LLM response's <<<CITATION_DATA>>> block.
  //    verifyAttachment checks each citation against the source document.
  const citations = getAllCitationsFromLlmOutput(llmOutput);
  const citationCount = Object.keys(citations).length;

  if (citationCount === 0) {
    return { llmOutput, citations: {}, verifications: {} };
  }

  const { verifications } = await dc.verifyAttachment(
    fileDataParts[0].attachmentId,
    citations,
    { outputImageFormat: "webp" },
  );

  return { llmOutput, citations, verifications };
}

// Usage
const result = await answerWithCitations(
  "./contracts/service-agreement.pdf",
  "What are the termination conditions?",
);

console.log(result.llmOutput);

for (const [key, verification] of Object.entries(result.verifications)) {
  console.log(`[${key}] status=${verification.status}`);

  if (verification.evidence?.src) {
    // Save or serve the visual proof image
    console.log(`  proof image available`);
  }
}

RunnableSequence Integration

If you’re building a reusable chain, wrap the DC steps around a RunnableSequence. The pre-step (document preparation) runs before the chain and passes attachment context through the chain’s input.

import { RunnableSequence, RunnableLambda } from "@langchain/core/runnables";
import { StringOutputParser } from "@langchain/core/output_parsers";
import { ChatPromptTemplate } from "@langchain/core/prompts";
import { ChatOpenAI } from "@langchain/openai";
import {
  DeepCitation,
  wrapCitationPrompt,
  getAllCitationsFromLlmOutput,
  type CitationRecord,
  type VerificationRecord,
} from "deepcitation";

const dc = new DeepCitation({ apiKey: process.env.DEEPCITATION_API_KEY! });

interface PipelineInput {
  question: string;
  // Passed in from the pre-step (document preparation)
  deepTextPages: string[];
  attachmentId: string;
}

interface PipelineOutput {
  answer: string;
  citations: CitationRecord;
  verifications: VerificationRecord;
}

// The inner chain handles prompt formatting + LLM call
const citationChain = RunnableSequence.from([
  // Enhance the prompt with citation instructions
  new RunnableLambda({
    func: (input: PipelineInput) => {
      const { enhancedSystemPrompt, enhancedUserPrompt } = wrapCitationPrompt({
        systemPrompt:
          "You are a precise research assistant. Cite sources for every factual claim.",
        userPrompt: input.question,
        deepTextPages: input.deepTextPages,
      });
      return {
        system: enhancedSystemPrompt,
        human: enhancedUserPrompt,
        attachmentId: input.attachmentId,
      };
    },
  }),
  // Call the model
  ChatPromptTemplate.fromMessages([
    ["system", "{system}"],
    ["human", "{human}"],
  ]),
  new ChatOpenAI({ model: "gpt-4o-mini", temperature: 0 }),
  new StringOutputParser(),
]);

// Full pipeline: prepare → chain → verify
async function runCitationPipeline(
  fileBuffer: Buffer,
  filename: string,
  question: string,
): Promise<PipelineOutput> {
  // Pre-step: prepare DC attachment (runs before the chain)
  const { fileDataParts, deepTextPages } = await dc.prepareAttachments([
    { file: fileBuffer, filename },
  ]);

  const attachmentId = fileDataParts[0].attachmentId;

  // Run the inner chain
  const answer = await citationChain.invoke({
    question,
    deepTextPages,
    attachmentId,
  });

  // Post-step: verify citations (runs after the chain)
  const citations = getAllCitationsFromLlmOutput(answer);
  const citationCount = Object.keys(citations).length;

  if (citationCount === 0) {
    return { answer, citations: {}, verifications: {} };
  }

  const { verifications } = await dc.verifyAttachment(attachmentId, citations);

  return { answer, citations, verifications };
}

LangChain Documents vs. DeepCitation Attachments

	LangChain `Document`	DeepCitation attachment
Created by	`DocumentLoader`	`dc.prepareAttachments()`
Contents	`pageContent` (text chunks), `metadata`	Processed text with internal line IDs
Used for	Retrieval, embedding, context injection	Citation verification against exact source positions
Citation verification	Not supported	Yes — exact text + visual proof

Can you pass LangChain Document objects directly to DeepCitation? No. DC needs the raw source file to extract its internal line ID structure. LangChain’s document objects contain already-parsed text without the positional metadata DC requires.

Can you use both in the same pipeline? Yes. Use LangChain’s retriever to find relevant documents, then load those specific files as Buffers and pass to dc.prepareAttachments(). The DC-processed documents replace the retrieval context for citation-verified answers.

Multiple Documents

Pass multiple files to prepareAttachments in a single call. DeepCitation returns a deepTextPagesByAttachmentId map so each attachment stays explicit and order-independent:

import { groupCitationsByAttachmentId } from "deepcitation";

const { fileDataParts, deepTextPagesByAttachmentId } = await dc.prepareAttachments([
  { file: contractBuffer, filename: "contract.pdf" },
  { file: invoiceBuffer, filename: "invoice.pdf" },
]);

const { enhancedSystemPrompt, enhancedUserPrompt } = wrapCitationPrompt({
  systemPrompt: "You are a document analyst. Cite sources for every claim.",
  userPrompt: "What are the total costs and payment terms?",
  deepTextPagesByAttachmentId,
});

const model = new ChatOpenAI({ model: "gpt-4o-mini" });
const response = await model.invoke([
  new SystemMessage(enhancedSystemPrompt),
  new HumanMessage(enhancedUserPrompt),
]);

const citations = getAllCitationsFromLlmOutput(response.content as string);

// Citations from multiple docs — verify each attachment separately
const citationsByAttachment = groupCitationsByAttachmentId(citations);

const verificationResults = await Promise.all(
  Array.from(citationsByAttachment.entries()).map(([attachmentId, attachmentCitations]) =>
    dc.verifyAttachment(attachmentId, attachmentCitations),
  ),
);

Streaming with LangChain

LangChain supports streaming via .stream(). Citation verification requires the complete LLM output — collect the stream first, then verify:

import { concat } from "@langchain/core/utils/stream";

const model = new ChatOpenAI({ model: "gpt-4o-mini", streaming: true });

// Stream and collect
const stream = await model.stream([
  new SystemMessage(enhancedSystemPrompt),
  new HumanMessage(enhancedUserPrompt),
]);

let gathered;
for await (const chunk of stream) {
  // Forward chunk to your client while collecting
  process.stdout.write(chunk.content as string);
  gathered = gathered !== undefined ? concat(gathered, chunk) : chunk;
}

const llmOutput = gathered!.content as string;

// Now verify the complete output
const citations = getAllCitationsFromLlmOutput(llmOutput);
const { verifications } = await dc.verifyAttachment(attachmentId, citations);

Next Steps

Next.js App Router guide — server/client boundary patterns for React
API Reference — full prepareAttachments and verifyAttachment options
Verification Statuses — understanding isVerified, isMiss, isPartialMatch