Send YouTube Transcripts to LlamaIndex
Get any YouTube transcript into LlamaIndex in seconds
Or just change youtube.com to 2outube.com in your browser
Swap 'youtube.com' to '2outube.com' in any video URL to instantly grab the full transcript. Paste it into LlamaIndex as a document and start querying, summarizing, or building a RAG pipeline over YouTube content — no API keys needed.
The Trick
youtube.com/watch?v=VIDEO_ID
2outube.com/watch?v=VIDEO_ID
Just change 'y' to '2'
Works with any YouTube video that has captions
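If you are scripting this rather than editing the URL by hand, the swap is a one-line string replacement. A minimal sketch (the helper name is ours, not part of any library):

```python
def to_2outube(url: str) -> str:
    """Rewrite a YouTube watch URL to its 2outube transcript page.

    Only the hostname changes; the video ID and any other query
    parameters are left intact.
    """
    return url.replace("youtube.com", "2outube.com", 1)

print(to_2outube("https://youtube.com/watch?v=abc123"))
# https://2outube.com/watch?v=abc123
```

Note that youtu.be short links use a different hostname, so expand them to the full youtube.com/watch form first.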
Using Transcripts with LlamaIndex
Grab the transcript from 2outube
Find the YouTube video you want to index. Replace 'youtube.com' with '2outube.com' in the URL and hit enter. The full transcript appears on the page — copy it all with one click.
Create a LlamaIndex Document
In your Python environment, import Document from llama_index.core and wrap the transcript text: doc = Document(text=transcript_text, metadata={'source': 'youtube', 'video_id': 'VIDEO_ID'}). This preserves source attribution in your index.
Build a VectorStoreIndex
Pass the document to VectorStoreIndex.from_documents([doc]) to chunk, embed, and index the transcript automatically. LlamaIndex handles tokenization and embedding using your configured LLM and embedding model.
Query over the video content
Create a query engine with index.as_query_engine() and start asking questions: response = query_engine.query('What are the key takeaways from this video?'). LlamaIndex retrieves relevant chunks and synthesizes an answer grounded in the transcript.
Quick Start
Get the transcript
Navigate to the YouTube video you want to work with and copy its URL from the browser address bar.
Change youtube to 2outube
In the URL, replace 'youtube.com' with '2outube.com' — for example, youtube.com/watch?v=abc123 becomes 2outube.com/watch?v=abc123. The full transcript loads instantly.
Index it in LlamaIndex
Copy the transcript text and pass it to LlamaIndex as a Document. Use VectorStoreIndex.from_documents() to build a queryable index, then call as_query_engine() to start asking questions over the video content.
Ready-Made Template
from llama_index.core import VectorStoreIndex, Document
# Paste transcript from 2outube.com/watch?v=VIDEO_ID
transcript_text = """
[PASTE TRANSCRIPT HERE]
"""
# Create document with metadata
doc = Document(
    text=transcript_text,
    metadata={
        "source": "youtube",
        "video_id": "VIDEO_ID",
        "video_url": "https://youtube.com/watch?v=VIDEO_ID",
        "retrieved_via": "2outube.com",
    },
)
# Build index
index = VectorStoreIndex.from_documents([doc])
# Query the video
query_engine = index.as_query_engine()
response = query_engine.query("What are the main points covered in this video?")
print(response)
Questions
Does this work with any YouTube video?
Yes, any video with captions. This includes auto-generated captions and manually added subtitles. If a video has no captions at all, no transcript will be available.
Is it really free?
Completely free. No account, no limits, no API key required for 2outube. You just change the URL and get the transcript.
Can I index multiple YouTube videos into the same LlamaIndex index?
Yes. Grab transcripts from multiple videos via 2outube, create a separate Document for each, and pass them all as a list to VectorStoreIndex.from_documents([doc1, doc2, doc3]). LlamaIndex will chunk and index all of them together, and metadata fields like video_id help you trace which chunk came from which video.
What LlamaIndex version does this work with?
This works with llama-index-core 0.10+ (the modular package structure). On older versions that use the monolithic llama-index package, the same Document and VectorStoreIndex classes apply; only the import paths differ slightly (from llama_index import ... instead of from llama_index.core import ...).
Does the transcript include timestamps?
2outube provides the full transcript text. If you need precise timestamps for chunk metadata, you can use the raw transcript format which includes timing data — useful for linking LlamaIndex responses back to specific moments in the video.
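If you do pull the timed format, you can split it into (timestamp, text) segments before indexing and store the timestamp in each chunk's metadata. A sketch under an assumed layout (one caption per line, prefixed with MM:SS or H:MM:SS; adjust the pattern to whatever the raw view actually emits):

```python
import re

# Matches a leading "MM:SS" or "H:MM:SS" timestamp followed by caption text.
LINE = re.compile(r"^(\d{1,2}(?::\d{2}){1,2})\s+(.*)$")

def parse_timed_transcript(raw: str) -> list[tuple[str, str]]:
    """Split a timestamped transcript into (timestamp, text) pairs,
    skipping any lines that don't carry a timestamp."""
    segments = []
    for line in raw.splitlines():
        m = LINE.match(line.strip())
        if m:
            segments.append((m.group(1), m.group(2)))
    return segments

raw = """0:00 Welcome to the video
0:15 Today we cover indexing transcripts
1:02 First, grab the transcript text"""
print(parse_timed_transcript(raw))
```

Each segment can then become its own Document (or node metadata field), letting query responses cite the moment in the video they came from.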
What embedding model should I use for YouTube transcripts in LlamaIndex?
OpenAI's text-embedding-3-small is a solid default for most transcript use cases. For local/offline indexing, nomic-embed-text via Ollama works well. LlamaIndex lets you configure this with Settings.embed_model before building your index.
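As a config fragment, the two setups mentioned above look roughly like this (each requires its own integration package, llama-index-embeddings-openai or llama-index-embeddings-ollama, and is set before building the index):

```python
from llama_index.core import Settings

# Hosted option: OpenAI embeddings (needs OPENAI_API_KEY in the environment)
from llama_index.embeddings.openai import OpenAIEmbedding
Settings.embed_model = OpenAIEmbedding(model="text-embedding-3-small")

# Local option: nomic-embed-text served by a running Ollama instance
# from llama_index.embeddings.ollama import OllamaEmbedding
# Settings.embed_model = OllamaEmbedding(model_name="nomic-embed-text")
```

Whatever you assign to Settings.embed_model is picked up automatically by VectorStoreIndex.from_documents().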
How do I handle long transcripts that exceed token limits?
LlamaIndex automatically chunks documents using its node parser before indexing. The default chunk size is 1024 tokens with 20-token overlap. For very long transcripts (lectures, full courses), you can tune this with SentenceSplitter(chunk_size=512, chunk_overlap=50) for more granular retrieval.
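To make the chunk_size/chunk_overlap interplay concrete, here is a plain-Python sketch of the sliding-window idea (words stand in for tokens; SentenceSplitter itself is also sentence-aware, which this toy version is not):

```python
def chunk_with_overlap(words: list[str], chunk_size: int, overlap: int) -> list[list[str]]:
    """Sliding-window chunking: each chunk repeats the last `overlap`
    words of its predecessor, so context isn't lost at chunk boundaries."""
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(words[start:start + chunk_size])
        if start + chunk_size >= len(words):
            break
    return chunks

words = [f"w{i}" for i in range(10)]
for chunk in chunk_with_overlap(words, chunk_size=4, overlap=1):
    print(chunk)
```

Smaller chunks with more overlap give finer-grained retrieval at the cost of more embeddings, which is why long lecture transcripts often benefit from tuning these two numbers.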
Can I build a RAG chatbot over a YouTube channel using this method?
Yes. Collect video IDs from a channel, grab each transcript via 2outube, create one Document per video, and index them all together. Add the video title and URL to each Document's metadata so your chatbot can cite sources. This is a common pattern for building knowledge bases from educational YouTube channels.
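The per-channel loop can be sketched as below. fetch_transcript is a hypothetical helper you supply (pasting from 2outube, or your own scraper); each returned record is ready to wrap via Document(text=r["text"], metadata=r["metadata"]):

```python
def build_video_records(videos, fetch_transcript):
    """videos: list of (video_id, title) pairs.
    Returns one record per video with citation-friendly metadata."""
    records = []
    for video_id, title in videos:
        records.append({
            "text": fetch_transcript(video_id),
            "metadata": {
                "source": "youtube",
                "video_id": video_id,
                "title": title,
                "video_url": f"https://youtube.com/watch?v={video_id}",
            },
        })
    return records

# Stub fetcher so the loop is runnable without network access
fake_fetch = lambda vid: f"transcript of {vid}"
records = build_video_records([("abc123", "Intro"), ("def456", "Deep dive")], fake_fetch)
print(len(records), records[0]["metadata"]["title"])
```

Because every chunk inherits its Document's metadata, a chatbot built on this index can cite the source video's title and URL in its answers.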
Start indexing YouTube videos in LlamaIndex — free
Free, no signup required
Try It Free