YoutubeTranscriptReader

pydantic model llama_index.readers.YoutubeTranscriptReader

Youtube Transcript reader.

JSON schema
{
   "title": "YoutubeTranscriptReader",
   "description": "Youtube Transcript reader.",
   "type": "object",
   "properties": {
      "is_remote": {
         "title": "Is Remote",
         "default": true,
         "type": "boolean"
      },
      "languages": {
         "title": "Languages",
         "default": [
            "en"
         ],
         "type": "array",
         "items": {}
      },
      "class_name": {
         "title": "Class Name",
         "type": "string",
         "default": "YoutubeTranscriptReader"
      }
   }
}

Config
  • arbitrary_types_allowed: bool = True

Fields
  • is_remote (bool)

  • languages (tuple)

field is_remote: bool = True
field languages: tuple = ('en',)
classmethod class_name() → str

Get the class name, used as a unique ID in serialization.

This provides a key that makes serialization robust against actual class name changes.

load_data(ytlinks: List[str], **load_kwargs: Any) → List[Document]

Load data from the input links.

Parameters

ytlinks (List[str]) – List of YouTube links whose transcripts are to be read.
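
A minimal usage sketch, assuming the llama_index package and its transcript dependency (youtube_transcript_api) are installed and that network access is available. The helper name load_transcripts and the placeholder link are illustrative and not part of the library. The import is deferred into the function body so the helper can be defined even where llama_index is absent.

```python
from typing import Any, List, Sequence


def load_transcripts(
    ytlinks: List[str],
    languages: Sequence[str] = ("en",),
) -> List[Any]:
    """Hypothetical helper (not part of llama_index): load transcripts for the given links.

    Returns whatever ``load_data`` returns, which the reference above
    documents as ``List[Document]``.
    """
    # Deferred import: only needed when the helper is actually called.
    from llama_index.readers import YoutubeTranscriptReader

    # ``languages`` is a tuple-typed field on the reader (default ``('en',)``).
    reader = YoutubeTranscriptReader(languages=tuple(languages))

    # ``ytlinks`` is the parameter name in the documented signature.
    return reader.load_data(ytlinks=ytlinks)


# Example call (placeholder URL; replace VIDEO_ID with a real video ID):
# documents = load_transcripts(["https://www.youtube.com/watch?v=VIDEO_ID"])
```

Passing several language codes, for example languages=("en", "de"), should set the reader's languages field accordingly; how the reader chooses among them is not stated in this reference.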