@drorgl/youtube-captions-scraper
TypeScript icon, indicating that this package has built-in type declarations

1.0.3 • Public • Published

Youtube Captions Scraper

Note: This is forked from syzer/youtube-captions-scraper, typescriptified and added a few other options

  • Fetch Caption Tracks
  • Fetch Subtitles by language
  • Fetch Auto-Generated Captions
  • Fetch Full Text of Captions
  • Optional external HTTP requester

Installation

> npm install @drorgl/youtube-captions-scraper

Usage

import {
  YoutubeCaptions, 
  getSubtitles, 
  getSubtitlesContent 
  } from 'youtube-captions-scraper';

//Get Subtitles for Video
getSubtitles({
  videoID: 'XXXXX', // youtube video id
  lang: 'fr' // default: `en`
}).then(captions => {
  console.log(captions);
});

//Get full transcription as one string
const subtitles = await getSubtitlesContent({ videoID: "P-ygmGxuiEI" });

let youtubeCaptions = new YoutubeCaptions('XXXXX' /*youtube video id*/);

//retrieve caption tracks
let captionTracks = await youtubeCaptions.getCaptionTracks();

//retrieve subtitles by language
let subtitles = await youtubeCaptions.getSubtitles('en' /*optional language*/);

Captions will be an array of object of this format:

{
  "start": Number,
  "dur": Number,
  "text": String, //HTML stripped subtitles
  "htmlText":String
}

Readme

Keywords

none

Package Sidebar

Install

npm i @drorgl/youtube-captions-scraper

Weekly Downloads

0

Version

1.0.3

License

MIT

Unpacked Size

23.1 kB

Total Files

26

Last publish

Collaborators

  • drorgl