Facebook Video Transcript Extractor

Extract transcripts from Facebook video

linen_snack

$0.002

This Actor extracts transcripts from Facebook video pages. It's designed to help you obtain text transcripts from videos posted on Facebook's platform.

Features

Extracts transcript data from Facebook video pages
Handles proper request headers to mimic a real browser
Provides detailed error reporting
Works with Apify proxy to avoid IP blocks and rate limiting
Simple configuration through INPUT_SCHEMA

Usage

Input Configuration

The Actor accepts the following input parameters:

Field	Type	Description
`url`	String	Required URL of the Facebook video page from which to extract the transcript
`proxyConfiguration`	Object	Optional proxy settings to route requests through

Example input:

1{
2  "url": "https://web.facebook.com/briantylercohen/videos/1350752639547526",
3  "proxyConfiguration": {
4    "useApifyProxy": true
5  }
6}

Running the Actor

Apify Platform: The easiest way to run the Actor is through the Apify platform. Just search for "Facebook Video Transcript Extractor" in the Apify Store.
Command Line (via Apify CLI):
```
apify run -p
```
API: You can also run the Actor programmatically via the Apify API.

Output

The Actor saves extracted transcripts to the default dataset. Each item in the dataset has the following structure:

1{
2  "url": "https://web.facebook.com/briantylercohen/videos/1350752639547526",
3  "transcript": "This is the extracted transcript text...",
4}

In case of errors or if no transcript is found, the output will look like:

1{
2  "url": "https://web.facebook.com/briantylercohen/videos/1350752639547526",
3  "transcript": null,
4  "error": "Error message or 'No transcript found in the page'"
5}

Limitations

This Actor relies on the current structure of Facebook's video pages. If Facebook changes their page structure or how transcripts are embedded, the Actor may need to be updated.
Facebook may rate-limit or block requests that appear automated. Using the Apify proxy helps mitigate this issue.
Not all Facebook videos have transcripts available.

Technical Details

The Actor performs the following steps:

Takes the input URL and configures the HTTP request with browser-like headers
Fetches the HTML content of the Facebook video page
Parses the page to locate script tags containing transcript data
Extracts the transcript using a regex pattern
Saves the results to the Apify dataset

Dependencies

axios: For making HTTP requests
jsdom: For parsing and traversing the HTML
apify: The Apify SDK for integrating with the Apify platform

License

This project is licensed under the Apache License 2.0.

Variation 1: Standard & Clean alt text

Variation 2: Minimalist Flat alt text

Variation 3: Vector Illustration Style alt text

Variation 4: Specific Colors (Hex Codes) alt text

Frequently Asked Questions

Is it legal to scrape job listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.

What data does it extract?

It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.

Can I scrape multiple pages or filter by location?

Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!