This Actor extracts transcripts from Facebook video pages. It's designed to help you obtain text transcripts from videos posted on Facebook's platform.
Features
Extracts transcript data from Facebook video pages
Handles proper request headers to mimic a real browser
Provides detailed error reporting
Works with Apify proxy to avoid IP blocks and rate limiting
Simple configuration through INPUT_SCHEMA
Usage
Input Configuration
The Actor accepts the following input parameters:
Field | Type | Description
url | String | Required. URL of the Facebook video page from which to extract the transcript.
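As a rough illustration of the input described above, the sketch below checks an input object before a run is started. The `validateInput` helper and the facebook.com host check are assumptions made for this example; they are not taken from the Actor's source.

```javascript
// Hypothetical helper: validates an input object shaped like { url: "..." }.
function validateInput(input) {
  if (!input || typeof input.url !== 'string' || input.url.trim() === '') {
    throw new Error('Input field "url" is required and must be a non-empty string.');
  }

  let parsed;
  try {
    parsed = new URL(input.url);
  } catch {
    throw new Error(`Input field "url" is not a valid URL: ${input.url}`);
  }

  // Assumption: accept facebook.com and any of its subdomains (www, web, m, ...).
  const host = parsed.hostname.toLowerCase();
  if (host !== 'facebook.com' && !host.endsWith('.facebook.com')) {
    throw new Error(`Expected a facebook.com URL, got host "${host}".`);
  }

  return { url: parsed.href };
}

// Example input, using the video URL shown in the Output section.
const input = validateInput({
  url: 'https://web.facebook.com/briantylercohen/videos/1350752639547526',
});
```

Failing early on a malformed URL avoids spending a run (and proxy traffic) on a request that cannot succeed.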
Apify Platform: The easiest way to run the Actor is through the Apify platform. Just search for "Facebook Video Transcript Extractor" in the Apify Store.
Command Line (via Apify CLI):
apify run -p
API: You can also run the Actor programmatically via the Apify API.
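A minimal sketch of an API call is shown below. It builds a request for the Apify API v2 "run Actor" endpoint and leaves the actual network call commented out. The Actor ID `your-username~facebook-video-transcript` and the `APIFY_TOKEN` environment variable are placeholders, not values from this README; substitute the ID shown on the Actor's Apify Store page and your own API token.

```javascript
const API_BASE = 'https://api.apify.com/v2';

// Hypothetical helper: builds the URL and fetch options for starting a run.
// actorId uses the "username~actor-name" form accepted by the Apify API.
function buildRunRequest(actorId, token, input) {
  return {
    url: `${API_BASE}/acts/${encodeURIComponent(actorId)}/runs`,
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${token}`,
      },
      // The POST body is passed to the Actor as its input.
      body: JSON.stringify(input),
    },
  };
}

const request = buildRunRequest(
  'your-username~facebook-video-transcript', // placeholder Actor ID
  process.env.APIFY_TOKEN || 'YOUR_API_TOKEN', // placeholder token
  { url: 'https://web.facebook.com/briantylercohen/videos/1350752639547526' },
);

// To start the run (Node 18+ provides a global fetch):
// const response = await fetch(request.url, request.options);
// const { data: run } = await response.json();
// console.log(run.id, run.defaultDatasetId);
```

Once the run finishes, the results can be read from the dataset identified by `defaultDatasetId`.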
Output
The Actor saves extracted transcripts to the default dataset. Each item in the dataset has the following structure:
{
  "url": "https://web.facebook.com/briantylercohen/videos/1350752639547526",
  "transcript": "This is the extracted transcript text..."
}
In case of errors or if no transcript is found, the output will look like:
{
  "url": "https://web.facebook.com/briantylercohen/videos/1350752639547526",
  "transcript": null,
  "error": "Error message or 'No transcript found in the page'"
}
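Because success and failure items share one dataset, a consumer usually needs to separate them. The sketch below does this for items shaped like the two examples above; `partitionResults` is a hypothetical helper written for this README, not part of the Actor.

```javascript
// Hypothetical helper: splits dataset items into transcripts and failures.
function partitionResults(items) {
  const succeeded = [];
  const failed = [];

  for (const item of items) {
    if (typeof item.transcript === 'string' && item.transcript.length > 0) {
      succeeded.push({ url: item.url, transcript: item.transcript });
    } else {
      // Items without a transcript are expected to carry an "error" field.
      failed.push({ url: item.url, error: item.error ?? 'Unknown error' });
    }
  }

  return { succeeded, failed };
}

// Sample items mirroring the two output shapes documented above.
const sampleItems = [
  {
    url: 'https://web.facebook.com/briantylercohen/videos/1350752639547526',
    transcript: 'This is the extracted transcript text...',
  },
  {
    url: 'https://web.facebook.com/briantylercohen/videos/1350752639547526',
    transcript: null,
    error: 'No transcript found in the page',
  },
];

const { succeeded, failed } = partitionResults(sampleItems);
console.log(`${succeeded.length} transcript(s), ${failed.length} failure(s)`);
```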
Limitations
This Actor relies on the current structure of Facebook's video pages. If Facebook changes their page structure or how transcripts are embedded, the Actor may need to be updated.
Facebook may rate-limit or block requests that appear automated. Using the Apify proxy helps mitigate this issue.
Not all Facebook videos have transcripts available.
Technical Details
The Actor performs the following steps:
Takes the input URL and configures the HTTP request with browser-like headers
Fetches the HTML content of the Facebook video page
Parses the page to locate script tags containing transcript data
Extracts the transcript using a regex pattern
Saves the results to the Apify dataset
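The steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the Actor's actual code: the header values, the `"transcript_text"` key, and the regex are hypothetical, since this README does not document the real pattern. To keep the example dependency-free, script tags are located with a regex here rather than with jsdom, and the HTTP fetch is shown only as a comment.

```javascript
// Assumption: a typical browser-like header set; the Actor's real headers may differ.
const BROWSER_HEADERS = {
  'User-Agent':
    'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 ' +
    '(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36',
  Accept: 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
  'Accept-Language': 'en-US,en;q=0.9',
};

// Hypothetical pattern: captures the JSON string literal after a "transcript_text" key.
const TRANSCRIPT_PATTERN = /"transcript_text"\s*:\s*("(?:[^"\\]|\\.)*")/;

// Scans inline script bodies and returns the first transcript found, or null.
function extractTranscript(html) {
  const scriptPattern = /<script\b[^>]*>([\s\S]*?)<\/script>/gi;
  for (const [, body] of html.matchAll(scriptPattern)) {
    const match = TRANSCRIPT_PATTERN.exec(body);
    if (!match) continue;
    try {
      // JSON.parse decodes escapes such as \" and \n in the captured literal.
      return JSON.parse(match[1]);
    } catch {
      continue; // Malformed literal: try the next script tag.
    }
  }
  return null;
}

// Builds an item in the shape documented in the Output section.
function toDatasetItem(url, html) {
  const transcript = extractTranscript(html);
  return transcript !== null
    ? { url, transcript }
    : { url, transcript: null, error: 'No transcript found in the page' };
}

// With axios installed, the page could be fetched like this:
// const { data: html } = await axios.get(url, { headers: BROWSER_HEADERS });
// const item = toDatasetItem(url, html);
```

Capturing the whole quoted literal and decoding it with `JSON.parse` avoids hand-written unescaping, which is easy to get wrong for embedded quotes and Unicode escapes.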
Dependencies
axios: For making HTTP requests
jsdom: For parsing and traversing the HTML
apify: The Apify SDK for integrating with the Apify platform
License
This project is licensed under the Apache License 2.0.
Frequently Asked Questions
Is it legal to scrape job listings or public data?
Yes, if you're scraping publicly available data for personal or internal use. Always review the website's Terms of Service before large-scale use or redistribution.
Do I need to code to use this scraper?
No. This is a no-code tool: just enter a job title and location, then run the scraper directly from your dashboard or the Apify Actor page.
What data does it extract?
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Can I scrape multiple pages or filter by location?
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
How do I get started?
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!