IMDB - Movie data and ratings scraper
This actor scrapes movie information and movie ratings from IMDB.
Actor cost
The actor uses the Cheerio scraper, which is efficient. The recommended memory is 256 MB.
Choose what you want to scrape: specific title IDs (MOVIE_IDS), a search keyword (QUERY, limited to 5 titles per search), or an IMDB chart (CHART).
If you set CHART, you can also set LIMIT and START_LIMIT_FROM (both optional).
Choose what to collect: title data (GET_MOVIES_DATA), ratings (GET_RATINGS), or IDs (SAVE_IDs).
To keep specific data in a named dataset, set DATASET_NAME, DATASET_NAME_RATINGS, or DATASET_NAME_ID.
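The choices above can be sketched as a small helper that assembles the input object. This is only an illustration: `build_input` is a hypothetical function, not part of the actor, and it assumes that exactly one of MOVIE_IDS, QUERY, or CHART is used per run, which the text suggests but does not state.

```python
from typing import Any, Dict, List, Optional


def build_input(
    movie_ids: Optional[List[str]] = None,
    query: Optional[str] = None,
    chart: Optional[str] = None,
    limit: Optional[int] = None,
    start_limit_from: Optional[int] = None,
    get_movies_data: bool = False,
    get_ratings: bool = False,
    save_ids: bool = False,
) -> Dict[str, Any]:
    """Assemble an actor input dict (hypothetical helper, not part of the actor)."""
    sources = {"MOVIE_IDS": movie_ids, "QUERY": query, "CHART": chart}
    chosen = {key: value for key, value in sources.items() if value}
    # Assumption: one source per run (title IDs, a search keyword, or a chart).
    if len(chosen) != 1:
        raise ValueError("Set exactly one of MOVIE_IDS, QUERY or CHART")
    # The README describes SAVE_IDs as applying only to QUERY or CHART.
    if save_ids and "MOVIE_IDS" in chosen:
        raise ValueError("SAVE_IDs only applies to QUERY or CHART")

    run_input: Dict[str, Any] = dict(chosen)
    if "CHART" in chosen:
        # LIMIT and START_LIMIT_FROM are optional and described for CHART only;
        # for other sources this helper silently ignores them.
        if limit is not None:
            run_input["LIMIT"] = limit
        if start_limit_from is not None:
            run_input["START_LIMIT_FROM"] = start_limit_from
    run_input["GET_MOVIES_DATA"] = get_movies_data
    run_input["GET_RATINGS"] = get_ratings
    run_input["SAVE_IDs"] = save_ids
    return run_input


example = build_input(query="John Wick", get_movies_data=True)
print(example)
```

The validation lives in the helper, not in the actor, so a rejected combination here says nothing about how the actor itself would react to it.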
| Input value | Description | Example |
| --- | --- | --- |
| MOVIE_IDS | Scrape a list of IDs from IMDB. The ID can be found in the URL: https://www.imdb.com/title/tt10954600/ | tt10954600 |
| QUERY | Search for any keyword. Limited to 5 titles per search. | "Star Wars" |
| CHART | Scrape any IMDB chart, such as the popularity chart. | moviemeter |
| START_LIMIT_FROM (optional) | Position of the first movie to scrape. Default value is 1. | 1 |
| LIMIT (optional) | Maximum number of movies that will be scraped. Default value is 50. | 500 |
| GET_MOVIES_DATA | Set to true if you want to scrape movie data. | true or false |
| GET_RATINGS | Set to true if you want to scrape ratings. | true or false |
| SAVE_IDs (optional) | Set to true if you want to scrape IDs. Only for QUERY or CHART. | true or false |
| DATASET_NAME (optional) | Save scraped data to a named dataset. If you run the actor several times with the same name but different IDs, this dataset will contain all the data. | EXAMPLE_NAME |
| DATASET_NAME_RATINGS (optional) | Save scraped ratings to a named dataset. | EXAMPLE_RATING |
| DATASET_NAME_ID (optional) | Save scraped IDs to a named dataset. | EXAMPLE_ID |
| CLEAN_DATASET_NAMES (optional) | Array of dataset name(s) which will be deleted first. | ["EXAMPLE_NAME","EXAMPLE_RATING","EXAMPLE_ID"] |
| proxyConfiguration (optional) | Select a proxy to avoid IP address blocking. | |
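Since MOVIE_IDS expects the ID taken from a title URL, a short sketch of that extraction may help. `extract_title_id` is a hypothetical helper, and the pattern assumes that IDs are `tt` followed by digits, which is inferred from the examples in this README only.

```python
import re
from typing import Optional

# Assumption: a title ID is "tt" followed by digits, right after "/title/".
TITLE_ID_PATTERN = re.compile(r"/title/(tt\d+)")


def extract_title_id(url: str) -> Optional[str]:
    """Return the title ID found in an IMDB title URL, or None if there is none."""
    match = TITLE_ID_PATTERN.search(url)
    return match.group(1) if match else None


urls = [
    "https://www.imdb.com/title/tt10954600/",
    "https://www.imdb.com/title/tt2911666",
]
# Keep only the URLs that actually contain a title ID.
movie_ids = [title_id for title_id in map(extract_title_id, urls) if title_id]
print({"MOVIE_IDS": movie_ids, "GET_MOVIES_DATA": True})
```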
Scrape ratings for 3 movies:
{
  "MOVIE_IDS": ["tt10954600", "tt12823454", "tt2911666"],
  "GET_RATINGS": true,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
Scrape data for 5 movies found by the search keyword "John Wick":
{
  "QUERY": "John Wick",
  "GET_MOVIES_DATA": true,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
Charts
IMDB offers several charts.
To scrape one, set the CHART value, for example: { "CHART": "moviemeter" }
| CHART | value |
| --- | --- |
| Popularity | "moviemeter" |
| A-Z | "alpha" |
| User Rating | "user_rating" |
| Number of Votes | "num_votes" |
| US Box Office | "boxoffice_gross_us" |
| Runtime | "runtime" |
| Year | "year" |
| Release Date | "release_date" |
| Date of Your Rating | "your_rating_date" |
| Your Rating | "sort=my_ratings" |
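The chart table can be kept as a lookup so that an input is built from the human-readable chart name. This is a minimal sketch: `CHART_VALUES` and `chart_input` are hypothetical names, the mapping is copied from the table above, and the defaults for LIMIT (50) and START_LIMIT_FROM (1) are the ones this README lists.

```python
from typing import Any, Dict

# Chart names and CHART values, copied from the table above.
CHART_VALUES: Dict[str, str] = {
    "Popularity": "moviemeter",
    "A-Z": "alpha",
    "User Rating": "user_rating",
    "Number of Votes": "num_votes",
    "US Box Office": "boxoffice_gross_us",
    "Runtime": "runtime",
    "Year": "year",
    "Release Date": "release_date",
    "Date of Your Rating": "your_rating_date",
    "Your Rating": "sort=my_ratings",
}


def chart_input(chart_name: str, limit: int = 50, start_limit_from: int = 1) -> Dict[str, Any]:
    """Build an input that scrapes movie data from the named chart."""
    if chart_name not in CHART_VALUES:
        raise KeyError(f"Unknown chart: {chart_name!r}")
    return {
        "CHART": CHART_VALUES[chart_name],
        "LIMIT": limit,
        "START_LIMIT_FROM": start_limit_from,
        "GET_MOVIES_DATA": True,
    }


print(chart_input("User Rating", limit=100))
```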
Result for MOVIE DATA
Input:
{ "GET_MOVIES_DATA": true }
{
  "id": "tt2911666",
  "title": "John Wick",
  "releaseDate": "October 24, 2014 (United States)",
  "director": [
    "Chad Stahelski",
    "David Leitch"
  ],
  "writers": [
    "Derek Kolstad"
  ],
  "countryOfOrigin": [
    "United States",
    "United Kingdom",
    "China"
  ],
  "rating": "7.4",
  "popularity": "119",
  "genres": [
    "Action",
    "Crime",
    "Thriller"
  ],
  "officialSites": [
    "http://www.facebook.com/johnwickmovie",
    "http://www.johnwickthemovie.com/"
  ],
  "language": "EnglishRussianHungarian",
  "locations": [
    "Calvary Cemetery, Woodside, Queens, New York City, New York, USA"
  ],
  "ProductionCompanies": [
    "Summit Entertainment",
    "Thunder Road Pictures",
    "87Eleven"
  ],
  "grossUSCanada": "$43,037,835",
  "openingWeekendUSCanada": "$14,415,922",
  "grossWorldwide": "$86,081,711",
  "runtime": "1 hour 41 minutes",
  "casts": [
    "Keanu Reeves @ John Wick",
    "Michael Nyqvist @ Viggo Tarasov",
    "Alfie Allen @ Iosef Tarasov",
    "Willem Dafoe @ Marcus",
    "Dean Winters @ Avi",
    "Adrianne Palicki @ Ms. Perkins",
    "Omer Barnea @ Gregori",
    "Toby Leonard Moore @ Victor",
    "Daniel Bernhardt @ Kirill",
    "Bridget Moynahan @ Helen",
    "John Leguizamo @ Aurelio",
    "Ian McShane @ Winston",
    "Bridget Regan @ Addy",
    "Lance Reddick @ Hotel Manager",
    "Keith Jardine @ Kuzma",
    "Tait Fletcher @ Nicholai",
    "Kazy Tauginas @ Ivan",
    "Alexander Frekey @ Alexander"
  ],
  "url": "https://www.imdb.com/title/tt2911666"
}
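Several fields in this result are strings that usually need parsing before analysis: cast entries use the form "Actor @ Character", money values carry a currency sign and thousands separators, and the runtime is written in words. The helpers below are a hypothetical post-processing sketch based only on the formats visible in the sample record; other titles may use formats these helpers do not handle.

```python
import re
from typing import Tuple


def split_cast(entry: str) -> Tuple[str, str]:
    """Split "Actor @ Character" into (actor, character)."""
    actor, _, character = entry.partition(" @ ")
    return actor.strip(), character.strip()


def parse_money(amount: str) -> int:
    """Turn a string such as "$43,037,835" into an integer amount.

    Assumption: whole amounts only, as in the sample record.
    """
    return int(re.sub(r"[^\d]", "", amount))


def parse_runtime_minutes(runtime: str) -> int:
    """Turn a string such as "1 hour 41 minutes" into total minutes."""
    hours = re.search(r"(\d+)\s*hours?", runtime)
    minutes = re.search(r"(\d+)\s*minutes?", runtime)
    total = int(hours.group(1)) * 60 if hours else 0
    total += int(minutes.group(1)) if minutes else 0
    return total


print(split_cast("Keanu Reeves @ John Wick"))
print(parse_money("$43,037,835"))
print(parse_runtime_minutes("1 hour 41 minutes"))
```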
Result for IDs
Input:
{ "SAVE_IDs": true }
{
  "url": "https://www.imdb.com/find/?q=John+Wick",
  "titles": [
    {
      "title": "John Wick: Chapter 4",
      "id": "tt10366206"
    },
    {
      "title": "John Wick",
      "id": "tt2911666"
    },
    {
      "title": "John Wick: Chapter 2",
      "id": "tt4425200"
    },
    {
      "title": "John Wick: Chapter 3 - Parabellum",
      "id": "tt6146586"
    },
    {
      "title": "John Wick: Chapter 5",
      "id": "tt12844100"
    }
  ]
}
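One possible use of this result is to feed the saved IDs into a second run through MOVIE_IDS, for example to fetch ratings for every title a search returned. The sketch below assumes the record shape shown above; `ids_to_ratings_input` is a hypothetical helper, and the two-step workflow is a suggestion, not something the actor requires.

```python
from typing import Any, Dict, List


def ids_to_ratings_input(ids_record: Dict[str, Any]) -> Dict[str, Any]:
    """Build a MOVIE_IDS input that requests ratings for every saved title."""
    movie_ids: List[str] = [title["id"] for title in ids_record.get("titles", [])]
    return {"MOVIE_IDS": movie_ids, "GET_RATINGS": True}


# Shortened copy of the record shown above.
record = {
    "url": "https://www.imdb.com/find/?q=John+Wick",
    "titles": [
        {"title": "John Wick: Chapter 4", "id": "tt10366206"},
        {"title": "John Wick", "id": "tt2911666"},
    ],
}
print(ids_to_ratings_input(record))
```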
Result for ratings
Input:
{ "GET_RATINGS": true }
Returns ratings for all demographics:
'imdb_users', 'males', 'females', 'aged_under_18', 'aged_18_29', 'aged_30_44', 'aged_45_plus', 'males_aged_under_18', 'males_aged_18_29', 'males_aged_30_44', 'males_aged_45_plus', 'females_aged_under_18', 'females_aged_18_29', 'females_aged_30_44', 'females_aged_45_plus'
For all IMDB users:
{
  "date": "22-02-2023",
  "title": "John Wick (2014)",
  "demographic": "imdb_users",
  "averageVote": 7.4,
  "arithmeticVote": 7.6,
  "medianVote": 8,
  "totalVotes": 645487,
  "Votes_10": 79402,
  "Percent_10": 12.3,
  "Votes_9": 87601,
  "Percent_9": 13.6,
  "Votes_8": 188132,
  "Percent_8": 29.1,
  "Votes_7": 171542,
  "Percent_7": 26.6,
  "Votes_6": 68794,
  "Percent_6": 10.7,
  "Votes_5": 25724,
  "Percent_5": 4,
  "Votes_4": 10195,
  "Percent_4": 1.6,
  "Votes_3": 5509,
  "Percent_3": 0.9,
  "Votes_2": 3443,
  "Percent_2": 0.5,
  "Votes_1": 5145,
  "Percent_1": 0.8,
  "url": "https://www.imdb.com/title/tt2911666/ratings?demo=imdb_users"
}
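The Votes_1 to Votes_10 fields form a histogram, so summary values can be recomputed from them as a sanity check. The sketch below is hypothetical post-processing and assumes that arithmeticVote is the vote-weighted mean of the scores and medianVote is the median score. The sample record is consistent with that reading, but the README does not define these fields. How averageVote (7.4) is derived is not stated either, so it is not reproduced here.

```python
from typing import Any, Dict, Tuple


def vote_histogram(record: Dict[str, Any]) -> Dict[int, int]:
    """Collect Votes_1 .. Votes_10 into a {score: count} mapping."""
    return {score: record[f"Votes_{score}"] for score in range(1, 11)}


def mean_and_median(histogram: Dict[int, int]) -> Tuple[float, int]:
    """Return the vote-weighted mean and the (lower) median score."""
    total = sum(histogram.values())
    if total == 0:
        raise ValueError("No votes in histogram")
    mean = sum(score * count for score, count in histogram.items()) / total
    # The median is the lowest score whose cumulative count reaches half the votes.
    running = 0
    median = 0
    for score in sorted(histogram):
        running += histogram[score]
        if running * 2 >= total:
            median = score
            break
    return mean, median


# Vote counts copied from the sample record above.
sample = {
    "Votes_10": 79402, "Votes_9": 87601, "Votes_8": 188132, "Votes_7": 171542,
    "Votes_6": 68794, "Votes_5": 25724, "Votes_4": 10195, "Votes_3": 5509,
    "Votes_2": 3443, "Votes_1": 5145,
}
histogram = vote_histogram(sample)
mean, median = mean_and_median(histogram)
print(sum(histogram.values()), round(mean, 1), median)
```

For the sample record the counts add up to totalVotes, and the recomputed mean and median match arithmeticVote and medianVote after rounding to one decimal place.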