Simple Contact Info and Social Media Scraper

Simple Contact Info and Social Media Scraper

This Apify actor is designed to crawl web pages and extract social media handles, emails, and phone numbers using Puppeteer. It can handle dynamic content and navigate through multiple pages, making it suitable for comprehensive data extraction tasks.

LEAD_GENERATIONECOMMERCESOCIAL_MEDIAApify

This Apify actor is designed to crawl web pages and extract social media handles, emails, and phone numbers using Puppeteer. It can handle dynamic content and navigate through multiple pages, making it suitable for comprehensive data extraction tasks.

If you're looking for examples or want to learn more visit:

Included features

  • Data Extraction: Extracts social media handles, emails, and phone numbers.
  • Dynamic Content Handling: Supports crawling through links and HTML frames.
  • Configurable: Set depth and request limits.
  • Proxy Support: Uses Apify's proxy configuration for anonymity and IP rotation.

How it works

  1. Input: Define start URLs in INPUT.json.
  2. Proxy Configuration: Set up proxies to avoid IP blocking.
  3. Crawler Setup: Use PuppeteerCrawler with custom routing.
  4. Request Handling: Customize page handling in routes.js.
  5. Execution: Start the crawler with crawler.run(startUrls);.

Input Configuration

1{
2    "considerChildFrames": true,
3    "maxDepth": 2,
4    "maxRequests": 100,
5    "sameDomain": true,
6    "startUrls": [
7        {
8            "url": "https://nonos.ph/",
9            "method": "GET"
10        }
11    ]
12}
  • startUrls: List of URLs to start crawling from.
  • proxyConfig: Configuration for using Apify's proxy services.
  • sameDomain: Restrict crawling to the same domain.
  • maxDepth: Maximum depth of links to follow.
  • considerChildFrames: Enable crawling of HTML frames.
  • maxRequests: Total number of requests to make.
  • maxRequestsPerStartUrl: Limit requests per start URL.

Output Dataset

1[
2    {
3        "depth": 0,
4        "referrerUrl": null,
5        "url": "https://cafelunanewyork.com/",
6        "domain": "cafelunanewyork.com",
7        "emails": [
8            "info@cafelunany.com"
9        ],
10        "phones": [
11            "16468611751",
12            "6468611751"
13        ],
14        "linkedIns": [],
15        "twitters": [],
16        "instagrams": [
17            "https://www.instagram.com/cafelunanyc"
18        ],
19        "facebooks": [
20            "https://www.facebook.com/102391985277345"
21        ],
22        "youtubes": [],
23        "tiktoks": [],
24        "pinterests": [],
25        "discords": []
26    },
27    {
28        "depth": 1,
29        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=craft_beer",
30        "domain": "cafelunanewyork.com",
31        "emails": [
32            "info@cafelunany.com"
33        ],
34        "phones": [
35            "16468611751"
36        ],
37        "linkedIns": [],
38        "twitters": [],
39        "instagrams": [
40            "https://www.instagram.com/cafelunanyc"
41        ],
42        "facebooks": [
43            "https://www.facebook.com/102391985277345"
44        ],
45        "youtubes": [],
46        "tiktoks": [],
47        "pinterests": [],
48        "discords": []
49    },
50    {
51        "depth": 1,
52        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=vegetarian_dishes",
53        "domain": "cafelunanewyork.com",
54        "emails": [
55            "info@cafelunany.com"
56        ],
57        "phones": [
58            "16468611751"
59        ],
60        "linkedIns": [],
61        "twitters": [],
62        "instagrams": [
63            "https://www.instagram.com/cafelunanyc"
64        ],
65        "facebooks": [
66            "https://www.facebook.com/102391985277345"
67        ],
68        "youtubes": [],
69        "tiktoks": [],
70        "pinterests": [],
71        "discords": []
72    },
73    {
74        "depth": 1,
75        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=freshly_baked_daily",
76        "domain": "cafelunanewyork.com",
77        "emails": [
78            "info@cafelunany.com"
79        ],
80        "phones": [
81            "16468611751"
82        ],
83        "linkedIns": [],
84        "twitters": [],
85        "instagrams": [
86            "https://www.instagram.com/cafelunanyc"
87        ],
88        "facebooks": [
89            "https://www.facebook.com/102391985277345"
90        ],
91        "youtubes": [],
92        "tiktoks": [],
93        "pinterests": [],
94        "discords": []
95    },
96    {
97        "depth": 1,
98        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=great_pastries",
99        "domain": "cafelunanewyork.com",
100        "emails": [
101            "info@cafelunany.com"
102        ],
103        "phones": [
104            "16468611751"
105        ],
106        "linkedIns": [],
107        "twitters": [],
108        "instagrams": [
109            "https://www.instagram.com/cafelunanyc"
110        ],
111        "facebooks": [
112            "https://www.facebook.com/102391985277345"
113        ],
114        "youtubes": [],
115        "tiktoks": [],
116        "pinterests": [],
117        "discords": []
118    },
119    {
120        "depth": 1,
121        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=local_ingredients",
122        "domain": "cafelunanewyork.com",
123        "emails": [
124            "info@cafelunany.com"
125        ],
126        "phones": [
127            "16468611751"
128        ],
129        "linkedIns": [],
130        "twitters": [],
131        "instagrams": [
132            "https://www.instagram.com/cafelunanyc"
133        ],
134        "facebooks": [
135            "https://www.facebook.com/102391985277345"
136        ],
137        "youtubes": [],
138        "tiktoks": [],
139        "pinterests": [],
140        "discords": []
141    },
142    {
143        "depth": 1,
144        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=great_sandwiches",
145        "domain": "cafelunanewyork.com",
146        "emails": [
147            "info@cafelunany.com"
148        ],
149        "phones": [
150            "16468611751"
151        ],
152        "linkedIns": [],
153        "twitters": [],
154        "instagrams": [
155            "https://www.instagram.com/cafelunanyc"
156        ],
157        "facebooks": [
158            "https://www.facebook.com/102391985277345"
159        ],
160        "youtubes": [],
161        "tiktoks": [],
162        "pinterests": [],
163        "discords": []
164    },
165    {
166        "depth": 1,
167        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=great_burgers",
168        "domain": "cafelunanewyork.com",
169        "emails": [
170            "info@cafelunany.com"
171        ],
172        "phones": [
173            "16468611751"
174        ],
175        "linkedIns": [],
176        "twitters": [],
177        "instagrams": [
178            "https://www.instagram.com/cafelunanyc"
179        ],
180        "facebooks": [
181            "https://www.facebook.com/102391985277345"
182        ],
183        "youtubes": [],
184        "tiktoks": [],
185        "pinterests": [],
186        "discords": []
187    },
188    {
189        "depth": 1,
190        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=comfort_food",
191        "domain": "cafelunanewyork.com",
192        "emails": [
193            "info@cafelunany.com"
194        ],
195        "phones": [
196            "16468611751"
197        ],
198        "linkedIns": [],
199        "twitters": [],
200        "instagrams": [
201            "https://www.instagram.com/cafelunanyc"
202        ],
203        "facebooks": [
204            "https://www.facebook.com/102391985277345"
205        ],
206        "youtubes": [],
207        "tiktoks": [],
208        "pinterests": [],
209        "discords": []
210    },
211    {
212        "depth": 1,
213        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=gluten_free_dishes",
214        "domain": "cafelunanewyork.com",
215        "emails": [
216            "info@cafelunany.com"
217        ],
218        "phones": [
219            "16468611751"
220        ],
221        "linkedIns": [],
222        "twitters": [],
223        "instagrams": [
224            "https://www.instagram.com/cafelunanyc"
225        ],
226        "facebooks": [
227            "https://www.facebook.com/102391985277345"
228        ],
229        "youtubes": [],
230        "tiktoks": [],
231        "pinterests": [],
232        "discords": []
233    },
234    {
235        "depth": 1,
236        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=organic",
237        "domain": "cafelunanewyork.com",
238        "emails": [
239            "info@cafelunany.com"
240        ],
241        "phones": [
242            "16468611751"
243        ],
244        "linkedIns": [],
245        "twitters": [],
246        "instagrams": [
247            "https://www.instagram.com/cafelunanyc"
248        ],
249        "facebooks": [
250            "https://www.facebook.com/102391985277345"
251        ],
252        "youtubes": [],
253        "tiktoks": [],
254        "pinterests": [],
255        "discords": []
256    },
257    {
258        "depth": 1,
259        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=vegan_dishes",
260        "domain": "cafelunanewyork.com",
261        "emails": [
262            "info@cafelunany.com"
263        ],
264        "phones": [
265            "16468611751"
266        ],
267        "linkedIns": [],
268        "twitters": [],
269        "instagrams": [
270            "https://www.instagram.com/cafelunanyc"
271        ],
272        "facebooks": [
273            "https://www.facebook.com/102391985277345"
274        ],
275        "youtubes": [],
276        "tiktoks": [],
277        "pinterests": [],
278        "discords": []
279    },
280    {
281        "depth": 1,
282        "url": "https://cafelunanewyork.com/-food-menu?source=header&spot_id=93121&destination=food_menu&promotion=family_owned",
283        "domain": "cafelunanewyork.com",
284        "emails": [
285            "info@cafelunany.com"
286        ],
287        "phones": [
288            "16468611751"
289        ],
290        "linkedIns": [],
291        "twitters": [],
292        "instagrams": [
293            "https://www.instagram.com/cafelunanyc"
294        ],
295        "facebooks": [
296            "https://www.facebook.com/102391985277345"
297        ],
298        "youtubes": [],
299        "tiktoks": [],
300        "pinterests": [],
301        "discords": []
302    },
303    {
304        "depth": 1,
305        "url": "https://cafelunanewyork.com/new-york-cafe-luna-food-menu",
306        "domain": "cafelunanewyork.com",
307        "emails": [
308            "info@cafelunany.com"
309        ],
310        "phones": [
311            "16468611751"
312        ],
313        "linkedIns": [],
314        "twitters": [],
315        "instagrams": [
316            "https://www.instagram.com/cafelunanyc"
317        ],
318        "facebooks": [
319            "https://www.facebook.com/102391985277345"
320        ],
321        "youtubes": [],
322        "tiktoks": [],
323        "pinterests": [],
324        "discords": []
325    },
326    {
327        "depth": 1,
328        "url": "https://cafelunanewyork.com/accessibility-page-01",
329        "domain": "cafelunanewyork.com",
330        "emails": [
331            "info@cafelunany.com"
332        ],
333        "phones": [
334            "16468611751"
335        ],
336        "linkedIns": [],
337        "twitters": [],
338        "instagrams": [
339            "https://www.instagram.com/cafelunanyc"
340        ],
341        "facebooks": [
342            "https://www.facebook.com/102391985277345"
343        ],
344        "youtubes": [],
345        "tiktoks": [],
346        "pinterests": [],
347        "discords": []
348    },
349    {
350        "depth": 1,
351        "url": "https://cafelunanewyork.com/#content",
352        "domain": "cafelunanewyork.com",
353        "emails": [
354            "info@cafelunany.com"
355        ],
356        "phones": [
357            "16468611751",
358            "6468611751"
359        ],
360        "linkedIns": [],
361        "twitters": [],
362        "instagrams": [
363            "https://www.instagram.com/cafelunanyc"
364        ],
365        "facebooks": [
366            "https://www.facebook.com/102391985277345"
367        ],
368        "youtubes": [],
369        "tiktoks": [],
370        "pinterests": [],
371        "discords": []
372    },
373    {
374        "depth": 1,
375        "url": "https://cafelunanewyork.com/new-york-cafe-luna-drink-menu",
376        "domain": "cafelunanewyork.com",
377        "emails": [
378            "info@cafelunany.com"
379        ],
380        "phones": [
381            "16468611751"
382        ],
383        "linkedIns": [],
384        "twitters": [],
385        "instagrams": [
386            "https://www.instagram.com/cafelunanyc"
387        ],
388        "facebooks": [
389            "https://www.facebook.com/102391985277345"
390        ],
391        "youtubes": [],
392        "tiktoks": [],
393        "pinterests": [],
394        "discords": []
395    },
396    {
397        "depth": 2,
398        "url": "https://cafelunanewyork.com/61570",
399        "domain": "cafelunanewyork.com",
400        "emails": [],
401        "phones": [],
402        "linkedIns": [],
403        "twitters": [],
404        "instagrams": [],
405        "facebooks": [],
406        "youtubes": [],
407        "tiktoks": [],
408        "pinterests": [],
409        "discords": []
410    },
411    {
412        "depth": 2,
413        "url": "https://cafelunanewyork.com/86157",
414        "domain": "cafelunanewyork.com",
415        "emails": [],
416        "phones": [],
417        "linkedIns": [],
418        "twitters": [],
419        "instagrams": [],
420        "facebooks": [],
421        "youtubes": [],
422        "tiktoks": [],
423        "pinterests": [],
424        "discords": []
425    },
426    {
427        "depth": 2,
428        "url": "https://cafelunanewyork.com/86145",
429        "domain": "cafelunanewyork.com",
430        "emails": [],
431        "phones": [],
432        "linkedIns": [],
433        "twitters": [],
434        "instagrams": [],
435        "facebooks": [],
436        "youtubes": [],
437        "tiktoks": [],
438        "pinterests": [],
439        "discords": []
440    },
441    {
442        "depth": 2,
443        "url": "https://cafelunanewyork.com/61568",
444        "domain": "cafelunanewyork.com",
445        "emails": [],
446        "phones": [],
447        "linkedIns": [],
448        "twitters": [],
449        "instagrams": [],
450        "facebooks": [],
451        "youtubes": [],
452        "tiktoks": [],
453        "pinterests": [],
454        "discords": []
455    },
456    {
457        "depth": 2,
458        "url": "https://cafelunanewyork.com/86144",
459        "domain": "cafelunanewyork.com",
460        "emails": [],
461        "phones": [],
462        "linkedIns": [],
463        "twitters": [],
464        "instagrams": [],
465        "facebooks": [],
466        "youtubes": [],
467        "tiktoks": [],
468        "pinterests": [],
469        "discords": []
470    },
471    {
472        "depth": 2,
473        "url": "https://cafelunanewyork.com/582559",
474        "domain": "cafelunanewyork.com",
475        "emails": [],
476        "phones": [],
477        "linkedIns": [],
478        "twitters": [],
479        "instagrams": [],
480        "facebooks": [],
481        "youtubes": [],
482        "tiktoks": [],
483        "pinterests": [],
484        "discords": []
485    },
486    {
487        "depth": 2,
488        "url": "https://cafelunanewyork.com/61565",
489        "domain": "cafelunanewyork.com",
490        "emails": [],
491        "phones": [],
492        "linkedIns": [],
493        "twitters": [],
494        "instagrams": [],
495        "facebooks": [],
496        "youtubes": [],
497        "tiktoks": [],
498        "pinterests": [],
499        "discords": []
500    },
501    {
502        "depth": 2,
503        "url": "https://cafelunanewyork.com/61566",
504        "domain": "cafelunanewyork.com",
505        "emails": [],
506        "phones": [],
507        "linkedIns": [],
508        "twitters": [],
509        "instagrams": [],
510        "facebooks": [],
511        "youtubes": [],
512        "tiktoks": [],
513        "pinterests": [],
514        "discords": []
515    },
516    {
517        "depth": 2,
518        "url": "https://cafelunanewyork.com/113622",
519        "domain": "cafelunanewyork.com",
520        "emails": [],
521        "phones": [],
522        "linkedIns": [],
523        "twitters": [],
524        "instagrams": [],
525        "facebooks": [],
526        "youtubes": [],
527        "tiktoks": [],
528        "pinterests": [],
529        "discords": []
530    }
531]

The actor stores its results in the default dataset associated with the actor run. You can download the results in formats such as JSON, HTML, CSV, XML, or Excel. Each record in the dataset includes:

  • URL: The page URL.
  • Email: Extracted email addresses.
  • Phone Number: Extracted phone numbers.
  • Social Media Profiles: Links to social media profiles (e.g., Facebook, Twitter, LinkedIn).

Resources

If you're looking for examples or want to learn more visit:

Documentation reference

To learn more about Apify and Actors, take a look at the following resources:

Frequently Asked Questions

Is it legal to scrape job listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.

What data does it extract?

It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.

Can I scrape multiple pages or filter by location?

Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!