Dark Web Scraper

Dark Web Scraper

Uncover valuable insights with our Dark Web Scraper. Extract sensitive data, including crypto wallets, API keys, emails, phone numbers, and more, from the depths of the Dark Web. You can specify search terms, and customize and retrieve OSINT data out of the box.

AUTOMATIONNEWSApify

Actor - Dark Web Scraper

Dark Web Scraper

This actor allows you to scrape Dark Web sites. Within the OSINT support, you can use this actor to find sensitive information.

The Dark Web data scraper supports the following features:

  • Search any keyword - You can search any keyword you would like to have and get the results.

  • Scrape sensitive information - Scrape any sensitive information like emails, phones, API Keys, or crypto wallets from Dark Web.

  • Scrape TOR - Since the actor supports TOR proxy, you can scrape any information up to your needs.

  • Customizable - If you are looking for specific data, you can develop your function and integrate it into the scraper.

Bugs, fixes, updates, and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Input Parameters

The input of this scraper should be JSON containing the list of pages on the Dark Web that should be visited. Possible fields are:

  • search: (Optional) (String) Keyword that you want to search on Dark Web

  • startUrls: (Optional) (Array) List of Dark Web URLs.

  • maxDepth: (Optional) (Number) Maximum depth the scraper will dive into. If you want to scrape a Dark Web site in a very superficial way, you can set this with a low number.

  • maxPages: (Optional) (Number) You can limit scraped pages. This should be useful when you search through the big websites.

  • maxItems: (Optional) (Number) You can limit scraped items. This should be useful when you search through the big lists or search results.

  • extendOutputFunction: (Optional) (String) Function that takes a JQuery handle ($) as an argument and returns an object with data.

Tip

When you want to scrape over a specific URL, just copy and paste the link as one of the startUrl.

If you would like to scrape the website quickly without diving into its deeper pages, you can set maxDepth option within a lower number.

Compute Unit Consumption

The actor is optimized to run blazing fast and scrape many as pages as possible. If the actor doesn't block very often it'll scrape 100 pages in 5 seconds with ~0.1-0.2 compute units.

Dark Web Scraper Input example

1{
2  "startUrls":[
3    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/"
4  ],
5  "search": "Dark Web",
6  "maxDepth": 5,
7  "maxPages": 10,
8}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with a failure state and output an explanation of what is wrong.

Dark Web Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this Dark Web actor.

Scraped Properties

The structure of each item in Dark Web Scaper looks like this:

Item Detail

1{
2  "url": "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/cellar-door/",
3  "links": [
4    "http://ogp.me/ns",
5    "http://ogp.me/ns/fb",
6    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/cellar-door/",
7    "https://schema.org/WebPage",
8    "https://schema.org/SiteNavigationElement",
9    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/",
10    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/faq/",
11    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/support/",
12    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/privacy-notice/",
13    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/sitemap/",
14    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/stream-recording/",
15    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/art/",
16    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/color-artwork/",
17    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/music/",
18    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/sketch-books/",
19    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/category/txt/stories/",
20    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/text/",
21    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/category/txt/blog/",
22    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/category/txt/guides/",
23    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/category/txt/metaverse/",
24    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/category/txt/rant/",
25    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/download/",
26    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/audio-testing/",
27    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/contact/",
28    "https://schema.org/CreativeWork",
29    "http://www.anubianhost.com/account/aff.php",
30    "https://floatoverblow.com",
31    "http://furrystuff.com/",
32    "http://www.infochammel.com/",
33    "http://www.jillianmayer.net/",
34    "http://www.kainless.com/",
35    "http://www.kainless.com",
36    "https://www.jwz.org/",
37    "https://lameazoid.com/",
38    "https://emreed.net/LowTech_Directory.html",
39    "https://wiby.me/",
40    "https://web.archive.org/web/20080915204851/http",
41    "https://www.nevermindstu.com",
42    "https://www.youtube.com/channel/UC-CEC8G9v3ry2VOllsJNVYw",
43    "https://www.twitch.tv/funkyjoe86",
44    "https://www.numou.net/",
45    "https://aronsmusings.wordpress.com/",
46    "https://physonyl.net",
47    "https://protohub.online/",
48    "https://blog.protohub.online/JourneyOfTheProtogen/",
49    "http://www.rantradio.com/",
50    "http://www.rant.social/",
51    "https://repair.org",
52    "http://www.thisismymilwaukee.com",
53    "https://www.youtube.com/watch",
54    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/tor-networks/",
55    "http://phobosxilamwcg75xt22id7aywkzol6q6rfl2flipcqoc4e4ahima5id.onion/",
56    "http://underdiriled6lvdfgiw4e5urfofuslnz7ewictzf76h4qb73fxbsxad.onion/",
57    "http://wclekwrf2aclunlmuikf2bopusjfv66jlhwtgbiycy5nw524r6ngioid.onion",
58    "http://meynethaffeecapsvfphrcnfrx44w2nskgls2juwitibvqctk2plvhqd.onion",
59    "http://galaxy3bhpzxecbywoa2j4tg43muepnhfalars4cce3fcx46qlc6t3id.onion",
60    "http://tor66sewebgixwhcqfnp5inzp5x5uohhdy3kvtnyfxc2e5mxiuh34iid.onion",
61    "http://y5wnzw4e6i7srm2gqadlow5anhlaj5avdkzbwzbmrxwkygxdp7ffieqd.onion/",
62    "http://absjpxsvyn5cboihzenbyfngq224rpvtfgnehwwvkhjm3gmk6oruhoad.onion/blog/",
63    "http://qrtitjevs5nxq6jvrnrjyz5dasi3nbzx24mzmfxnuk2dnzhpphcmgoyd.onion/",
64    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/lbry-and-odysee-a-video-hosting-review/",
65    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/web-3-0/",
66    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/clarification-time/",
67    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/nginx-image-format-wars/",
68    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/presearch/",
69    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/site-supported-codec-guidelines/",
70    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/web-3-0/comment-page-1/",
71    "http://www.musex.space",
72    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/rollerblade-office-caster-wheels-buyer-beware/comment-page-4/",
73    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/chinese-xbox-360-wireless-receiver-driver-setup/comment-page-35/",
74    "http://wolfballs.com",
75    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion/xbox-dvd-remote-everything/comment-page-1/",
76    "https://schema.org/WPFooter",
77    "https://www.s-config.com",
78    "http://xjfbpuj56rdazx4iolylxplbvyft2onuerjeimlcqwaihp3s6r4xebqd.onion"
79  ],
80  "emails": [],
81  "phones": [],
82  "cryptoAddresses": {},
83  "misc": {
84    "Twitter username": [
85      "@S_Config"
86    ],
87    "Instagram username": [
88      "@S_Config"
89    ]
90  },
91  "searchKeywordFound": false
92}

Contact

Please visit us through epctex.com to see all the products that are available for you. If you are looking for any custom integration or so, please reach out to us through the chat box in epctex.com. In need of support? business@epctex.com is at your service.

Frequently Asked Questions

Is it legal to scrape job listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.

What data does it extract?

It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.

Can I scrape multiple pages or filter by location?

Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!