LightBlog

Monday, November 18, 2019

Scrape Websites for Valuable Data with the Scrapestack API

There are plenty of great ways to find information on the internet. Still, there is some information that’s harder to find, especially if you’re looking for very specific things. Web scraping services are a great tool for this. Scrapestack is an API that lets you manage your own web scraping.

A web scraper is a tool that visits a website and copies a specific type of data from it. Rather than going through everything manually, you tell the scraper what data to look for and it does all the work. Scrapestack is more powerful than the free website scrapers out there. It provides better security, better reliability, and top-of-the-line servers.

Getting Started

Getting started with Scrapestack is very easy. First, you’ll need to set up an account. Once you have an account, you will be given an API key, which is what you will use to access the API. You can then pass a target URL to the API to scrape data from any website you want.

http://api.scrapestack.com/scrape?access_key=YOUR_ACCESS_KEY&url=https://microsoft.com

In the example above you would be scraping Microsoft’s website. All you need to do is place your API key in the appropriate spot and change the URL at the end. You will then be given the page’s contents without JavaScript and CSS styles.
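As a minimal sketch of the request described above, the Python below builds the same URL and fetches it using only the standard library. The endpoint and the `access_key` and `url` parameters come from the example; the helper names `build_scrape_url` and `fetch_page` are hypothetical, and the sketch assumes the API accepts a percent-encoded target URL, which is how query strings are normally sent.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

API_ENDPOINT = "http://api.scrapestack.com/scrape"  # endpoint from the example above


def build_scrape_url(access_key: str, target_url: str) -> str:
    """Return the request URL for scraping target_url (hypothetical helper)."""
    # urlencode percent-encodes the target URL so that any "?" or "&" inside it
    # is not mistaken for a separate API parameter.
    query = urlencode({"access_key": access_key, "url": target_url})
    return f"{API_ENDPOINT}?{query}"


def fetch_page(access_key: str, target_url: str, timeout: float = 30.0) -> str:
    """Fetch the target page through the API and return the response body as text."""
    with urlopen(build_scrape_url(access_key, target_url), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


# Building the URL needs no network access, so it is safe to try with a placeholder key.
print(build_scrape_url("YOUR_ACCESS_KEY", "https://microsoft.com"))
```

Calling `fetch_page` with a real key in place of `YOUR_ACCESS_KEY` would perform the actual request.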

There are other parameters you can add to that target URL to further refine things.

access_key [Required] Specify your unique API access key to authenticate with the API. Your API access key can be found in your account dashboard.
url [Required] Specify the URL of the web page you would like to scrape.
render_js [Optional] Set to 0 (off, default) or 1 (on) depending on whether or not to render JavaScript on the target web page. JavaScript rendering is done using a Google Chrome headless browser.
keep_headers [Optional] Set to 0 (off, default) or 1 (on) depending on whether or not to send currently active HTTP headers to the target URL with your API request and have the API return these headers along with your API response.
proxy_location [Optional] Specify the 2-letter code of the country you would like to use as a proxy geolocation for your scraping API request. Supported countries differ by proxy type; please refer to the Proxy Locations section for details.
premium_proxy [Optional] Set to 0 (off, default) or 1 (on) depending on whether or not to enable premium residential proxies for your scraping request. Please note that a single premium proxy API request is counted as 25 API requests.
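The parameters listed above can be combined into one request URL. The following is a rough sketch, not an official client: the function names `build_scrape_url` and `requests_charged` are hypothetical, the check on `proxy_location` only confirms it is two letters (it does not know which countries are supported), and the endpoint is taken from the earlier example.

```python
from typing import Optional
from urllib.parse import urlencode

API_ENDPOINT = "http://api.scrapestack.com/scrape"  # endpoint from the article's example
PREMIUM_REQUEST_COST = 25  # a single premium proxy request counts as 25 API requests


def build_scrape_url(
    access_key: str,
    url: str,
    render_js: bool = False,
    keep_headers: bool = False,
    proxy_location: Optional[str] = None,
    premium_proxy: bool = False,
) -> str:
    """Build a request URL from the required and optional parameters (hypothetical helper)."""
    params = {"access_key": access_key, "url": url}
    # Every flag defaults to 0 (off), so only the ones switched on are sent.
    if render_js:
        params["render_js"] = 1
    if keep_headers:
        params["keep_headers"] = 1
    if proxy_location is not None:
        # Shape check only: two letters. Supported countries are not verified here.
        if len(proxy_location) != 2 or not proxy_location.isalpha():
            raise ValueError("proxy_location must be a 2-letter country code")
        params["proxy_location"] = proxy_location
    if premium_proxy:
        params["premium_proxy"] = 1
    return f"{API_ENDPOINT}?{urlencode(params)}"


def requests_charged(premium_proxy: bool) -> int:
    """Return how many API requests one call counts as against the quota."""
    return PREMIUM_REQUEST_COST if premium_proxy else 1


print(build_scrape_url("YOUR_ACCESS_KEY", "https://microsoft.com",
                       render_js=True, proxy_location="us"))
print(requests_charged(premium_proxy=True))  # 25
```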

Pricing

There are several pricing tiers for Scrapestack. The Free tier offers 10,000 API requests, standard proxies, and limited support. The Basic tier costs $19.99 a month and offers 250,000 API requests, HTTPS encryption, concurrent requests, and unlimited support. Bumping up to the Professional tier gets you 1,000,000 requests, and the Business tier goes all the way up to 3,000,000 requests. They cost $79.99 and $149.99 a month, respectively.
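To put these numbers in perspective, here is a small arithmetic sketch using the request counts and prices quoted above together with the 25-request cost of a premium proxy call. It assumes each figure is the tier's total monthly quota and that the free tier costs nothing; the names `TIERS`, `premium_capacity`, and `price_per_thousand` are made up for illustration.

```python
PREMIUM_REQUEST_COST = 25  # a single premium proxy request counts as 25 API requests

# Tier name -> (API requests, monthly price in USD), as quoted in the article.
TIERS = {
    "Free": (10_000, 0.00),
    "Basic": (250_000, 19.99),
    "Professional": (1_000_000, 79.99),
    "Business": (3_000_000, 149.99),
}


def premium_capacity(quota: int) -> int:
    """Number of premium proxy requests that fit in a quota of standard requests."""
    return quota // PREMIUM_REQUEST_COST


def price_per_thousand(quota: int, monthly_price: float) -> float:
    """Price in USD per 1,000 standard requests if the whole quota is used."""
    return monthly_price / (quota / 1000)


for name, (quota, price) in TIERS.items():
    print(
        f"{name}: {premium_capacity(quota):,} premium requests, "
        f"${price_per_thousand(quota, price):.4f} per 1,000 standard requests"
    )
```

On these assumptions, the 10,000 free requests cover only 400 premium proxy calls, so heavy use of premium proxies consumes a quota quickly.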

We thank Scrapestack for sponsoring this post and providing the content. Our sponsors help us pay for the many costs associated with running XDA, including server costs, full-time developers, news writers, and much more. While you might see sponsored content (which will always be labeled as such) alongside Portal content, the Portal team is in no way responsible for these posts. Sponsored content, advertising, and XDA Depot are managed by a separate team entirely. XDA will never compromise its journalistic integrity by accepting money to write favorably about a company, or alter our opinions or views in any way. Our opinion cannot be bought.

The post Scrape Websites for Valuable Data with the Scrapestack API appeared first on xda-developers.



