You can use the tool to extract email addresses from a website to bypass a specific site. The basic keyword search feature allows you to quickly launch a new search. The mass search tool is designed to handle multiple sites, or more precise keyword parsing.
To run the mass search tool, click on “Bulk search” button.
Method 1 – By the list of domains/URL addresses
Specify a list of URLs or domains for bypassing. The program will follow the links, download the pages, extract email addresses and other contacts. In the Crawl Depth parameter, you can specify whether to download pages the links of which will be detected on the pages from the initial list. Crawl Depth = 0 means that LetsExtract should only load the original URL.
Method 2 – By keywords
By clicking the Paste templates button you can insert a basic query template which can be modified to meet your demands. This feature allows you to customize your keyword search more precisely.
- Remove unwanted search engines.
- Enclose the parameter values in quotation marks.
- You can replace the Engine parameter with the domain of your desired search engine.
- Replace the Keyword parameter with a keyword or phrase for the selected search engine.
- The parameter Depth means how many pages to visit from the results (new search results may end earlier). A value = 0 means the first page of the search engine..
- The Max parameter indicates how many links from each search engine page should be bypassed at most. A value = -1 means “no limit”.
- The value Crawl Depth means how many pages LetsExtract should load. With value = 0, the program will load only the page from search results.
Example: We want to search for contacts only on sites from the first page of Yandex (Russia) and Google (UK). And with no limit on the number of results found on this one page. In this case, our templates will look like this:
{Engine="google.co.uk" Keyword="Buy good tea in London" Depth="0" Max="-1"}
{Engine="yandex.ru" Keyword="Manufacture plastic windows" Depth="0" Max="-1"}
We also want LetsExtract to open each site found, and view only the first pages linked to in the search results (not dive deep). To do this, let’s set the overall value of Crawl depth = 1. Let’s run the search:
Method 3 – Generated URL list
Some websites have pages like this:
http://website.com/forum/members?id=12932
To avoid loading all pages of such sites you can generate a list of the required URLs. To start the generator, click Generate Links. Edit the template, click the Generate button, check a few generated addresses, and click “OK” to return to the main window.