Number of connections In this item you enter the number of simultaneous connections. As a rule 5 - 10 connections are made. The optimal number of connections will depend on the number of lines you have and the connection speed of your provider.
Save results automatically To save your results automatically every N of minutes. This option shows how frequently your interim search results are to be saved.
Time out for one connection This option gives the maximum amount of time in seconds during which each document (one connection) is downloaded.
At the end of this time the program starts downloading the next document.
Number of retries The number of attempts made to download each document.
This option shows the number of attempts to download the same file if the provider connection or website link is broken off. The program will make as many attempts to download as you specify.
Copy subdirectory structure from website - to copy the structure of a subdirectory from the website you wish to download. If this option is highlighted your hard drive will be able to create directories like the ones on the website you are downloading.
Apply domainname.com=www.domainname.com In some sites the hyperlinks to other sites contain no original www symbols and when the same documents are downloaded they may be inscribed twice in different directories. This option is designed to deal with this anomaly in Internet sites. If you highlight this option INTERNET-SOFT.COM and WWW.INTERNET-SOFT.COM will be treated as synonymous addresses. The address is automatically prefixed as www in this type of search.
Expand the nodes parents to make the node visible This convenience option is intended to graphically represent the tree of websites scanned. In this way the option shows the current branches of the site being downloaded and enables the program to graphically depict the locations where sites are downloaded.
Identify browser as This option shows how the program will be identified when the website is downloaded by a remote server.
For example, when you download a page using Internet Explorer 5.0, the remote server performs this operations and writes the contents of the server as a protocol. The Extractor program does the same thing when you visit a website.
Proxy Server Enter the proxy server properties (if you use a proxy server).
Then choose any other options you would like to use in downloading and searching for hyperlinks.
We would like to draw your attention to the following: Since the worldwide web contains a huge number of pages great data processing power may be needed as well as a large amount of disk space on your computer to download links and websites. A few hours of work by the program may take up many gigabytes on your hard disk.
File Type Filter: Limiting the types and sizes of filesYou can use this option to specify the types of files you want to download and limit their size.
This is important, for example, when you only want to download text documents without banners, pictures or archive files.
In this case, check the option beside html, htm, txt and shtml, etc. files. You can use these menu options to limit the size of files to be downloaded. If you have selected "Load all file sizes", files of all sizes will be downloaded. Otherwise you will only get the sizes (specified in bytes) you have selected.
URL / Domain Filter: Limitations by names of directories, domain names and files.
You can make limitations by entering certain words in domains. Let's say you're downloading files only from https://www.internet-soft.com.
You would only enter
internet-soft as the filter word.
The filter can be used separately:
- to adjust the word content in a domain name;
- to expand the domain;
- to adjust the contents of a certain word in a directory name;
- to modify any given word in the file name.
The filter can be used to include and exclude. If you have entered words into the exclude filter, this means that if the URL contains any of these words, the corresponding files will not be downloaded. If you opt for the include filter, this means that only the names containing the properties specified in the word filter will be downloaded.
Domains: Limitations by domain type. This option enables you to make limitations by type and country of the domain.
To do this click on the requested domain type. This is all you have to do for the
main program settings.
When you exit the menu window you save by default the data you have entered and you can proceed to download websites.
Now we can start a project. The default properties you have entered will automatically be called up when you start a new project. These properties can be altered and saved for a later time for each separate project.
The term "project" therefore refers to the total number of options that define which site and properties are to be downloaded.
Downloading a websiteIn order to download the website you need into your hard drive, first create a downloading project.
- Select Project on the main menu and then New. A window will appear for viewing websites and entering download properties;
- Now enter the address of the site you would like to download;
- Press Download / Extract.
The lower panel will show the pages of the website which is being downloaded; After this the websites are downloaded to your computer. Let's now take a more detailed look at the main control panels used in a downloading project.