SiteSearcher Documentation


Version 1.20

September 15, 1999

Copyright 1999 Zackware

Disclaimer of Warranty

THIS SOFTWARE AND THE ACCOMPANYING FILES ARE SOLD "AS IS" AND WITHOUT WARRANTIES AS TO PERFORMANCE OR ANY OTHER WARRANTIES WHETHER EXPRESSED OR IMPLIED. NO WARRANTY OF FITNESS FOR A PARTICULAR PURPOSE IS OFFERED.

Good computer software procedure dictates that any program be thoroughly tested with non-critical data before relying on it. The user must assume the entire risk of using the program.

 

Purpose

Have you ever tried searching the web for a particular file, only to quit in frustration after wading through twenty web pages? Or do you know of a web site that has a lot of great content, but you don’t want to take the time to download everything from it manually? SiteSearcher was designed to solve these problems. SiteSearcher will search web sites for the files you specify, automatically following links to other web sites. Files can be specified by extension, so it is possible to search for ZIP files, JPG files, or any other type of file. When SiteSearcher finds a file, it can be downloaded automatically, or you can download it manually.

SiteSearcher uses many features of Windows 95 (and above) to make this process as streamlined as possible. For example, you can explore 5 web sites and download 200 files at the same time without interrupting SiteSearcher or any other running application.

We feel that you’ll find SiteSearcher an indispensable tool for mining the web.

 

How It Works

SiteSearcher works by reading the code that makes up web pages. When you tell SiteSearcher to "browse" a web page, it downloads the code that makes up that page and looks for two things. First, it looks for references to files. If it finds one, and the file matches all the criteria that you have specified, that file is a candidate for download. Second, it looks for references to other web pages. When it finds one, it checks a different set of criteria to determine whether that web page should be added to the list of valid web pages.

At the most basic level, SiteSearcher will just add all the files and web pages it finds to a list that you can see in its window. You can then decide to either download the files, or browse the web pages that it finds. If you set up some of the more advanced features of SiteSearcher, you can have it automatically follow the web pages it finds, and also automatically download the files it finds. You have a substantial amount of control over these aspects. You can specify how many sites to automatically explore, and you can specify exactly what kind of files you’d like to download.
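
The two-step scan described in this section can be sketched in a few lines of Python. This is only an illustration of the idea, not SiteSearcher's actual code: the names LinkScanner and classify are made up for this example, and treating every reference that is not a wanted file as a candidate page is a simplifying assumption.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
import posixpath


class LinkScanner(HTMLParser):
    """Collects every href/src reference found in one page of HTML."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.references = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                # Resolve relative references against the page address.
                self.references.append(urljoin(self.base_url, value))


def classify(html, base_url, wanted_extensions):
    """Split a page's references into candidate files and candidate pages.

    Assumption: anything whose extension is not wanted is treated as a page.
    """
    scanner = LinkScanner(base_url)
    scanner.feed(html)
    files, pages = [], []
    for url in scanner.references:
        ext = posixpath.splitext(urlparse(url).path)[1].lstrip(".").upper()
        if ext in wanted_extensions:
            files.append(url)
        else:
            pages.append(url)
    return files, pages
```

Calling classify(page_text, "http://www.aa.bbb/compa/", {"JPG", "ZIP"}) returns two lists: the references that look like wanted files, and the remaining references, which would then be checked against the site criteria.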

 

Limitations

 

Un-Installation

If you have installed SiteSearcher using the provided install program, the Zackware Start menu group will contain an item that allows you to un-install SiteSearcher. Only the executable and documentation files will be removed. Any files you have downloaded using SiteSearcher will remain on your hard drive.

 

Glossary

Here are a few words used in this document that have a special meaning for this application.

 

User Interface

This is a screen shot of SiteSearcher in action.

[Image]

This screen is divided into two main areas, or "panes". The left pane holds a list of all web sites that have been found, and the right pane shows all files that have been found that match your criteria. These panes are described in detail below. Which items show up in these panes depends on how you have set up SiteSearcher’s options.

There is also a typical Windows menu bar, a toolbar, and a status bar. The menu bar and all its items will be discussed below. The toolbar has several buttons on it that act as shortcuts to many of the menu items. The buttons are not discussed in detail; refer to the menu item descriptions to learn what they do.

The status bar displays useful information about the progress of SiteSearcher. Near the right side of the status bar are three small boxes that also convey status information. The first box contains a progress bar showing SiteSearcher's activity, so you can see that SiteSearcher is in fact doing something. The next box shows how many web sites SiteSearcher is browsing. The last box shows how many files are being downloaded by SiteSearcher.

Next, we go into more detail about the two main portions of the interface.

Sites Pane

This area of the window shows information about the sites that SiteSearcher has searched or is searching. Any site listing that has a + in front of it can be expanded by left-clicking on the +, or by double clicking on the line itself. When you tell SiteSearcher to explore a web site, it will add any other web sites that it finds underneath the original web site.

There are three possible icons that show up in front of the site name that represent the state of that site.

  This site has not been browsed.

  This site is currently being browsed.

  This site has already been browsed.

It is also possible to bring up a context sensitive menu by right clicking on the list of sites. This menu has some of the same entries that the "Site" menu has. Those items are described below in the "Menu Bar" section.

If you double click on a site, SiteSearcher will browse that site using the default mode.  Also, you can use the <INSERT> and <DELETE> keys on your keyboard to add and delete sites.

Files Pane

This area of the SiteSearcher window shows information about files that have been found that match your file search criteria. It may also show the size of each file that it has found, but has not yet downloaded. It is possible to select files from the file list area for downloading. The normal Windows extended selection operations work in this window. You can "shift click" to select several sequential files, or "CTRL click" to select multiple non-sequential files.

The file size column of this pane may or may not be filled, depending on how you have your options set up. In some cases, it is not possible to find the size of a file before SiteSearcher downloads it. In this case <Unknown> will show up in this column. Other times, SiteSearcher may not try to determine the file size, and in this case this column will be blank. Otherwise, this column shows the length of the file in bytes.

It is also possible to bring up a context sensitive menu by right clicking in this area of the window. This menu has some of the same entries that the "File" menu has. Those items are described below in the "Menu Bar" section.

If you double click on a file, SiteSearcher will add that file to the internal download list.  You can also use the <DELETE> key on your keyboard to delete any selected files.

Menu Bar

SiteSearcher’s menu bar contains several sub menus that allow you to access the many features of SiteSearcher. These menus will be discussed in the order that they show up in the Menu Bar. Note that many of these menu items can be accessed by right-clicking in various portions of the window, or by using the toolbar.

File Menu

This menu primarily contains items related to files that SiteSearcher has found. These items are described below:

Site Menu:

This menu contains items relating to searching web sites. These items are described below:

View Menu:

This menu contains items related to viewing various areas of the user interface. These items are described below:

[Image]

This window helps you control the many features of SiteSearcher. The window is broken up into sections.  By clicking on the "tabs" at the top of the window, you can view the different sets of options.  When you are done making changes to the options, you can click on "OK" to accept your changes, or "Cancel" to discard your changes.

Note: Any changes you make on the options screen take effect immediately after you click OK.  Any searches that are in progress at the time will see the changes, so be aware of the repercussions your changes may have.  The exception to this is Mode changes.  Mode changes will not affect any current searches.

File Types Tab
The settings on this tab allow you to specify what type of files to have SiteSearcher search for.  SiteSearcher looks at the file extension to determine what type of file it has found.  The file extension is the last section of the filename, following the period.  For example, the file extension of "FINDSITE.GIF" is "GIF".  If you're interested in looking for picture files, you might add the extensions "JPG", "GIF", and "PCX".  Pressing the "New" button adds a new entry, and allows you to edit it.  If you want to change an entry, select it from the list, and hit "Edit".  If you would like to remove an entry from the list, select it and click the "Delete" button.
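
As an illustration of the extension rule described above, here is a minimal Python sketch. The function names are invented for this example, and ignoring upper and lower case when comparing extensions is an assumption; this manual does not say how SiteSearcher treats case.

```python
def file_extension(filename):
    """Return the text after the last period, upper-cased, or "" if there is none."""
    _, dot, ext = filename.rpartition(".")
    return ext.upper() if dot else ""


def matches_file_types(filename, wanted_extensions):
    """True if the file's extension is in the list of wanted extensions.

    Assumption: the comparison ignores upper/lower case.
    """
    wanted = {ext.upper() for ext in wanted_extensions}
    return file_extension(filename) in wanted
```

With the picture extensions from the example above, matches_file_types("photo.jpg", ["JPG", "GIF", "PCX"]) is true, while a file named "archive.zip" would not match.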

File Size Tab
[Image]

The settings on this tab tell SiteSearcher if and when to check the length of found files.  These settings allow SiteSearcher to filter out files that are too large or too small.  The first button, "Never", turns off file size checking.  The next two buttons, "When file is found" and "When file is downloaded", tell SiteSearcher when to do the check: either when a file is first found, or when the file is downloaded.  If you have this last option set, and you try to download a file that is too large or too small, SiteSearcher will simply ignore the request.

The Min Size and Max Size fields allow you to specify the range of sizes that you would like SiteSearcher to accept.  These sizes are in kilobytes.
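
The way these settings combine can be sketched in Python as follows. This is a hypothetical model, not SiteSearcher's implementation. It assumes that one kilobyte is 1024 bytes, that the Min Size and Max Size limits are inclusive, and that a file whose size is unknown is let through; none of these details are stated in this manual.

```python
from enum import Enum


class SizeCheck(Enum):
    NEVER = "Never"
    WHEN_FOUND = "When file is found"
    WHEN_DOWNLOADED = "When file is downloaded"


def passes_size_filter(size_bytes, setting, stage, min_kb, max_kb):
    """Decide whether a file survives the size filter at a given stage.

    setting: the option chosen on the File Size tab.
    stage:   SizeCheck.WHEN_FOUND or SizeCheck.WHEN_DOWNLOADED, i.e. where we are now.
    """
    # No check at all, or the check belongs to the other stage.
    if setting is SizeCheck.NEVER or setting is not stage:
        return True
    # Assumption: a size that could not be determined is not filtered out.
    if size_bytes is None:
        return True
    size_kb = size_bytes / 1024  # assumption: 1 kilobyte = 1024 bytes
    return min_kb <= size_kb <= max_kb
```

For example, with Min Size 10 and Max Size 100, a 5 KB file is rejected at the stage selected on the tab and passes untouched at the other stage.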

Exclude Tab
[Image]

The settings on the Exclude Tab allow you to specify which files and sites should be ignored by SiteSearcher.  Any entries in the list will be compared to all files and sites that SiteSearcher finds.  If the file name or the site address contains any of the items in the list, the file or site will be filtered out, and you will never see it in the file pane or the site pane.  Pressing the "New" button adds a new entry, and allows you to edit it.  If you want to change an entry, select it from the list, and hit "Edit".  If you would like to remove an entry from the list, select it and click the "Delete" button.
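
The "contains" test described above amounts to a simple substring check, sketched here in Python. The function name and the sample entries are made up for illustration, and ignoring upper and lower case is an assumption.

```python
def is_excluded(name_or_address, exclude_entries):
    """True if the file name or site address contains any exclude entry.

    Assumption: the comparison ignores upper/lower case. Empty entries are skipped.
    """
    text = name_or_address.lower()
    return any(entry.lower() in text for entry in exclude_entries if entry)
```

With the hypothetical entries "ads." and "counter" in the list, an address such as "http://ads.example.com/banner.gif" would be filtered out, while "http://www.aa.bbb/compa/pic.jpg" would not.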

Misc Tab

[Image]

This tab contains various miscellaneous settings.  

The Max File Connections entry allows you to specify how many files are downloaded concurrently.  If you are using a modem line, 2 is probably a good setting.  The faster your Internet connection, the higher this number should be.
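
To show what a limit on concurrent downloads means in practice, here is a small Python sketch that uses a thread pool. It is not how SiteSearcher is implemented; download_all is a hypothetical helper, and fetch stands for whatever function actually retrieves one file.

```python
from concurrent.futures import ThreadPoolExecutor


def download_all(urls, fetch, max_file_connections=2):
    """Call fetch(url) for every URL, with at most max_file_connections running at once.

    Results are returned in the same order as the input URLs.
    """
    with ThreadPoolExecutor(max_workers=max_file_connections) as pool:
        return list(pool.map(fetch, urls))
```

With max_file_connections set to 2, a third file does not start downloading until one of the first two has finished, which keeps a slow connection from being divided among too many transfers.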

The Combo History Depth setting specifies how many entries SiteSearcher remembers for certain text entry fields. For example, if this field has a value of 5 in it, SiteSearcher will remember the last 5 text strings you searched for in the "Find Files" window.
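
A history with a fixed depth behaves like a bounded list in which the oldest entry drops off when a new one is added. A minimal Python sketch follows, assuming that the most recent entry is listed first and that a repeated entry is moved to the top rather than stored twice. Both are assumptions, and ComboHistory is a name invented for this example.

```python
from collections import deque


class ComboHistory:
    """Remembers the most recent text entries, newest first."""

    def __init__(self, depth):
        # A deque with maxlen silently discards the oldest entry when full.
        self._entries = deque(maxlen=depth)

    def remember(self, text):
        # Assumption: a repeated entry moves to the top instead of appearing twice.
        if text in self._entries:
            self._entries.remove(text)
        self._entries.appendleft(text)

    def entries(self):
        return list(self._entries)
```

With a depth of 5, remembering a sixth search string pushes the oldest one out of the list.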

The Ask for Site at start up option controls whether or not SiteSearcher asks for a site to search when it is first run.

The Enable Password Logins option tells SiteSearcher whether or not to ask you for a password when searching a secure site.  If you are using SiteSearcher on a site that you need to log into, you should turn this option on.  If you aren't using SiteSearcher to search these kinds of sites, turn this option off.

The Automatically download files check box turns automatic downloads on and off.  This option is only available in the registered version of the program.  When it is selected, any file that passes all the file filters will be downloaded automatically.  It will not show up in the file pane, but will instead be added to the internal list of files to download.

Modes Tab

[Image]

The settings on this tab allow you to modify the three default Modes that SiteSearcher uses to look for files.  Modes are the most difficult portion of the options to understand.  Most people will probably never need to modify the default modes, since they represent the most common ways SiteSearcher works.  However, if you do have a need to modify a mode, this is the screen to do it on.

A "Mode" tells SiteSearcher what kind of links, and how many, to follow on each site it finds.  A mode controls the breadth of the search SiteSearcher does when searching for files.

There is always a default mode.  The default mode is used whenever you double-click on a site to explore it.  It also will be the mode automatically selected when you enter the "Add Site" screen.  To change the default, select the new mode to use as default, and click on the Set As Default button.  "*DEFAULT*" will show up to the right of the item.

The three default modes work as follows:

If you would like to modify one of these modes, select it from the list, and click the "Edit" button.  The following window will appear:

[Image]

The Mode Name setting contains the text that will show up in the various menus and windows that allow you to specify a mode.  This name should be kept as short as possible, so that it fits nicely into menus and windows.

Under Local Sites, you can specify the maximum number of local sites to follow in the Follow Max setting.  If set to 0, SiteSearcher will only search the page you specify, and no other local pages.  If the Follow Parent Sites option is not checked, SiteSearcher will only follow links that refer to a sub-page of the current page. For example, if SiteSearcher is on a page with the address www.aa.bbb/compa, a reference to www.aa.bbb will not be followed. However, a link to www.aa.bbb/compa/divb will be followed. This comes in handy if you want to search only a specific sub-section of a large site.
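
The sub-page rule in the example above can be expressed as a prefix test on the address. The Python sketch below is only one possible reading of that rule: the function names are hypothetical, a "local" link is assumed to mean one on the same host, and the manual does not say how addresses that end in a file name are handled.

```python
from urllib.parse import urlparse


def _host_and_path(address):
    # Addresses in this manual are written without "http://", so add it for parsing.
    if "://" not in address:
        address = "http://" + address
    parts = urlparse(address)
    return parts.netloc.lower(), parts.path.rstrip("/")


def is_sub_page(current, link):
    """True if link is the current page itself or lies underneath it."""
    cur_host, cur_path = _host_and_path(current)
    link_host, link_path = _host_and_path(link)
    if cur_host != link_host:
        return False  # assumption: a different host is not a local link
    return link_path == cur_path or link_path.startswith(cur_path + "/")


def should_follow_local(current, link, follow_parent_sites):
    """With Follow Parent Sites checked, any local link qualifies; otherwise only sub-pages."""
    return follow_parent_sites or is_sub_page(current, link)
```

Using the addresses from the example, is_sub_page("www.aa.bbb/compa", "www.aa.bbb") is false and is_sub_page("www.aa.bbb/compa", "www.aa.bbb/compa/divb") is true. Comparing against the path plus a trailing slash keeps a look-alike address such as www.aa.bbb/company from being mistaken for a sub-page.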

Under External Sites, there are several settings.  

Help Menu:

Contact Information

Any bug reports, questions, or comments should be e-mailed to:

Zackware@hotmail.com

The home page of SiteSearcher is located at

www.satacoy.com

Distribution of Demo Version

You are hereby licensed to make as many copies of the evaluation version of this software and documentation as you wish; give exact copies of the original evaluation version to anyone; and distribute the evaluation version of the software and documentation in its unmodified form via electronic means. You are specifically prohibited from charging, or requesting donations, for any such copies, however made, without prior written permission of Zackware.

Distribution of Full Version

Distribution of the full release of SiteSearcher is prohibited, except through authorized channels.