Advanced Query Options

The "Advanced Query Options " dialog allows you to view and modify advanced options of a postponed or a completed query. The dialog contains a number of items; each of these items affects the way SSSpider executes the selected query. These options do not affect the rest of queries: to change the default query settings, go to the SSSpider Options dialog.

Options in the groups General and Domain Names are applicable to all queries. Options in the group Comprehensive Reports are meaningful only if the comprehensive report type is selected. To get help on a particular box or label, click the corresponding item on this picture:

Click an item of your interest to get help

1. Query Text
Up to the Options picture

The Query box provides space in which you can enter or modify the text of your query.

2. Query Language
Up to the options picture

SSSpider supports searches in a large number of languages (see SSSuite Multi-Language Support for details). If you need to find information in a language other than English, select the language of your choice.

The SSSpider smart language detecting algorithms perform automatic conversion of documents being processed, into the selected coding table. It is especially required for those languages, which allow data representation under several coding tables, such as Japanese or Russian. For example, by selecting Japanese SHIFT JIS for a query language, you are instructing SSSpider to convert all incoming documents into SHIFT JIS (Japanese Industrial Standard) encoding table; this conversion is performed even for those Web sites, which publish their pages in Japanese EUC, Japanese New-JIS, Japanese Old-JIS and Japanese NEC-JIS encoding. So, for these languages you should select the coding table in which you enter queries; than the required conversion is automatically done by SSSpider.

3. Locations to visit
Up to the options picture

Specifies the total number of documents SSSpider should receive and investigate when processing this query. Set this field to zero or just clear it if you want SSSpider to investigate as many Web documents as it can within a given time period.
See also: Timeout for processing queries.

4. Timeout for processing the query
Up to the Options picture

This field allows you to control how long (in minutes) you would like SSSpider to spend processing the query. Enter zero in this field (or, just empty) if you do not wish to set up any time limit. In this case SSSpider will keep on working until it either downloads the required number of documents or you press the Report now button on the Query Execution Screen.
See also: Locations to visit.

5. Quotations to generate (applicable to comprehensive reports only)
Up to the Options picture

Fill in this field with the number of quotations to be extracted from documents SSSpider will retrieve when processing the query. The quotations included in a report are based on the SSSpider ability to match information contained in the documents it scans, to the search phrase you entered. The value you enter in this field should be between 20 and 1,000.
See also: How SSSpider works.

6. Report Type
Up to the Options picture

With the SSSpider smart text search engine you have the possibility to investigate each Web page provided by engines from the Search Set you've selected. As a result, you will be given a comprehensive report, which contains the most relevant excerpts from different Web pages selected by the SSSpider engine.

If you are low on time and do not want to wait until SSSpider finishes the detailed investigation, you may instruct SSSpider to generate a quick report, which will contain just short descriptions or summaries of each Web page that can possibly answer this query. Quick reports are built based on the information which SSSpider obtains from search engines and therefore, they do not contain quotations from the source documents.

7. Search Method
Up to the Options picture To find paragraphs which match your Search Phrase, SSSpider offers a number of different search methods, namely:
  1. The "Subject search" method is capable of selecting paragraphs, in which your search phrase might be altered, modified, or even misspelled. This method is especially useful when searching for information you are just getting familiar with. This method accepts natural queries (entering something like "how can I rent a car in Italy" is OK), so it is the most powerful and the convenient one.
    Despite this fact, in a number of cases selecting one of the following Literal Search methods SSSpider also provides you with, may be more preferable. For example, when you are looking for occurrences of a particular person's name, geographical location, or/and number or numbers.

  2. The literal search method "All words in same order ignore case" selects documents containing all of the words in your phrase following each other and separated by one or several spaces or/and line breaks. For example, "Kryloff Technologies" is matched to:
    a) "  Kryloff        Technologies  ",
    b) "KRYLOFF (end of line)
       TECHNOLOGIES
    ", etc., but
    c) not to "Kryloff Technology".
    To match the words "Technology" and "Technologies", use the Subject Search method described above.
  3. The literal search method "Entire pattern ignore case" selects documents containing your search phrase as you type it in (including leading and trailing spaces, special characters, etc.). For example, "Kryloff Technologies" gets matched to just "Kryloff Technologies", "KRYLOFF technologies", but not to "KRYLOFF   (two or more spaces or line breaks)   Technologies".
  4. The literal search method "All words from the phrase ignore case" allows finding documents that contain just all words in your Search Phrase. You may enter only some part of the words. As an example, the search phrase "Kryl Tech" will be matched to "Kryloff Technologies"; also selected will be any other documents containing, for example, the words "Kryloff" and "Technical", "Kryloff" and "Technology", (anywhere in the text or document title), etc.
  5. The literal search method "At least one word ignore case" selects documents containing at least one (any) word you have entered into your search phrase. For example, issuing the query "Kryl Tech" will cause SSSpider to select documents which contain either "Kryloff" or "Technologies", or both as well as "Technology", "Technical", etc.

    All methods described above (including the "Subject Search" one) are case insensitive: the patterns "Kryloff", "KRYLOFF" and "kryloff" are considered to be the same. The following four methods make distinction between uppercase and lowercase characters when searching for occurrences of your search terms; the rest of functionality of these methods has been just described above in this section:

  6. The literal search method "All words in the same order match case" makes distinction between capital (uppercase) and small (lowercase) characters. See also: "All words in same order ignore case".
  7. The literal search method "Entire pattern match case" selects documents which contain your phrase bodily (as you type it in) without any changes. See also, "Entire pattern ignore case" described above.
  8. The literal search method "All words from the phrase match case" is identical to the method "All words from the phrase ignore case" described above except for the fact that it makes distinction between capital and small characters.
  9. The literal search method "At least one word match case" is also similar to "At least one word ignore case", but it treats capital and small letters as absolutely different ones and does not match them to each other (even if they represent the same sound in the language of your choice).
8. Query Status
Up to the Options picture

The Query Status field reflects the current status of the query: it can be either Postponed, Running, or Completed.

9. The Show Report button
Up to the Options picture

Click this button to see the report built by SSSpider. For postponed and running queries, this button is grayed out.

10. The Delete Report button
Up to the Options picture

Click the button to delete the report that SSSpider has built. Once the report is deleted, the query status gets changed to "Postponed", after which you may modify the rest of the query options and run this query, again. For postponed and running queries, this button is grayed out.

11. Minimum % (percentage) of relevancy with which quotations are to be included into the final report
Up to the Options picture

The Min. Relevancy field (Minimum percentage of relevancy) is only relevant to those queries for which you are going to or have obtained a comprehensive report. SSSpider generates reports with the most relevant quotes/extracts from the documents accessed during the search. Each quotation is ranked for relevancy with some value in the 0%-100% range. This option instructs SSSpider to skip those quotations which do not exceed the value you specify in this field. The higher the value, the fewer quotes. As a general rule, 20%-50% is a good value to place here.
See also: How SSSpider works, step 3.

12. Size of quotations (applicable to comprehensive reports only)
Up to the Options picture

This options tells SSSpider how many characters (approximately) are to be extracted from downloaded documents and placed into the final report. A value of 500-1,500 characters is a reasonable one as it allows you to read the most relevant paragraphs extracted from different Web pages by looking directly through the report. That way you can quickly determine whether a document contains information that is relevant to your search.

13. Controlling the way SSSpider should extract quotations from a particular document
Up to the Options picture

If the box "Allow multiple quotations from the same document" is checked, SSSpider may include in its report more than one quotation from the same document. This may happen for big documents, which obviously, may contain information that is relevant to the subject of your query, in two, three or more paragraphs. Apart from fully enabling or disabling the appearance of several quotations from the same document, you may control the way how SSSpider extract quotations. Specify the number of characters SSSpider should skip when it places a quotation into the report before it can resume quoting the same document, again. If the option "Skip Between Quotations" is not specified, (or, is set to zero) SSSpider includes as many relevant quotes as it finds.

14. Date of creation/last modification (applicable to comprehensive reports only)
Up to the Options picture

If checked, this option instructs SSSpider to process only documents that have been created or modified on or after the date you specify.

Note: Some Web servers do not report the date a particular document or Web page has been last modified or created. If these documents are still relevant to the subject of your queries, SSSpider does include corresponding quotations in its reports.

15. Domain suffixes
Up to the Options picture

Enter domains suffixes to narrow your search to the specified domains only. For example, to make SSSpider process pages, which URL's end up with either ".net" or ".com", enter each of them clicking the Add Suffix button.

If you do not specify any domain suffixes, SSSpider will download and investigate all documents. If the list of domains is not empty, then before downloading each document SSSpider will be checking if its Internet address (URL) is within the list of domain suffixes you have specified. If it is, SSSpider downloads, investigates, and if the document contains relevant information, includes corresponding quotation(s) into the final report.

Note: Some search engines hide true locations of pages in their reports (i.e., provide links to some intermediate Web servers, which redirect your browser to the original document location only in one or several steps). Despite a high enough level of the SSSpider intelligence, this may still cause the product to skip pages it should have processed. It is recommended that you check if SSSpider does correctly resolve these links by issuing a couple of test queries: one with and one more without domain suffixes, and only after it start running queries with a non-empty set of domain suffixes. As one more recommendation, include in your list domain suffix(es) of the engines you use.