Introduction to Email Extractor™

Email Extractor™ is your personal Web crawler that automates the retrieval and selection of e-mail addresses available on the World Wide Web. The product allows you to search for e-mail addresses available on a Web site or sites you specify. The resulting reports supply you with a list of e-mail addresses the product extracts from Web pages being scanned. The product builds it reports either as HTML files allowing you to view them or, as MS Excel® spreadsheets for further editing, sorting, or exporting e-mail addresses into your personal databases.

You may operate Email Extractor™ in the background while you work on other tasks. When the search is complete, Email Extractor™ creates a report of its findings which you may access using the product built-in browser, your default one, MS Excel® or, store it for later viewing, editing or processing by external software packages. Both speed and performance make Email Extractor™ an essential tool for investigating the Web.

  1. Getting started with Email Extractor™
  2. What can Email Extractor™ do for me?
  3. How does Email Extractor™ work?
  4. System requirements

1. Getting started

When you run Email Extractor for the first time, displayed is a window entitled "Email Extractor - NONAME". This is a sample query folder containing several queries you may use to get up and running immediately. This allows you to quickly get familiar with Email Extractor as well as check its search capabilities. You may enter new queries into the "NONAME" sample folder, rename it or, create your own custom query folders based on subjects of interest and Web sites you provide.

Search results are stored in these folders allowing you to build up a portfolio of search reports grouped by your favorite topics or sites:

 Sample Query Folder

The first couple of rows of the sample query folder have queries such as, "www.ibm.com", etc. that are addressed to specific Web Sites. To execute any of these queries, just double-click a corresponding icon using the left mouse button or press the Run button on the application toolbar.

You may also enter new queries by clicking the New button on the application toolbar or, by pressing the combination of keys <Ctrl+N>. If, for example, you wish to enter a similar query for AOL (located on the Web at "www.netscape.com" and "www.aol.com"), press Ctrl+N, then point the mouse to the Web Sites field on the left pane (Query Properties), and type "www.netscape.com; www.aol.com". Simply press the Run icon on the toolbar, after which Email Extractor will start extracting e-mail addresses and building a report of its findings.

Email Extractor automatically uses defaults under the other fields on the Query Properties pane using a set of standard preferences. These preferences are user definable, however, the standard defaults have been tuned to optimize initial searches for you as a beginner.

If you are not connected to the Internet, you will be prompted to connect to your on-line provider via the Dial-Up Networking "Connect to..." dialog (applicable to those customers who use the telephone dial-up connection). Once you are connected to the Internet, Email Extractor will begin processing your queries. As soon as the first query is processed (either within the time or document limit set in the Query Options pane on the left side of the product window), you'll hear a voice sound telling "Your report is ready" (in English). This means that Email Extractor has completed the query and loaded the report with its findings into the browser (either built-in or your default one) or, into MS Excel® if specified so in the product Options dialog.

You may browse this report while Email Extractor continues gathering information for other queries, which once complete, are also automatically loaded into your browser.

Shown below is an example report built by Email Extractor and containing e-mail addresses the product has extracted from www.netscape.com. Email Extractor has placed the report in the NONAME Query Folder. You may view any of these queries at a later date by opening an appropriate query folder (i.e., noname.mef) and clicking the Report button. Pressing the Library button calls the Librarian which lists all reports on your search queries.

Sample report: E-mails extracted from "www.netscape.com" and "www.aol.com"

Note: Actual e-mails and links in this example report have been modified by
inserting the three-dotted sign (...) or replaced with those to the Kryloff Web site.

E-mail:Found on page:Page title:Page modified:
yellow...@aol.comhttp://yp.netscape.com/main.adp?pid=...Netscape Yellow Pages - Your Online Business Directorytoday
web...@netscape.comhttp://wp.netscape.com/.../netcenter/Netscape Centertoday
privacy...@aol.comhttp://about.aol.com/.../aolnetwork/Privacy Policy - About AOLtoday
gencom...@aol.comhttp://www.aol.com.au/.../copyright.phpAOL Terms and Conditions of Usetoday
help...@primusonline.com.auhttp://www.aol.com.au/.../privacynotice.phpPrimus-AOL Privacy Noticetoday
shop...@aol.com.auhttp://shop.aol.com.au/...Welcome to Shop@aol.com.autoday
aussie...@aol.comhttp://www.aol.com.au/.../contact_aol.phpContact Primus-AOLtoday
aperry...@primustel.com.auhttp://www.aol.com.au/.../contact_aol.phpContact Primus-AOLtoday
ozads...@aol.comhttp://www.aol.com.au/.../submission.phpAdvertise with Us - Submissiontoday
usua...@aol.comhttp://www.aol.com.mx/ayuda/...AOL Mxico -today
darbenefic...@aol.comhttp://www.aol.com.mx/.../beneficios/AOL Mxico - Beneficiostoday
funkalic...@joannebz.comhttp://www.aol.com.au/.../joanne.phpAOL Live Transcriptstoday
ramon...@aol.comhttp://www.aol.com.pr/...AOL Puerto Ricotoday
info...@eforcers.comhttp://aol.tarjetasnico.com/ar_es/...Aol - Argentina :: Las mejores tarjetas virtuales y postales animadas en Internettoday
margarit...@aol.comhttp://www.aol.com.pr/.../seasonals.pdfUntitled - PDF Fileyesterday
rosario...@aol.comhttp://www.aol.com.pr/.../seasonals.pdfUntitled - PDF Fileyesterday
joan...@aol.comhttp://www.aol.com.pr/im/.../seasonals.pdfUntitled - PDF Filelast year
tom...@nccn.nethttp://search.aol.com.au/?q=Equatorial...Primus Onlinetoday
hilda...@aol.comhttp://www.aol.com.pr/.../invita.htmlAOL PRyesterday
tosare...@aol.comhttp://www.forosaol.com.ar/.../index.phpAOL Forostoday
web...@ucy.ac.cyhttp://search.aol.com.au/?q=Cyprus...Primus Onlinetoday
web...@www.kzhttp://search.aol.com.au/?q=Kazakhstan...Primus Onlinetoday
 ... (this example report has been truncated) 

See also: Submitting Queries and Viewing Results

2. What can Email Extractor do for me?

Email Extractor provides you with a wide range of smart, unique, and easy-to-use searching capabilities:

3. How does Email Extractor work?

Email Extractor obtains e-mail addresses in several steps:

Step 1. Having entered one or more URL's of Web sites you are going to search in, launch your queries by clicking the main menu items Query, and then Run or Run off-line. Email Extractor starts processing your queries.

Step 2. Email Extractor recognizes hyper-links on Web pages and automatically follows them (receiving up to 20 documents at a time) allowing to build a Web site index. Starting from the root of the tree (the URL or URL's you type into the Search Sites field), Email Extractor first goes horizontally through the site looking for all documents on a given level and then vertically avoiding possible loops and hyper-links that fall outside the Web site(s) under investigation.

Step 3. Email Extractor parses each document using an intelligent e-mail extraction engine embedded into the program, after which the product creates a corresponding report containing all of its findings.

Step 4. The final step for Email Extractor is to place the reports into a specified library and (if user preferences are set up to do so), launch the built-in, your default Internet browser or MS Excel® display the query report with e-mail addresses found.

Step 5. Email Extractor repeats the above steps until all the queries you issued have been completed. This way you may run multiple queries and as Email Extractor works in the background, continue working on other tasks.

Note:
Email Extractor keeps the most recently received documents in the Temporary Internet Files folder for subsequent searches. This allows Email Extractor to quickly access these documents next time you launch a new query or work off-line.

Example:
Assume, you want to investigate the Kryloff Technologies Web Site (located at "www.kryltech.com")
. Type in or select www.kryltech.com under the Web Sites field in the Query Options pane. Click Query, then Run from the menu bar. Below is a simplified example scheme of this Web Site, which shows documents and hyper-link connections between them:

Root:  
www.kryltech.com
  (link 11) (link 12) (link 13)  
Level 1:
www.kryltech.com/news.htm
www.kryltech.com/aboutkt.htm
www.kryltech.com/langsupp.htm
  
  (link 21) (link 22) (link 23)   (link 31)
Level 2:
www.pkware.com
www.kryltech.com/download.htm
www.kryltech.com/feedback.htm

1. Email Extractor starts investigating by cataloging the root document located at www.kryltech.com.

2. When the document is received, then fully scanned and processed, Email Extractor follows the links 11, 12 and 13 and retrieves documents "www.kryltech.com/news.htm", "www.kryltech.com/aboutkt.htm", and "www.kryltech.com/langsupp.htm".

3. Having investigated "www.kryltech.com/news.htm", Email Extractor does not follow link 21 as it goes outside the www.kryltech.com Web Site. Links that lead Email Extractor outside a specified Web Site are always skipped by the product.

As there are no more links in "www.kryltech.com/news.htm", Email Extractor extracts new links from "www.kryltech.com/aboutkt.htm" and "www.kryltech.com/langsupp.htm" and continues investigation by following links 22 and 23.

4. When "www.kryltech.com/download.htm" and "www.kryltech.com/feedback.htm" have been downloaded, Email Extractor finishes investigating this Web Site as the only new link (31) leads to a previously loaded document. Indeed, link 31 points to the root www.kryltech.com, a location from which Email Extractor has started its investigation.

5. The resulting report contains a list of all e-mail addresses that have been found on the pages within the "www.kryltech.com" Web site.

See also: Submitting Queries and Viewing Results and Editing Email Extractor Preferences

4. System requirements

Help Contents