--- ../htdig3-1-x/htdoc/FAQ.html Thu Jan 31 11:45:37 2002 +++ htdoc/FAQ.html Fri Feb 8 15:33:19 2002 @@ -408,6 +408,12 @@ You can find the answer yourself to almost all "how can I..." questions by exploring what the various configuration attributes and search form input parameters can do. + Also have a look at our collection of + Contributed Guides + for help on things like + HTML + forms and CGI, tutorials on installing, configuring, using, and + internationalizing ht://Dig, as well as using PHP with htsearch.

Finally, if you've exhausted all the online documentation, there's the @@ -503,7 +509,7 @@ version of ht://Dig?

The ht://Dig package can be built on the Win32 platform when using the Cygwin package. For details, see the contributed guide, - + Idiot's Guide to Installing ht://Dig on Win32.

@@ -719,7 +725,7 @@ following line would do it:

For CGI and PHP, you need a "wrapper" script to do that. For perl script examples, see the files in contrib/ewswrap. The PHP guide (see contributed + href="http://www.htdig.org/contrib/guides.html">contributed guides) not only describes a wrapper script for PHP, but also offers a step by step tutorial to the basics of ht://dig and is well worth reading. @@ -1235,7 +1241,7 @@ require group foo 4.22. How can I get htdig to ignore the robots.txt file or meta robots tags?

You can't, and you shouldn't. The - + Standard for Robot Exclusion exists for a very good reason, and any well behaved indexing engine or spider should conform to it. If you have a problem with a robots.txt file, you really should @@ -1857,7 +1863,7 @@ find /path/to/your/document/root -type f

forbidden by server robots.txt!
A substring of the URL matches one of the items disallowed in the servers robots.txt file. See - + A Standard for Robot Exclusion. This message exists only in the 3.2.0 betas. In 3.1.5 and earlier, this condition is only caught later, resulting in the message --- ../htdig3-1-x/htdoc/htdig.html Sun Jan 27 07:44:57 2002 +++ htdoc/htdig.html Fri Feb 8 15:33:19 2002 @@ -218,7 +218,7 @@ htmerge, htsearch, Configuration file format, and - + A Standard for Robot Exclusion.
--- ../htdig3-1-x/htdoc/meta.html Sun Jan 27 07:44:57 2002 +++ htdoc/meta.html Fri Feb 8 15:33:19 2002 @@ -211,8 +211,8 @@ "follow" and "all" which obviously are the opposite of the other values and are the default behavior. For more information on META robots tags, check out the - - HTMLAuthor's Guide to the Robots META tag. + + HTML Author's Guide to the Robots META tag.
keywords --- ../htdig3-1-x/htdoc/require.html Mon Jan 28 07:44:51 2002 +++ htdoc/require.html Fri Feb 8 15:33:19 2002 @@ -43,7 +43,7 @@ * Robot exclusion is supported
- The + The Standard for Robot Exclusion is supported by ht://Dig.
--- ../htdig3-1-x/htdoc/where.html Mon Jan 28 07:44:51 2002 +++ htdoc/where.html Fri Feb 1 12:47:21 2002 @@ -82,13 +82,13 @@ htdig.europeanservers.net (HTTP) - (FTP) + (HTTP) - (FTP) + (HTTP) - (FTP) + (HTTP) - (FTP) + www.it.htdig.org @@ -103,6 +103,13 @@   + opdenbrouw.nl + (HTTP) + (HTTP) + (HTTP) + (HTTP) + +