Traversal program

Lou Montulli
Mon, 12 Apr 93 17:01:08 CDT

Last week there was quite a bit of talk about traversing the Web
to compile a list of all Web documents.

I have tweaked my traversal program a little for WWW documents
and it looks like it will work.

The traversal program only attempts to follow http: links and
keeps a list of all links as they are traversed.
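
For readers who want something concrete, here is a minimal sketch of
that kind of traversal. It is not the program discussed in this
message; it only illustrates following http: links while keeping a
list of what has been traversed. The names traverse, LinkExtractor,
fetch and max_docs are all hypothetical. fetch is assumed to be any
callable that returns a document's text (or None on failure), and
max_docs is an assumed safety cap so that a run cannot go on forever
in a dense subtree.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag in a document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def traverse(start_url, fetch, max_docs=100):
    """Breadth-first walk that follows http: links only.

    fetch(url) is a hypothetical callable returning the document text,
    or None if the document cannot be retrieved. max_docs is an assumed
    cap on how many documents are visited in one run.
    """
    seen = {start_url}          # every URL ever queued, to avoid loops
    queue = deque([start_url])
    visited = []                # documents in the order they were read
    while queue and len(visited) < max_docs:
        url = queue.popleft()
        text = fetch(url)
        if text is None:
            continue
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(text)
        for href in parser.hrefs:
            # Resolve relative links and drop any #fragment.
            target, _fragment = urldefrag(urljoin(url, href))
            if urlparse(target).scheme != "http":
                continue        # skip gopher:, ftp:, and so on
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return visited
```

Passing fetch in as a parameter keeps the sketch independent of any
particular network library, and lets it be tried against a small
in-memory dictionary of pages before it is pointed at real servers.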

The question I have now is: Should I run it? I'm not entirely
sure what it will do. It will certainly put a big load on
the network. Will it get bogged down in some incredibly dense
subtree? What day of the week would be the best time to run?

Currently the URL, document title and the link name that referenced
the document are saved in a tab-delimited format. Are there
any tabs in document titles?
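
One way to make the question moot, sketched here as an assumption
rather than as what the program currently does, is to collapse any
whitespace inside a field before writing it out. The helper names
clean_field, format_record and parse_record are hypothetical.

```python
def clean_field(text):
    """Collapse every run of whitespace (tabs, newlines) to one space."""
    return " ".join(text.split())


def format_record(url, title, link_name):
    """Build one tab-delimited line: URL, document title, link name."""
    return "\t".join(clean_field(f) for f in (url, title, link_name))


def parse_record(line):
    """Split a line written by format_record back into its three fields."""
    url, title, link_name = line.rstrip("\n").split("\t")
    return url, title, link_name
```

With the fields cleaned this way, a stray tab or newline in a title
cannot be mistaken for a field or record separator, at the cost of not
preserving the title's original spacing exactly.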

What does everyone else think?


