I've uploaded a first attempt at an automatically-generated site map: http://www2.genealogy.net/gene/tmp/up/pages/sitemap.html This particular map was truncated at two levels deep. The site mapping program derives the hierarchy of the website solely by crawling the pages. It does a breadth-first crawl and assigns a page's position when it is first encountered (necessarily at its shallowest level in the crawl). The program knows about the standard language variant filenames. It knows how to extract page names from the <TITLE> field in the header. I think it needs hints to improve the hierarchy. These might be derived from the visible headers we've been putting in the files. Your feedback is invited. -- =Jim Eggert EggertJ@LL.mit.edu
participants (1)
-
Jim Eggert