[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

updated jar file for Project Part B



Hi all,

Some of you may have encountered problems when unjaring Project_TaskB_crawl.jar if you are using Windows. This is due to some invalid Windows filenames in the crawledpages directory. The problem has been fixed and the working jar file was just uploaded on the project webpage.

To fix the problem we had to change some of the filenames. Since the HashedLinks and term index still use the original filenames, you are now provided with two additional files in the crawledpages directory that will help you map the original filenames to the modified ones:
- filemap.txt contains all the original filenames and their replacement filename.
- filemap2.txt contains only the filenames that had to be modified. (The files in filemap2.txt are also included in filemap.txt)


Thomas

--

[Tuesday, 03/23/04]