History
From MWCSWiki
Wednesday, Feb 18th:
- Created the Wiki!
- Read more about HTML in Programming the World Wide Web
- Did more in depth tutorial of MySQL and PHP MyAdmin: http://www.php-editors.com/articles/sql_phpmyadmin.php
Tuesday, Feb 24th:
- Tried to Heritrix to run on rosemary, but am still getting "Starting Heritrix............."
- Tried to Heritrix to run on my Windows machine, but I get a java error when I try to compile it, and their script doesn't seem to work with my command window.
- Got Heritrix to run on Ubuntu. Currently running a job on Rosemary, will have to spend tomorrow learning about Heritrix.
Wednesday, Feb 25th:
- Computer Crashed before it finished downloading rosemary
- Read some of Heritrix user's manual
Monday, Mar. 9th:
- Experienced difficulties getting Heritrix to run on Rosemary again
- Researched ARC files
- Looked into WGET and GUIS associated with that
Tuesday, Mar. 10th:
- Read a little HTML book
Wednesday, Mar. 11th:
- Continued research of ARC files
- Found new Heritrix "How-to-Crawl" tutorial: https://wiki.lib.umn.edu/DI2/HowToCrawl
- Created a MySQL table like the one in the crawling tutorial
Tuesday, Mar. 17th:
- Finally figured out how to find/kill the old Heritrix process
- Downloaded XMing on my Windows system
- Restarted a Heritrix process
- Ran a job on https://wiki.lib.umn.edu/DI2/HowToCrawl
- Unzipped ARC.GZ file
- Downloaded ARC scripts and transferred to rosemary
Wednesday, Mar. 18th:
- Transferred more scripts to Rosemary
- Learned how to get into MySQL through the command-line
- Created dump of SQL database

