I tried loading a test document into the archive today, and there are a few things to be aware of when doing so:
First, since each page is its own file, there needs to be a way to tell people which file represents the first page, second page, etc. There is a "description" field for each file, and I suggest we use it to designate the order of the pages. It would also be good if it indicated whether the file is the large archival file or the display image.
So a possible model is:
"First page: Large archival quality"
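If we end up filling in the description field for many files, a small helper could keep the wording consistent. This is only a sketch: the function name `page_description`, the "Display image" label, and the fallback wording for pages past the tenth are my assumptions, not anything the archive requires.

```python
# Hypothetical helper for building consistent "description" strings.
# Only "First page: Large archival quality" comes from the note above;
# every other label here is an assumed placeholder.
ORDINALS = ["First", "Second", "Third", "Fourth", "Fifth",
            "Sixth", "Seventh", "Eighth", "Ninth", "Tenth"]

KIND_LABELS = {
    "archival": "Large archival quality",
    "display": "Display image",  # assumed wording for the display copy
}


def page_description(page_number: int, kind: str) -> str:
    """Return a description such as 'First page: Large archival quality'."""
    if page_number < 1:
        raise ValueError("page_number must be 1 or greater")
    if page_number <= len(ORDINALS):
        prefix = f"{ORDINALS[page_number - 1]} page"
    else:
        # Fall back to a numeric label for long documents.
        prefix = f"Page {page_number}"
    return f"{prefix}: {KIND_LABELS[kind]}"


if __name__ == "__main__":
    print(page_description(1, "archival"))
    print(page_description(2, "display"))
```

Keeping the page order at the front of the description means the files should sort and scan sensibly even when the archival and display copies are listed together.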
Second, I'm seeing problems with the transcripts. Since the file names are going to be part of the URLs, they can't contain spaces. The transcripts also don't seem to have any line breaks, which means that when they are loaded into a web browser, the text scrolls off to the side. I am going to try to fix these problems, but it will probably have to be done manually.
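If the manual route proves too slow, both fixes could probably be scripted. The sketch below is one possible approach rather than a tested workflow: the function names, the choice of underscores in place of spaces, and the 80-character line width are all assumptions.

```python
import re
import textwrap


def url_safe_name(filename: str) -> str:
    """Replace each run of whitespace with a single underscore.

    Underscores are an assumed convention; hyphens would work equally well.
    """
    return re.sub(r"\s+", "_", filename.strip())


def wrap_transcript(text: str, width: int = 80) -> str:
    """Hard-wrap each paragraph so that no line is longer than `width`.

    Paragraphs are assumed to be separated by blank lines; a transcript
    with no line breaks at all is treated as one long paragraph.
    """
    paragraphs = text.split("\n\n")
    return "\n\n".join(textwrap.fill(p, width=width) for p in paragraphs)


if __name__ == "__main__":
    print(url_safe_name("letter page 1.txt"))
    print(wrap_transcript("word " * 40, width=40))
```

Wrapping changes only where the lines break, not the words themselves, so the transcript text should stay the same apart from whitespace.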