Applying Textual Analysis and Digital Visualization to a Poetic Body of Work
Project Personnel
Ray Henry, organizer
[resume linked here]
Stacy Waters, consultant
Project Description
A review and selection of both working manuscripts and completed works.
Working manuscripts will be scanned to present a visual transformation from an
initial moment of creation through redaction and revision to the completed work.
Completed works (all extant completed works exist digitally) will also be
transformed in two ways. First, a textual analysis of word frequency for the
presented set of works, presented visually (i.e., not as a table, but as a tag
cloud or something similar). Second, every work will be broken apart into lines
which will be recombined (through particular constraints) into new "machine
works," which can be interactively generated. These visualizations will be
united on a single website.
- Collect works and select the set of
materials to be included.
- Develop structure/organization of website.
- Set up skeleton website based on developed structure.
- Scan working manuscripts.
- Compile completed works and apply textual analysis tools.
- Research and develop tag cloud.
- Break completed works into lines--enter into a database or spreadsheet
structure?
- Develop and implement "machine" to generate new poetry.
- Add all elements to the site.
- test/evaluate (standards compliance, accessibility, does everything work,
etc.).
Updates shown below in red
.
Project Work Plan
TITLE: Develop Presentation Website For Visualizing Poetic Analysis - Work
Plan
- DESCRIPTION: tbd
- TASK LIST
- TASK: Preplanning (6-8 hours)
- Make paper mock-up of website structure
complete
- TASK: Skeleton website (12-15 hours)
- Research website accessibility and interoperability
- Create placeholder pages
- TASK: Collect and select works (15-20 hours, not including travel time to
Portland)
- Gather all potential work - complete
- Identify significant work/examples of work
- Identify files to be digitized and setup naming system
- complete
- Attach filename to each non-digital file
complete
- Change non-compliant digital filenames
complete
- TASK: Scan ~10 pages of working manuscripts (~3+ hours)
- Scan page
- Save archival TIFF image
- Adjust and add copyright information
- Save JPEG version complete
- TASK: Analyze completed works - word frequency (10 hours)
- Convert MS Word documents to plain text
complete
- Run analysis tool complete
- Record results complete
- TASK: Build analysis visualization (10 hours)
- Identify visualization scheme and tool
complete
- Apply data to tool complete
- Archive/link results complete
- TASK: Break apart existing works (5 hours)
- Break works into lines and compile
TEI-compliant XML tagging begun
- Add lines to database/spreadsheet
complete
- TASK: Build "poetry machine" (20 hours)
- Research mechanism to randomly pull lines from spreadsheet/database
complete
- Research mechanism to recombine lines
- perl - complete
- Research mechanism to display poems - complete
- Create "machine" complete
- TASK: Add all elements to skeleton site (15 hours)
- Add selected scans of manuscript pages
- Add text analysis visualization complete
- Add "poetry machine" complete
- TASK: Test and evaluate site (10 hours)
- Check and evaluate functionality
- Check and evaluate accessiblity
- Check and evaluate adherence to standards
- Check and evaluate interoperability
- TIME LINE
- TOOLS/RESOURCES
- Equipment
- CPU
- Scanner
- Archive/back-up media
- Software
- Spreadsheet/database
- Image editing software
- Text editor
- Code development environment (possibly)
- Website development tools (possibly)
Total time: ~130 hours (~16 hours/week for 8 weeks)
Back to the main page