CS5604 Project Ideas

There are many possible projects suitable for CS5604 on Information Storage and Retrieval. Please look over the suggested projects in the list below, or propose a project of your own creation to the instructor for refinement and consideration.

Background for Projects

To better understand the various projects discussed below, you may want to study the information available from the links below.

  1. Digital Library Research Laboratory (at Virginia Tech): follow links regarding MARIAN, Envision, Open Archives Initiative, PetaPlex
  2. Regarding Electronic Theses and Dissertations (ETDs), see Networked Digital Library of Theses and Dissertations and the Search Site for ETDs
  3. Regarding NSDL, see the NSF site for the National Science, mathematics, engineering, and technology education Digital Library

Suggested Projects

Note that ones with "(n)" before the title have "n" subparts.
  1. NCSTRL OAI: Saving NCSTRL, moving it to OAI
  2. (4) LOVE: Working with U. of Missouri and NCSA on NSDL
  3. UVC: Archiving and preservation research with Raymond Lorie, IBM
  4. 5S
  5. 5SL
  6. Envision: Backend and Zooming
  7. TREC
  8. Internet2 Distributed Storage Initiative and OAI
  9. NCSTRL OAI: Saving NCSTRL, moving it to OAI
  10. Open Archives Distributed
  11. Open Archives Annotation
  12. Open Archives on Campus
  13. Open Archives Cross-Lingual Search
  14. Open Archives High Performance Search Engine
  15. Open Archives Rating Service
  16. Open Archives Recommender Service
  17. Open Archives Submission: Metadata is More than Forms
  18. Incorporating Web Site Management Tools into PIPE
    students: Naizheng Bian(nbian@vt.edu), Luhui Hu(luhui@vt.edu), Yahong Gu(yagu1@vt.edu)
  19. Community Arts Network
  20. UVA: 3D Visualization and Interoperability
  21. IR Education Module
  22. DL for InfoSecurity
  23. GetSmart collaboration
  24. CITIDEL support with IR
  25. CITIDEL support for building courses

Old Projects from CS4624 in Spring 2001, that can be continued

  1. ETDs - ResearchIndex to build an ETD citation database and detect plagiarism
  2. (10) PetaPlex - multiple projects on 100 processor 2.5 Tbyte system at VT
  3. ETDs - XML method for authoring works
  4. e-numerate on Campus handling and visualizing data with XML
  5. ETDs - Selective Dissemination of Information (SDI) to route interesting works based on user profiles
  6. ETDs - Similar Documents to detect plagiarism and find similar ETDs

Other Old Projects from CS4624 in Spring 2001, for Reference

  1. (5) VT Museum of Natural History work to build digital library of images/VR/info
  2. Implementing Bioinformatics Tools on the PetaPlex
  3. MARIAN digital library system developed at VT - complete porting to Java, add capabilities to support multimedia
  4. ENVISION interface developed at VT for visualizing search results
  5. Open Archives Initiative (OAI) annotation service
  6. Open Archives Initiative (OAI) on campus support
  7. Physics Digital Library and the Open Archives Initiative
  8. ETDs in Germany and US and the Open Archives Initiative
  9. ETDs - Federated Search to search multiple OAI sites for NDLTD
  10. ETDs - Selective Dissemination of Information (SDI) to route interesting works based on user profiles
  11. National Science Digital Library - CS History Proposal
  12. National Science Digital Library - CS Collection Proposal
  13. National Science Digital Library - CS Crawling and Analysis
  14. UDLA interoperability - install, apply visualization
  15. MIRAGE digital library system developed in Korea - install, make interoperable
  16. Phronesis digital library system developed in Mexico - make interoperable
  17. Greenstone digital library system developed in NZ - make interoperable
  18. Text Retrieval Conference work to compare our systems with others
  19. 5S Framework for Digital Libraries
  20. DLDL Digital Library Definition Language
  21. NCSTRL VT getting CS tech reports working
  22. Blacksburg Country Club web site, virtual tour, etc.
  23. Audio archiving in XML - with an audio engineer and expert on preservation
  24. IIT database handling of text retrieval for large collections
  25. IBM VM Linux - experiment with course s/w on multiple Linux VMs
  26. IBM VideoCharger - install and experiment with streaming video
  27. (2) CS Video - work with Sandra Birch to produce CS video(s)
  28. Set Top Box - work with Prof. Ing-Ray Chen on set top box solutions
  29. Finding applets and other helpful resources to learn about computing
  30. WPI Communications
  31. Hospitality and Tourism Dept. online course

Project Archiving

Please be sure to format your final reports in the form of websites, with internal links to all the documentation and software - that way we can archive the projects indefinitely. Please keep it simple, since we will use a plain vanilla Linux web server. For an example of what we will do, see the CS4624 Spring 2001 collection.

Project Teams

To learn about personality issues that may help you select a project group, and work more effectively with other group members, see a variant of the Myers Briggs personality test that you can take in about 5 minutes. (For background you might enjoy the newsgroup for the discussion of personality typing systems.)