Action Items

  • Reserve - taken
    • expires 11/18/06?
  • 11/10/06: is still available.
  • 1/5/07: is still available. Do we want it? What about,,, They are all available? It seems like including the word corpus or corpora is a little more descriptive. Bill 16:26, 5 January 2007 (MST)


  • Active Learning
    • Experiments can use already labeled data
    • Simulate users with varying degrees and types of erroroneous behavior
    • Show that we can learn in the face of all user error types and establish measures of trust
    • Then involve real users
  • user trust
  • integrating mediawiki…. and using it to link to other people's datasets
  • system for contributing data sets….
    • 2 pages, one for submissions (public) one for accepted data sets (protected)?
  • use existing stand-off markup schemes (e.g., TEI)
    • allow for distributions over labels
  • allow for alternate tag sets with different granularities to enable people with varying degrees of ability
  • Computing EVSI quickly for an arbitrary joint probability model
  • Compare with Google Image labeling.

Favorite Tasks

  • NLP
    • Word dependencies
    • POS tags
    • Phrases
    • Full parse structure
    • Adding info. in one rep. would assist in deducing the others.
    • Syriac
  • Images
    • image part labeling
  • Classification tasks
    • UC Irvine set
    • contributed data sets
nlp-private/wikibank-brainstorm.txt · Last modified: 2015/04/23 13:34 by ryancha
Back to top
CC Attribution-Share Alike 4.0 International = chi`s home Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0