Boulder Language Technologies

  • Narrow screen resolution
  • Wide screen resolution
  • Auto width resolution
  • Decrease font size
  • Default font size
  • Increase font size
Workshop Agenda PDF Print E-mail

Workshop Agenda

Saturday, August 16th

The workshop agenda just below is divided into two parts. In the morning, we will review language resources needs from the perspective of researchers and funding agencies, summarize current language resource efforts and models, and examine two complementary models in some detail (LDC and CSLU). We will discuss these topics, and review and/or revise the topics for afternoon breakout groups.

After lunch, the workshop turns into a WORKshop. The first set of breakout groups will consider what language resources are needed to advance human language technology. The second set of breakout groups will consider models for funding, developing and distributing these resources. The participants will then present their recommendations, outline a report that incorporates them, and assign responsibility for authoring the sections. It is expected that this report will have a major impact on the formulation of new policies and initiatives.

MORNING

 

6:30 AM -
7:30 AM
Breakfast -
8:00 AM Welcome Gary Strong
8:10 AM Welcome and overview
of workshop agenda
Ron Cole
8:20 AM Language resources for NSF Gary Strong
8:40 AM Language resources for DARPA Alan Sears
9:00 AM Language resources for DoD Lynn Carlson
9:20 AM Break -
9:40 AM LDC and language resources Mark Liberman
10:30 CSLU's model for
language resources
Ron Cole
11:00 AM Break -
11:20 AM Resources for word sense
identification
George Miller
11:40 AM ``The Discourse Initiative'' Suzann Luperfoy

 

AFTERNOON

 

12:00 Noon Discussion and tasking
for first breakouts.
-
12:30 PM Working lunch and first breakouts.
Visit lunch buffet and take lunch
to your breakout session. Return
with maximum two foils for plenary
briefing
-
2:15 PM First breakout reports -
2:45 PM Select topics for second breakouts -
3:15 PM Break -
3:30 PM Second breakouts: what models? -
5:00 PM Second breakout reports:
5 minutes each
Mark Liberman
5:45 PM Workshop Report
(1) Generate outline of final report
(2) Assign authors to sections and subsections
(3) Establish timelines, etc.
-
6:45 PM Reception and dinner -

 

 


FIRST BREAKOUT SUBJECT:
WHAT LANGUAGE RESOURCES?

The overarching question to be addressed by the first set of breakouts is: What language resources are needed to support our National agenda? This question should be addressed in the broadest sense-- language resources include (annotated) written and spoken corpora, static images, videos of people conversing, creation of standards and evaluation methodologies, as well as tools for creating, learning about and developing language resources and technologies.

The following proposed breakout topics are grouped in terms of a general model of information retrieval-- requesting information; locating information; organizing information; and presenting information. Within each group, we ask participants to address the issues of (a) data resources (e.g., annotated corpora, lexicons, images, videos); (b) tools and technologies; (c) standards and evaluation metrics.

  • 1. Language Resources for Requesting Information. What resources are needed to develop technology to enable anyone, anytime, anywhere to request information? What resources and technology are needed to create interactive multimodal systems that engage users in a dialogue to retrieve desired information?
  • 2. Language Resources for Information Retrieval. What resources are needed to locate and analyze the wealth of audio, textual, graphic and video information consistent with the request?
  • 3. Language Resources for Organizing and Presenting Information. What resources are needed to support research and development of interactive multimodal systems that can organize, summarize present information in meaningful ways?
  • 4. Language Resources for Learning and Creating. What language resources can support learning about and creating language technology from elementary to higher education.

 

SECOND BREAKOUT SUBJECT:
METHODS AND MODELS

The overarching issue of the second set of breakouts is: How do we fund, develop and distribute language resources to all who need them? One possible set of topics is to divide the world into
  • 1. Funding models (e.g., How do current models fail? What new models should be considered? E.g.,inter-agency program on infrastructure for human language technology)
  • 2. Development models (e.g., Who does what? How is it managed?)
  • 3. Distribution models (e.g., How do we get the resources, tools, technologies into the hands of those who need them? What is the role of the internet?)
Additional topics might include:
  • 1. Data-centered community support for human-centered systems: an infrastructure-program focus for linguistic resources?
  • 2. Maintaining (and replacing) useful tools and data
  • 3. The role of the internet in data-centered research and the provision of linguistic resources for education.
  • 4. Resources for Defense needs.
 
< Prev

Member Login