Workshop Agenda
Saturday, August 16th
The workshop agenda just below is divided into two parts.
In the morning, we will
review language resources needs from the perspective of researchers
and funding agencies, summarize current language resource efforts
and models, and examine two complementary models in some detail
(LDC and CSLU). We will discuss these topics, and review and/or
revise the topics for afternoon breakout groups.
After lunch, the workshop turns into a WORKshop. The
first set of breakout groups will consider what language resources are
needed to advance human language technology. The
second set of
breakout groups will consider models for funding, developing and
distributing these resources. The participants will then
present their recommendations, outline a report that incorporates
them, and assign responsibility for authoring the sections.
It is expected that this report will have a major impact on
the formulation of new policies and initiatives.
MORNING
6:30 AM -
7:30 AM
|
Breakfast
|
-
|
|
8:00 AM
|
Welcome
|
Gary Strong
|
|
8:10 AM
|
Welcome and overview
of workshop agenda
|
Ron Cole
|
|
8:20 AM
|
Language resources for NSF
|
Gary Strong
|
|
8:40 AM
|
Language resources for DARPA
|
Alan Sears
|
|
9:00 AM
|
Language resources for DoD
|
Lynn Carlson
|
|
9:20 AM
|
Break
|
-
|
|
9:40 AM
|
LDC and language resources
|
Mark Liberman
|
|
10:30
|
CSLU's model for
language resources
|
Ron Cole
|
|
11:00 AM
|
Break
|
-
|
|
11:20 AM
|
Resources for word sense
identification
|
George Miller
|
|
11:40 AM
|
``The Discourse Initiative''
|
Suzann Luperfoy
|
AFTERNOON
|
12:00 Noon
|
Discussion and tasking
for first breakouts.
|
-
|
|
12:30 PM
|
Working lunch and first breakouts.
Visit lunch buffet and take lunch
to your breakout session. Return
with maximum two foils for plenary
briefing
|
-
|
|
2:15 PM
|
First breakout reports
|
-
|
|
2:45 PM
|
Select topics for
second breakouts
|
-
|
|
3:15 PM
|
Break
|
-
|
|
3:30 PM
|
Second breakouts:
what models?
|
-
|
|
5:00 PM
|
Second breakout reports:
5 minutes each
|
Mark Liberman
|
|
5:45 PM
|
Workshop Report
(1) Generate outline of final report
(2) Assign authors to sections and subsections
(3) Establish timelines, etc.
|
-
|
|
6:45 PM
|
Reception and dinner
|
-
|
FIRST BREAKOUT SUBJECT:
WHAT LANGUAGE RESOURCES?
The overarching question to be addressed by the first set of
breakouts is: What language resources are needed to support
our National agenda? This question should be addressed in
the broadest sense-- language resources include (annotated)
written and spoken corpora, static images, videos of people conversing,
creation of standards and evaluation methodologies, as well
as tools for creating, learning about and developing language
resources and technologies.
The following proposed breakout topics are grouped in terms of a
general model of information retrieval-- requesting information;
locating information; organizing information; and presenting
information. Within each group, we ask participants to address the
issues of (a) data resources (e.g., annotated corpora, lexicons,
images, videos); (b) tools and technologies; (c) standards and
evaluation metrics.
-
1. Language Resources for Requesting Information.
What resources are needed to develop technology to enable anyone,
anytime, anywhere to request information? What resources and
technology are needed to create interactive multimodal systems
that engage users in a dialogue to retrieve desired information?
-
2. Language Resources for Information Retrieval.
What resources are needed to locate and analyze the wealth of audio,
textual, graphic and video information consistent with the request?
-
3. Language Resources for Organizing and Presenting Information.
What resources are needed to support research and development of
interactive multimodal systems that can organize, summarize
present information in meaningful ways?
-
4. Language Resources for Learning and Creating. What language
resources can support learning about and creating language technology
from elementary to higher education.
SECOND BREAKOUT SUBJECT:
METHODS AND MODELS
The overarching issue of the second set of breakouts is:
How do we fund, develop and distribute language
resources to all who need them?
One possible set of topics is to divide the world into
-
1. Funding models (e.g., How do current models fail? What new models
should be considered? E.g.,inter-agency program on infrastructure for
human language technology)
-
2. Development models (e.g., Who does what? How is it managed?)
-
3. Distribution models (e.g., How do we get the resources,
tools, technologies into the hands of those who need them?
What is the role of the internet?)
Additional topics might include:
-
1. Data-centered community support for human-centered systems: an
infrastructure-program focus for linguistic resources?
-
2. Maintaining (and replacing) useful tools and data
-
3. The role of the internet in data-centered research and the provision
of linguistic resources for education.
-
4. Resources for Defense needs.
|