Language Documentation:

Language Documentation: Theory, Practice, and Values

 

Harvard University

Emerson Hall

Cambridge, MA

 

 

9-11 July 2005

Download this program as a printable pdf.

 

Impelled by concerns about the accelerating loss of languages, language archiving has emerged as a prominent issue for linguists and for heritage language speakers alike. The quality of documentation available for an endangered language can determine the success of its revitalization. This conference builds on the LSA Conversation on Endangered Languages Archiving which met 5-6 January 2005 immediately before the LSA Annual Meeting. Six themes will be treated in conventional paper presentation sessions, in poster sessions, and in panel presentations with open discussion.

The requirements of field linguistic training

The concerns and involvement of the heritage language communities

The question of what is adequate documentation

The uses of documentation in speaker communities

Training and careers in field linguistics

Ethics and archiving best practices

 

Acknowledgements

 

We thank the following organizations and individuals for their support of this conference:

 

National Science Foundation: Joan Maling and James Herbert

Harvard University: Jay Jasanoff, C-T James Huang, Cheryl Miller, and Kobey Shwayder

MIT: Sabine Iatridou, Mary Grenham, and Suzanne Flynn

 

This conference was funded by an NSF Grant .

 

Public Sessions

 

 

Day 1: Saturday, 9 July 2005

 

Registration

 

Time: 7:30 AM – 12:00 noon

Room: Emerson Hall (hallway in front of Room 210)

Cost: $25.00 (cash or check only)

 

 

Program

 

All sessions will be in Emerson Hall Room 210, unless otherwise indicated.

 

 

Conference Opening and Welcome

 

Time: 8:00 AM

Convener: N. Louanna Furbee (U MO-Columbia/LSA Archivist)

 

 

Session I Training for Language Documentation

 

Chair: Peter Austin (School of Oriental Studies [SOAS], London)

 

8:15 AM Keynote Speaker: Peter Austin (SOAS)

 

Training in Language Documentation: The SOAS Experience

 

 

8:50 AM Carol Genetti (UC-Santa Barbara)

 

Components of a Summer Institute in Field Linguistics

 

We present a model for a biennial Summer Institute in Field Linguistics to be held at UC-Santa Barbara beginning in 2008. The institute will consist of two independent phases: a six-week phase containing two courses in field methods, and a shorter phase containing workshops on practical aspects of fieldwork. Topics could include recording techniques, discourse transcription, digital archiving, software and fonts, intercultural and inter-personal issues, and maintaining health and well-being during fieldwork. We seek feedback from the broader linguistics community so that we can mold the institute to best meet the needs of the field.

 

 

9:15 AM Frances Ajo, Valerie Guerin, Ryoko Hattori, & Laura Robinson, (U HI-Manoa)

 

Native Speakers as Documenters: A Student Initiative at U HI-Manoa

 

 

9:40 – 10:00 AM BREAK

 

 

Session II What Is Adequate Documentation?

 

Chair: K. David Harrison (Swarthmore C/Chair, CELP)

 

10:00 AM Keynote Speaker: K. David Harrison (Swarthmore C/Chair, CELP)

 

Ethnographic Dimensions of Documentation

 

 

10:40 AM Laura Buszard-Welcher (Rosetta Project/UC-Berkeley)

 

Necessary and Sufficient Data Collection:

Lessons from Potawatomi Legacy Documentation

 

Our modern concept of necessary, or 'core' language documentation remains much the same as it has for the last 75 years: a grammar, dictionary, and text corpus. While this set is certainly necessary, it is demonstrably not sufficient. The collection of Potawatomi legacy documentation by Charles Hockett is a good illustration: His discussion of morphosyntax, while extensive, was based entirely onwhat he found in collected narratives, and it turns out that conversational morphosyntax is quite different. This example raises the question of whether our own concept of language documentation will be viewed as comprehensive by following generations of researchers.

 

 

11:00 AM Verónica Vázquez Soto (UNAM, México, DF)

 

Documenting Different Genres of Oral Narrative in Cora (Uto-Aztecan)

 

We focus on documenting a broad range of oral genres in Cora (Uto-Aztecan language spoken in Northwest Mexico in the State of Nayarit) oral narrative. Adequate documentation of oral narrative in this language should cover different genres which go from the 'word for playing' or 'toy word' called in Cora niaukari hi'wahkari to the 'ancient word' called in the language niukari mime'ekan. The former genre is usually told for children and covers funny stories about particular animals whereas the latter genre is properly told for adults, although children can be part of the audience for this narrative. The ancient word covers stories about the creation of the Cora universe, the creation of corn and other important entities of Cora culture. We hope that this project would give rise to a more adequate documentation of one Cora dialect from which we can obtain an extensive collection of texts and a more precise and exhausive dictionary, as well as better studies of the grammar, such as word order, evidentials, and other relevant grammatical and pragmatic subjects. We will show some fragments of texts as well as some fragments of videotapes to illustrate the different styles of the narrators telling a story.

 

 

11:20 AM Discussion (led by K. David Harrison)

 

 

11:40 AM – 12:40 PM LUNCH BREAK

 

 

Session III Models of Successful Collaboration

 

Chairs: Akira Yamamoto (U KS/Past Chair, CELP)

Arienne Dwyer (U KS/DoBeS, Germany)

 

12:40 PM Keynote Speaker: Martha Macri (UC-Davis/Terralingua)

 

Working with Language Communities in Unarchiving:

Making J. P. Harrington Notes Accessible

 

The J. P. Harrington Database Project consists principally in transcribing and coding the linguistic and ethnographic notes on American Indian languages collected by Harrington during the first half of the 20th century. The men and women he interviewed were often among the last remaining speakers of their language--the notes a treasure of indigenous knowledge that otherwise would have been lost. Perhaps of greatest value to Native American community scholars in cultural and language revitalization, the project has been especially successful in combining efforts of both the academic community and native scholars.

 

 

1:10 PM Philip Cash Cash (U AZ/Nez Perce Tribe)

Susan Penfield (U AZ)

 

Technology-Enhanced Language Revitalization

 

We present a progress report on recent efforts to establish meaningful links between technology-based language instruction and the field of language revitalization. Through a collaborative grant between the Colorado River Indian Tribes (CRIT) and the University of Arizona, a project was implemented to train CRIT speakers of Mohave and Chemehuevi to use software which supports preservation and pedagogy for these languages and to offer this training episode as a model, disseminating information about technology-enhanced language revitalization at the University of Arizona’s American Indian Language Development Institute (AILDI). The results are a growing roster of multimedia-based courses at AILDI.

 

 

1:30 PM Arienne Dwyer (U KS/DoBeS, Germany)

 

An Appropriate Collaborative Model

 

Since local circumstances vary widely, many different models of successful collaboration exist. This short contribution instead explores the commonalities of productive collaborative research. Ingredients including fully consultative planning, clear goal-setting, sufficient training, a smooth workflow, and developing autonomous indigenous capacities. Illustrative examples from several projects are provided.

 

 

1:50 PM Discussion (led by Arienne Dwyer)

 

 

2:20 - 2:40 PM BREAK

 

 

Session IV Uses of Documentation in Speaker Communities: Three Case Studies

 

Chair: Nicholas Ostler (Foundation for Endangered Langs)

 

2:40 PM Dennis Kartammeru O'Brien (U Adelaide, Australia)

 

Waking Up an Ancient Language:

Reclaiming Kaurna of the Adelaide Plains, S Australia

 

 

3:00 PM Faizi Inayatullah (NWFP, Pakistan) WITHDRAWN

Technical Document for Semiliterate Speakers:

Success Story of Kalasha Language, Northern Pakistan

 

Kalasha is an endangered language of the Indo-European family (Indo-Arian subfamily) in the Kafir group of Dardic languages. The Kalasha minorities live in the Chitral district of NWFP, in the Hindu Kush mountain range on the border of Afghanistan in Northern Pakistan. The Kalash were once the dominant people in the area. The Kalash culture as well as the Kalasha language is highly endangered at the wake of forceful conversion. In the case of Kalasha, highly technical reports have been helpful for the semiliterate speaker community in revitalizing the language and promoting 'linguistic activism' among the younger generation.

 

 

3:20 PM Colleen M. Fitzgerald (TX Tech U)

Language Documentation in the Tohono O'odham Community

 

We examine documentation efforts on the Tohono O'odham (or Papago) Reservation in Arizona, focusing on the legacy recordings held in archives or by individual researchers which are inaccessible to community members. These efforts reflect input from community members about community concerns, educational needs, future goals for the Nation, language revitalization, dialect education, and accessibility. These recordings present ethical considerations dealing with 'ownership' and intellectual property, and the treatment of culturally-sensitive collections. A long-term project to transcribe, translate, and publish these O'odham materials has implications beyond linguistics, raising significant questions about who determines the benefit and impact to indigenous communities.

 

 

3:40 PM Discussion (led by Nicholas Ostler)

 

 

Poster Session

 

Room: Emerson Hall 2nd Floor Hallway

Time: 4:10 - 5:30 PM

Chair: Lenore A. Grenoble (Dartmouth C/Past Chair, CELP) & Theme Chairs

 

Theme I: Training for Language Documentation

Chair: Peter Austin (SOAS)

 

Andrew Garrett (UC-Berkeley)

 

Fieldwork as Philology, or, the Boasian Revolution in Linguistics"

 

We present an example or two, from California, of linguistic fieldwork methodology stressing (1) transparency, in the form of documentation that is fully and immediately accessible to the language community, and (2) integration of both modern and archival sources and of structural-grammatical information and textual-sociocultural information.

 

 

David Leedom Shaul (Venito Garcia Lib & Archives, Tohono O'odham Nation)

Danny Lopez

 

Living Latins:

The Role of Literature (Written Broadly) in Language Revitalization

 

We examine the role of literature (oral and written) in different processes of language revitalization. The role of traditional as well as metropolitan genres are examined as well as the role of audience/participant in genres. The role of tribal libraries and other cultural institutions are also examined.

 

 

Theme II: What Is Adequate Documentation?

Chair: K. David Harrison (Swarthmore C/Chair, CELP)

 

Susan Meredith Burt (IL SU)

 

Documentation of Pragmatics and Metapragmatics:

Language Shift and Pragmatic Change in Hmong Language in Wisconsin

 

The pragmatics of minority languages are vulnerable to change early in language shift and should, therefore, be included in language documentation. To research language shift and pragmatic change, 30 speakers of Hmong were interviewed in Wisconsin, using an oral Discourse Completion Task. Responses show that influence from Anglo-American pragmatics has affected younger speakers’ usage in Hmong; in particular, the frequency of please has influenced frequency of the Hmong equivalent thov, causing younger and older speakers to differ in metapragmatic comments on usage. Pragmatics and metapragmatics matter because ordinary speakers, who construct their social worlds through linguistic interaction, may treasure tools that allow them to do so—pragmatic resources of their language.

 

 

Barbara Lust (Cornell U), Elaine Westbrooks (Cornell U), Suzanne Flynn (MIT), & Theresa Tobin (MIT)

 

Developing Adequate Documentation for Multifaceted Cross-Linguistic Language Acquisition Data

 

We describe collaborative work in which we seek to establish best practices for documentating large, continually expanding amounts of language data of various types. Existing multimedia data currently involve thousands of samples of language at various periods of language acquisition (both in child and adult), in various situations (both naturalistic and experimental), and across more than 20 different languages in more than 20 countries. By linking university researchers with university library experts, we seek a documentation system for this and for future data which at once (1) links the data to domain-specific linguistic analyses which are necessary for research; (2) attempts to calibrate across various languages in doing so; (3) links to field-wide standards for this description, e.g. such as those being developed by Emeld (Electronic Metastructure for Endangered Languages Data) now; (4) links to field-wide standards for linking such resources to others in the field, e.g. such as OLAC (Open Language Archiving Community) is developing now; and (5) links to the crucial upper level documentation system of a university library and interlibrary domain, one which, through metadata systems and general web-based ontologies, situates language data in a general knowledge domain and renders the data accessible to general library users. We report on our program, progress, and problems in this endeavor.

 

 

Jess Tauber (Oakland, NJ)

Enough Is Enough?: The Case for Yahgan

 

 

Theme IV: Uses of Documentation in Speaker Communities

Chair: Nicholas Ostler (Foundation for Endangered Langs)

 

Hermelindo Aguilar Méndez (CISC, Comitán, México), Teresa López Méndez (CISC, Comitán, México), Juan Méndez Vasquez (CISC, Comitán, México), Maria Bertha Santiz Perez (CISC, Comitán, México), Ramon Jiménez Jiménez (CISC, Comitán, México), Louanna del Socorro Guillén Rovelo (U Valle de México, Tuxtla Gutierrez, Chiapas, México), N. Louanna Furbee (U MO-Columbia/CISC, Comitán, Chiapas, México/ECOSUR, San Cristobal, Chiapas, México), Robert A. Benfer (U MO-Columbia), & Rolando Tinoco Ojanguren (CISC, Comitán/ECOSUR, San Cristobal, Chiapas, México)

 

Saving Languages, Saving Lives:

Tojolabal Maya Language Revival within a Health Research NGO

 

The Centro de Investigaciones en Salud de Comitán (CISC) engages in epidemiological research and health intervention in southern Chiapas, Mexico, where the predominant indigenous language is Tojolabal Maya. With 50,000 speakers but a shrinking footprint, Tojolabal is in early endangerment. CISC offers a model for other non-linguistic enterprises interested in language renewal. Partnering with the Tojolabal-speaking community, it has conducted collaborative research into bilingual interviewing, published literary works in Tojolabal and Spanish, and sponsored an international gathering of Tojolabal scholars. Its work increases Tojolabal use by adding intellectual discourse as an approprpiate context for the language and encourages documentation centered on sets of coherent subject matters.

 

 

Tjeerd de Graff (Frysk Acad, Ljouwert)

Presenter: Nicholas Ostler (Foundation for Endangered Langs)

 

Siberia's Sound Archives:

From the Early 20th Century to Today's Research and Tomorrow's Communities

 

 

Theme V: Ethics and Archiving Best Practices

Chairs: Alice Harris (U Stony Brook-SUNY) and Martha Macri (UC Davis/Terralingua)

 

David Golumbia (U VA)

 

Representing Minority Languages and Cultures on the World Wide Web

 

This poster offers an evaluation of websites describing minority languages from around the world with an eye toward developing best-practices guidelines for their construction. We argue that the most effective websites are not necessarily the most comprehensive in terms of the specifically linguistic information they include (dictionaries, grammatical description, text archives, etc.) since these can give native speakers a feeling of estrangement from their own language, while also creating the impression that the language is a static artifact. The best websites strike a balance between linguistic representation and a sense of social exchange with the larger world.

 

 

Comfort Pratt (TX Tech U)

 

Death of a Dialect: The Case of Adaeseño Spanish

 

Nick Thieberger (U Melbourne, Australia)

Presenter: David Nash (Australian Ntl U)

 

Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC)

 

The Pacific And Regional Archive for Digital Sources in Endangered Cultures (PARADISEC) has been operating since mid-2003 based in a consortium of four universities in Australia. In that time, we have located and digitized some 850 hours of field recordings, with a further 3,000 hours in the queue. We have established a methodology for description of this material and are experimenting with linking fieldnotes and textual material with media. We are particularly involved in training and advocacy for language documentation methods that result in good archival data and in repatriating these recordings once digitized. This poster covers the main issues dealt with by the project, which is the result of a community of interested linguists and musicologists working collaboratively in a cross-institutional partnership.

 

 

Helen Aristar-Dry (LINGUIST List/E MI U), Anthony Aristar (LINGUIST List/Wayne SU), Naomi Fox (LINGUIST List/Wayne SU), Susan Hooyenga (LINGUIST List/EMI U), Steve Moran (LINGUIST List/E MI U), Megan Zdrojkowski (LINGUIST List/E MI U)

 

Creating Language Resources That Last:

The E-MELD School of Best Practices in Digital Language Documentation

 

A growing number of projects focus on documenting endangered languages before they disappear, but it is also important to be aware of the need to preserve the documentation itself. Paper, audiotapes, videotapes, and computer storage media are all prone to degradation and destruction. Moreover, the proliferation of digital standards and formats inhibits the interoperability and reusability of language resources. The E-MELD School of Best Practices (http://emeld.org/school/) is designed to teach linguists how to create language documentation that can be accessed and used by the widest number of current speakers and researchers, as well as by future generations.

 

(Refereed posters will be on display from 8:00 AM - 5:30 PM.)

 

 

Day 2: Sunday, 10 July 2005

 

 

Session V Ethics and Archiving Best Practices

 

Chairs: Alice Harris, (U Stony Brook-SUNY)

Martha Macri (UC-Davis/Terralingua)

 

Moderator: Alice Harris

 

8:00 AM Martha Macri (UC-Davis/Terralingua)

 

The Linguist's Responsibilities to the Community of Linguists

 

 

8:20 AM Keren Rice (U Toronto)

 

The Linguist's Responsibilities to the Community of Speakers

 

The past 30 years or so have seen dramatic shifts in what is regarded as the responsibilities of linguists to the communities with which they work. We provide an overview of some of the types of changes that have occurred in what is considered to be responsible fieldwork, focusing on work with Aboriginal communities in Canada. A model has developed that involves taking seriously a number of key points, particularly working with the community to determine what should be researched and how, and carrying out the research in a respectful way.

 

 

8:40 AM Michael Krauss (U AK-Fairbanks)

 

Can Linguistics Be Ethical?

 

Linguistics can be ethical, during the Language Holocaust now under way, by documenting disappearing languages in the field so as also to help them not disappear. Rational coordinated prioritization: along with social factors, e.g. also for languages with fewest speakers, most isolated genetically or typologically, and of course least documented. Responsibility for previous or related work, basic competence, e.g. observational adequacy, sure and accessible archiving. Mutuality of understanding with speakers and community, training of speakers, and strengthening of language status in the process.

 

 

9:00 AM Lise M. Dobrin (U VA)

 

When Their Values Conflict with Ours:

Linguists and Community Empowerment in Melanesia

 

In Melanesia, where hundreds of languages are increasingly endangered, relationships with outsiders are highly desired, and they are empowering for communities because they reflect positively on their value and enable them to galvanize the cooperation and unity necessary for community-level projects to succeed. So even though a language documentation project might appear to us to have been successfully completed, it may be perceived by the people themselves as having failed when the outside linguists withdraw. To understand our role as merely developing resources like grammars or documentation skills is to risk disappointing and even disempowering the Melanesian people whose languages are at stake.

 

 

9:20 AM Discussion (led by Alice Harris)

 

 

9:40 - 10:00 AM BREAK

 

 

Session VI: Training and Careers in Field Linguistics

 

Chair: Martha Ratliff (Wayne SU/Past Chair, CELP)

 

10:00 AM Keynote Speaker: Paul Newman (IN U)

 

Field Methods Courses at American Universities:

The Institutional (Non)Response to the Endangered Languages Movement

 

This paper presents the results of a recent survey of field methods courses in America. Surprisingly, the situation is little changed from what it was a dozen years ago before language documentation came into vogue. It is suggested that whatever value field methods courses may have, that value does not lie in preparing students for actual field research. Some reasons for this are: (1) Students in the courses are inadequately trained. (2) Such courses have to cater to "casual takers"; (3) their length and intensity are insufficient, (4) the courses bear little connection to the reality of fieldwork.

 

 

10:40 AM Spike Gildea (U OR)

 

Field Linguistics Training at the University of Oregon

 

 

1l:00 AM Mary Laughren (U QLD, Australia)

Jane Simpson (U Sydney, Australia)

 

A Survey of Australian Fieldwork Training Programs

 

We discuss fieldwork training in Australia, including internships, field methods courses, and computer skills workshops. We discuss the results of a survey (21 respondents) carried out by the Linguistics Program (Melbourne U) in 2004/2005 for the LSA Committee for Endangered Language Preservation on training students to carry out linguistic fieldwork in Australian and New Zealand universities. (Collated Cathy Bow, Australian Linguistic Society Newsletter, Feb. 2005; http://www.als.asn.au). We talk about the role of Aboriginal Language Research Centres in Australia as a focus for fieldwork and intermediaries between university linguistics departments and Aboriginal communities. We discuss career opportunities for field linguists..

 

 

11:20 AM Discussion (led by Martha Ratliff)

 

 

Scribes' Summary Report

 

Time: 11:45 AM –1:00 PM

 

Doug Whalen (Haskins Labs/Endangered Lang Fund)

Jeff Good (Max Planck Inst-Leipzig/OLAC)

 

 

 

Closed Session

 

Day 3: Monday, 11 July 2005

 

 

Session VII: Extending the LSA Conversation on Archiving Endangered Languages

 

Chair: N. Louanna Furbee (U M)-Columbia/LSA Archivist)

 

Participants: LSA Conversation Participants from Oakland Meeting & Other Invitees