• No results found

Toolbox 1! Susan Gehr!! Cell/text (707) !

N/A
N/A
Protected

Academic year: 2021

Share "Toolbox 1! Susan Gehr!! Cell/text (707) !"

Copied!
56
0
0

Loading.... (view fulltext now)

Full text

(1)

Toolbox 1

!

Susan Gehr! !

[email protected]!

(2)

With gratitude

!

l Albert Bickford, Toolbox instructor for InField 2008, 2010 & CoLang 2012!

l Neil Brinneman, Shoebox instructor 2003!

l Shoebox/Toolbox Field Linguist s Toolbox Google Group!

l Schoolmates at University of Oregon, especially Connie Dickinson!

(3)

Course Goals

!

l Review of Toolbox for use in creating lexical

databases!

l Introduction to connections to the topics

related to dictionary projects (workflow,

lexicography, community involvement)!

l Information about how to learn more and get

help after this class!

l Study of how Toolbox is part of the Karuk

dictionary project!

(4)

Getting to know each other

!

l What languages and projects are you working on with Toolbox?!

l Did you bring your own computer and your own data to work with? !

l  Will you be using Toolbox on the computers in Watson 419? If so, I need to ask Jari to install the New Project Package. !

l What are your current Toolbox-related questions or learning goals?!

l Will someone else be setting up your project for you, or do you need to do it yourself? !

(5)

Logistics

!

l Main class: Tuesday June 19 to Friday June 22 


2:15 to 3:45 PM!

l Location of Class: 419 Watson Library!

l Possibilities for consultation outside of class !

(6)

Course Outline

!

l June 19, Tuesday!

l Introductions to our Toolbox projects!

l Discussion of the parts and preparation behind every Toolbox Project!

l Language Encoding (orthography, sort order)!

l Database Type development!

l Workflow !

l Project Management !

l Toolbox life cycle, considerations for migrating data!

(7)

Course Outline 2

!

l June 20, Wednesday!

l Database types, continued!

l Establishing a data structure / cheatsheet!

l What are your fields? What rules do they require?!

l Documenting all decisions that you make about your workflow, whether you work alone or as part of a team!

l Getting to work – Using Toolbox!

l June 21, Thursday!

l More on Using Toolbox – Based on class need & interest!

(8)

Course Outline 3

!

l Friday, June 22!

l Related software (operating systems, Microsoft Word, Windows virtualization

software for Mac users, backup software, etc.)!

l Outputting data!

l For online and print use (dictionaries, wordlists made with filters)!

(9)

Karuk Toolbox Project

!

l Started in November 2003!

l Took 2-week course at JAARS!

l William Bright data and participation!

l Nailing down the language encoding and revising it in 2009!

l Publication of a printed dictionary (2005) and an online dictionary (ongoing with UC Berkeley Linguistics)!

(10)

Toolbox Project Preparation

!

l Before data entry can start, you need to

ʻteachʼ Toolbox some things about your project.!

l Language Encoding!

l Orthography!

l Sort Order !

(11)

Language Encoding Features

!

•  The order for sorting (using that script).!

•  Upper and lower case forms of the characters (if any).!

•  Special groupings of characters, variables, which are useful in examining or searching the data.!

•  A font to represent the character shapes.!

•  Often, a special keyboard to facilitate entering the characters.!

(12)

Exercise: Creating your sort

order

!

•  What is the order of your writing system?!

•  Will your audience expect it to sort as English does, or will they expect some other sort order?!

(13)

Language Encoding

Karuk_2009

!

(14)

Toolbox Project Preparation

!

l Database type!

l What types of information do you want to keep track of?!

l MDF – the Multi-Dictionary Formatter! l What is it? !

l A database type that can function as a data structure standard and a data content standard!

l Read !

l MDFFields  Toolbox  Project  and  then  read  

recommended  parts  of  

(15)

Toolbox Project Preparation

!

l Database type!

l What is your data structure?!

l What are your rules about entering data?!

l Do you keep a cheatsheet data structure and a notebook?!

(16)

What are databases & database types?

!

In Toolbox, databases can be:!

dictionaries!

one text or a collection of texts!

!

In Toolbox, a database type is a file that:!

includes a collection of properties that

defines various fields of the database and some of the methods used for manipulating records.!

(17)

More about database types

!

•  A Dictionary database type might contain:!

– A recommended set of Field Markers!

– Filters used for finding particular records in a lexicon (eg, all nouns, a particular morpheme, words with homonyms, etc)!

(18)

Making your own Dictionary

type

!

•  You can & should make a database type for your language by copying the MDF type and modifying the copy (see Karuk MDF)!

(19)
(20)

What fields might I use?

!

•  See the list of all MDF fields pp. 13-39, Making Dictionaries, a guide to lexicography and MDF.!

•  Take time before you start making lots of entries into your database to, otherwise you might have to either correct them by hand or have someone write a CC Table to correct all the records. !

(21)

Exercise: Choose field markers & make a cheatsheet

!

•  Basic minimum set!

•  \lx - Lexeme! •  \a - Alternate form! •  \u - Underlying form ! •  \ps - Part of speech! •  \ge - Gloss! •  \de - Definition! •  \sd - Semantic domain! •  \nt - Notes!

(22)
(23)

Toolbox Project Preparation

!

l Workflow matters!

l Do you work by yourself?!

(24)

Toolbox Project Preparation

!

l Project management!

l Printed dictionaries have front and backmatter!

l Online dictionaries have websites with

supplemental text, graphics and sound files!

l How will you keep track of master files,

working files, passwords, contact information, and other valuable records?!

(25)

Toolbox Project Preparation

!

•  Toolbox life cycle!

–  Toolbox is nearing the end of its life-cycle. Albert Bickford, SIL, June 2008.!

–  So Toolbox must be really near the end here in 2012?!

•  the [Toolbox] programmer is still active…working on a kind of large new feature which is taking its own sweet time…Our assignment for the foreseeable future is Toolbox. All

programs and people are mortal, but we are doing our best to keep ourselves and Toolbox among the living. Toolbox

Support. May 1, 2012

http://groups.google.com/group/shoeboxtoolbox-field-linguists-toolbox/!

–  For now, I will keep using Toolbox, keeping my eye out for the day that it can t be run on contemporary computers/operating systems and support dries up.!

(26)

End of Day 1?

!

l Suggestions & adjustments for following days based on student need and

preparation for Toolbox 2.!

l Work on Language Encodings and Database Types!

(27)

Day 2

!

l Work on Language Encodings and Database Types!

l Database types & workflow matters, continued!

l Establishing a data structure / cheatsheet!

l What are your fields? What rules do they require?!

l What does the MDF book say about your chosen fields? !

l What if, 5,000 records in, you want to add new fields?!

l Documenting all decisions that you make about your workflow, whether you work alone or as part of a team! l Dictionary work Google Doc!

(28)

On the Computer

!

•  Go to the Project menu > Language Encodings…!

•  Select the vernacular Language Encoding and click the Copy button.!

•  Type your sort order into the Primary Characters box.!

(29)

What to include in your Toolbox

data structure

!

From CoLang 2012 course Lexicography, Dwyer & Bowern.

(30)

Helpful Information

!

•  Scientific Names http://www.itis.gov/!

•  Toolbox

http://www.sil.org/computing/toolbox/!

•  SuperDuper! – for backup on the Mac

http://www.shirt-pocket.com/SuperDuper/

SuperDuperDescription.html!

•  (Carbon Copy Cloner) another backup solution for Mac!

(31)

Day 3

!

l June 21, Thursday! l Filtering data!

l Helps with analysis and problem solving!

l Let s set some up and use them!

l Collaboration documents!

l Data structure matters continued!

l Related software (operating systems, Microsoft Word, Windows virtualization

software for Mac users, backup software, etc.)!

(32)

Filters, Finding and Searching

!

•  Find & Search are two different things in Toolbox!

(33)
(34)
(35)
(36)

Workflow Filter: All Records

after a certain date

!

(37)

High Interest (and multi-part)

Filter: All Personal Names

!

(38)

Day 4

!

l Friday, June 22! l Outputting data!

l For online and print use (dictionaries, wordlists made with filters)!

l Working with a publisher or a printer!

l User testing!

l Does your main audience like your work?! !

(39)

Outputting Data: Print

!

•  Word list – using a filter!

– Choose a \sd – Semantic Domain filter and output a word list. !

•  Whole database!

– Karuk to English!

(40)

Outputting Data:

(41)

Outputting Data:

(42)

Outputting Data:

(43)

Outputting Data:

(44)

Outputting Data:

(45)

Outputting Data:

(46)

Outputting Data:

(47)

Outputting Data:

Filtered Word List

!

(48)

Outputting Data:

(49)

Outputting Data:

(50)
(51)
(52)
(53)

Outputting Data: XML to online

!

(54)

Working with a publisher or

printer

!

•  Find out their technical requirements!

•  Provide them with the format they require. In the case of my first printer, it was hard copy. !

•  See also 10. Completing the dictionary from Making dictionaries: a guide to

(55)

User testing

!

•  Tools for Language Revitalization: The

Online Karuk and Yurok Dictionaries.

Unpublished paper written for SJSU s LIBR 202: Information Retrieval. !

•  Dictionaries can have advisory boards, which could be a community s language committee or a subset of that committee.!

– Reviewing sections of the dictionary, handling questions or disagreements, discussing new words !

(56)

Yôotva!

!

Susan Gehr [email protected] Cell/text (707) 599-2719 http://dictionary.karuk.org/

References

Related documents

Australian Child and Adolescent Trauma, Loss and Grief Network 2010, How children and young people experience and react to traumatic events, ANU College of Medicine, Biology

2012– Mentoring committee, International Association of Relationships Research (IARR) 2012– Awards committee, International Association of Relationships Research (IARR) 2012–

• The integration of PAS into existing hospital systems resulted in the clearing up of J$ hundreds of millions from insurance companies, but we experienced much difficulty

Ordinary least square (OLS) approach was employed to estimate the regression equation with benefit-cost ratio (BCR) of watershed program as dependent variable

To address these differences, 2 separate kits have been developed so that labs using any form of Protein A can take advantage of the robust sample treatment and streamlined

City Council request the Director, Fleet Services to establish a monthly replacement reserve contribution for all off-road vehicles and equipment and that the Director, Fleet

Military systems are in service for a long time. In this period legislation often changes to become more restrictive. For example, 20 years ago asbestos was a

a) Promote safety throughout the Surface Operations Program and assist in advising the Auxiliary and the Coast Guard in matters pertaining to Surface Operations safety. b)