Transkripsjonsrettleiing for ScanDiaSyn



Guidelines draft: Transcription with ELAN for Ethiopian spoken languages

Kristin Hagen September 2015

Innhald

0 About the guideline 2

1 Advice for transcription 2

1.1 Backup* 2

1.2 File names 3

1.3 Interviewer and informant names 3

1.4 Write a work log 3

2 Transcription and proofreading in ELAN 4

2.1 ELAN 4

2.2 Audio files in ELAN 4

2.3 Starting a new transcription and define speakers 5

2.4 Continue with a transcription 5

2.5 Segmentation 5

2.5.1 Useful shortcuts, Segmentation Mode 7

2.6 Transcription 7

2.6.1 Choose your own font 7

2.6.2 Transcribing (and proofreading) in Transcription Mode 8

2.6.3 Useful shortcuts, Transcription Mode 10

2.7 Proofreading 10

2.7.1 Proofreading in Annotation Mode 10

2.7.2 Useful shortcuts, Annotation Mode 11

3 Transcription rules 11

3.1 Transcribe everything* 11

3.2 When the transcription rules are not enough 11

3.3 Division of speech in segments * 12

3.3.1 Segmentation * 12

3.4 Annotate extra information and non-linguistic sounds 13

3.4.1 General tag principles * 13

List of dependent and independent tags * 14

3.4.2 Sensitive information and other elements from the recording that should not be transcribed * 15

3.5 Orthographic transcription 16

3.5.1 Main rules 17

3.5.1.1 Variation 17

3.5.1.2 Contraction 18

3.5.2 Exception from the main rule 18

3.5.3 Words which is not in the dictionary or word list 18

3.5.3.1 Abbrevations 19

3.5.3.2 Compounds 19

3.5.3.3 Dialect 19

3.5.3.4 Words from other languages 19

3.5.3.5 New words, swearing 20

3.5.4 Quotations 20

3.5.5 Interjections 21

3.5.6 Numbers 21

3.5.7 Names 21

3.5.8 Non-linguistic sounds* 22

3.5.9 Breaks, pauses and unclear parts 22

3.5.9.1 Aborted words* 22

3.5.9.2 Unfinished statements 23

3.5.9.3 Pauses* 23

3.5.9.4 Unclear parts or words* 24

3.5.10 Capital letters and punctuation* 24

4 Proofreading 24

5 Overview: shortcuts, tags and interjections 24

5.1 Overview shortcuts 24

5.2 Tags 26

5.3 Lists of interjections 27

6 ELAN on a new computer 28

About the guideline

These guidelines are based on guidelines for three Norwegian speech corpora projects from the Text Laboratory: NoTa-Oslo, Nordic Dialect Corpus and LIA.

The guidelines have to be complemented with separate guidelines for each Ethiopian Language.

?? (two question marks) is used in sections in the guidelines where there is an obvious need for new explanations or examples.

If the transcriptions are going to be a part of a speech corpus, you have to follow the instructions in chapter marked with * carefully. You also have to use ELAN as described in section 2.

In section 1 you will find a short general introduction to transcription and some useful information about file and informant names. Section 2 is more detailed and practical and gives advice about transcription and a short introduction to ELAN. Section 3 presents a draft for transcription guidelines for Ethiopian languages. Section 4 deals with proof reading and section 5 gives an overview of shortcuts in ELAN.

Advice for transcription

It is important to establish good routines and habits when doing transcriptions. The goal is that the work should progress as quickly and flawlessly as possible. Not everything that works for one person works for everyone, but below you will find some advice on good practice:

 

- Take frequent short breaks where you focus your eyes on something else and stretch your legs!

- Do not work more than four hours consecutively with transcription.

- Learn and use the shortcuts.

- Focus on what you hear. It is easy to write what you expect to hear, not what is actually said.

1 Backup*

If you work on a laptop or a computer not connected to a university server, you have to remember to take a backup of all your audio files and a backup copy of your transcription after every working session. Use a memory stick or save it on another computer.

2 File names

?? How to write file names for Ethiopian files?

We use this pattern for our resent project:

Audio files:

name of recording place or dialect _ university abbrivation _ running number (starts again on 1 for each place.

Example:

valdres_uio_01

valdres_uio_02

Transcription files:

Get their names from the audio file with initials from the trancriber backmost

Example:

valdres_uio_01_kh

valdres_uio_02_kh

3 Interviewer and informant names

?? Please decide for Ethiopian informants!

Informants (speakers) gets their name from the audio file plus a number for each informant.

Please note that blanks are not allowed!

Examples:

valdres_uio_0101

valdres_uio_0102,

valdres_uio_0103

valdres_uio_0201

valdres_uio_0202

4 Write a work log

?? Do you want your transcribers to write a log? Our transcribers have to save a log at our common project area at the university server after each working session. The file is named their name and log:

They write date – hours worked – which file they worked on (from start time slot to end time slot) – and a comment about the file (here they can write if the speech was hard to understand etc). In this way we can keep an eye on what they do and how fast/slow they work,

Transcription and proofreading in ELAN

1 ELAN

ELAN is a free, transcription program from the Max Planck Institute and The Language Archive in the Netherlands.

For more information about ELAN, visit the webpage:

You can also download ELAN from this site.

 

In this section we will briefly describe how the program works. On the website you will find a full manual for the program.

ELAN has three different modes:

- Segmentation Mode

- Transcription Mode

- Annotation Mode

In Segmentation Mode you can easily divide the speech flow into sections of reasonable time length to be transcribed in Transcription Mode afterwards. In Annotation Mode you can both segment and transcribe, but not as efficiently. We therefore recommend Annotation Mode for proofreading only.

It is probably most efficient to alternate between periods with segmentation and periods with transcription for variety.

2 Audio files in ELAN

To get a waveform view of the audio file (see the chapters above) you have to have a wav version of your audio file.

You can use the freely available Audacity tool to convert your mp3 files:



Choose Import file and Export As wav.

Windows media player files can also be converted to wav. If it is not possible on your recording devide, you can use for instance this free software:



3 Starting a new transcription and define speakers

- Choose File ( New

- In the window to the left choose the appropriate audio file

- Check for Template and choose the template-cois.etf in the window to the left

?? Do you want to make a common template file for all transcriptions? We have a template for the LIA project containing a predefined speaker1.

- Give the speakers in the audio file names. Each speaker has its own tier under the wave forms in Annotation mode (which is the default mode when opening a new file in ELAN). The order of the speakers/tiers has no meaning. You give names to the tiers as follows:

o Choose Tier ( Change Tier Attributes.

In the dialog box Tier Name and Participant should have the same names as in the filename. In template_cois speaker1 and interviewer1 are defined in advance but the names should be changed.

o In addition, remember to check:

▪ Annotator: Set your initials here.

▪ Parent Tier: none.

▪ Linguistics Type: utterance type.

▪ Default language may stand as System default.

If you need to define more speakers, click Add and fill out the dialog box in the same way. You can define new speakers any time during transcription.

4 Continue with a transcription

Double-click on the transcription file you want to work with, and you will automatically bring up both the transcription and audio file. You can also open the program ELAN and then choose Open and the correct transcription file. The audio file will be opened automatically.

5 Segmentation

In Segmentation Mode the audio stream is divided into smaller parts or segments to be transcribed later. For each time we start and end a segment, a timecode is written in the transcription and in this way the transcription will be linked together with the audio file. You can read more about segments, replies and turn taking in Chapter xx.

- Choose Options ( Segmentation Mode.

Note that it is not possible to transcribe in Segmentation Mode!

[pic]

Use the play button on the media player or easier, press Ctrl + Spacebar to listen to the recording. . Stop the media player in the same way. If you have selected an area in the wave form, use Shift + Spacebar to play the exact area.

You can switch between active tiers with the up and down arrows. The active tier can be divided into segments (annotations) using the cursor (crosshair) and enter key. Set the cursor where you want the segment to begin in the waveform and select the start with the enter key. Then, set the cursor where you want the segment to end and press the enter key again. You now have a segment!

If you transcribe a speaker who talks for a long time without breaks and wish to split the speech into several segments without pauses between, press enter twice to end the old segment and start the new one.

You can move the cursor in the waveform using the mouse or by using shortcuts (see below). A segment can be moved or expanded/shortened by clicking and dragging.

In a conversation or interview with two or more participants, it may be useful to concentrate on segmenting one tier (or speaker) at a time: First, you can divide into segments speaker A completely, then speaker B. It is also possible to switch between tiers and speakers, see the illustration

[pic]

You can change the size of the waveform using the small button on the bar to the right at the bottom of the page.

1 Useful shortcuts, Segmentation Mode

Note that some of the shortcuts are different for a PC and a Mac.

|Function |PC and Mac | |

| | | |

|Merge a selected segment with next segment |Ctrl+A | |

|Merge a selected segment with the former |Ctrl+B | |

|Segment | | |

|Divide selected segments |Ctrl+Enter[1] | |

| | | |

|Select a tier as the active tier |Arrow up | |

|Select the tier below as the active tier |Arrow down | |

| | | |

|Play/Pause |Ctrl+Space | |

|Play a selected area |Shift+Space | |

Use the mouse to move the cursor or use the shortcuts.

|Function |PC and Mac |

| | |

|Move the cursor one second to the left |Shift+Arrow left |

|Move the cursor one second to the right |Shift+Arrow right |

Ctrl/Cmd+Shift+Arrows can be used for moving the curson a little to the right or left.

6 Transcription

In Transcription Mode you can transcribe the part of the recording you have already segmented in Segmentation Mode. Note that you cannot edit the segmentation in Transcription Mode.

1 Choose your own font

If you have a special font installed on your computer, you can easily choose it as the front used for transcription in ELAN. Choose Tier and Change Tier attribute. In the dialogue box, choose More options and then Tier Font.

2 Transcribing (and proofreading) in Transcription Mode

Choose Options ( Transcription Mode.

When you switch to Transcription Mode for the first time, you have to configure the Transcription Mode settings. Choose columns 1, Utterance type and font size 12.

[pic]

Under Select tiers all speakers must be chosen.

You now get a display with the segments made in Segmentation mode shown under one another with the names of the tiers in different colors.

[pic]

There are several functions for playing the recording:

- You can click in the white transcription field

- The Tab button works as play/pause

- Shift + Tab plays from the start of the segment

- Enter goes to the next field and plays

- Alt + Arrow up/down shifts fields and plays

Select the Loop Mode (upper right over the waveform) if you want the segment to be played repeatedly until stopped by Tab

You can change the speed by adjusting Rate (line under Volume). As a rule, it should be Rate 100, but if something is very unclear or the speaker talks very quickly, try to adjust Speed.

The waveform in the left window may be of help. The size of the waveform can be changed by dragging the right edge. If you want to change the segments, you must return to Segmentation Mode or Annotation Mode.

3 Useful shortcuts, Transcription Mode

|Function |PC and Mac |

| | |

|Go down to the next segment |Enter/Alt+Arrow down |

|Go up to the segment above |Alt+Arrow up |

| | |

|Play/Pause |Tab  |

|Play the seggment once more from the start |Shift+Tab    |

Use the same shortcuts as in Segmentation Mode for moving the cursor (see Chapter 2.4.2).

7 Proofreading

Transcription Mode is also suitable for proofreading, but you can use Annotation Mode too where you get a good overview of both segmentation and transcription. You can read more about proofreading in Section 4.

1 Proofreading in Annotation Mode

Choose Options ( Annotation Mode.

In Annotation Mode you can proofread both transcription and segmentation.

[pic]

Double-click on a segment (here called an annotation) to enter and correct the transcription. By using Alt+M the annotation is shown in a small window and can be edited there. Press Ctrl+Enter to save the changes. Segment boundaries can be changed by clicking and dragging while pressing the ALT-button.

2 Useful shortcuts, Annotation Mode

|Function |PC and Mac | |

| | | |

|Play/Pause |Ctrl+Space | |

|Play selected area |Shift+Space | |

| | | |

|New annotation here (selected area) |Alt+N | |

|Delete annotation |Alt+D | |

| | | |

|Go to annotation above |Ctrl+Arrow up | |

|Go to annotation below |Ctrl+Arrow down | |

|Go to annotation to the right and edit |Ctrl+Alt+Arrow right | |

|Go to annotation to the left and edit |Ctrl+Alt+Arrow left | |

| | | |

|Move the left segment boundary a little to the left |Ctrl+J | |

|Move the left segment boundary a little to the right |Ctrl+U | |

|Move the right segment boundary a little to the left |Ctrl+Shift+J | |

|Move the right segment boundary a little to the right |Ctrl+Shift+U | |

Transcription rules

The following chapters can be used as a skeleton or as inspiration for a description of transcription rules for each language. Please note that if the transcriptions are going to be a part of a speech corpus, you still have to follow the instructions in chapter marked with * carefully.

1 Transcribe everything*

In principle, every word spoken in a recording should be transcribed. We have special rules for non-linguistic rules, sensitive information etc., see below.

If there are parts of a recording you don’t want to transcribe, please use a comment tag (see chapter xx) and describe what is left out and why.

2 When the transcription rules are not enough

Even though the transcription rules are detailed, you will probably find examples where you do not know what to do. Perhaps there is no rule for that exact example or perhaps the example is a borderline example. Please write down the example and where you found it so that the case can be discussed later.

3 Division of speech in segments *

When we transcribe, we must divide the speech flow into smaller parts (here called segments) in a meaningful way. In ELAN the sound is represented by a waveform and the different speakers have one tier each. We can therefore divide the speech flow into segments for each of the speakers on their own tier without consideration of concepts like turn-taking and replies. The result may look like the dialogue in a screenplay. The example below is from a transcription in Annotation Mode:

[pic]

1 Segmentation *

How big or small should a segment be? In a transcribed speech corpus we will preferably have neither too short nor too long segments. Firstly, we hope for segments that are complete sentences (even though we know that spoken language are full of hesitations, stops, stutters etc) because we want to annotate the corpus with an automatic part of speech tagger. Secondly, we hope for entities that provide meaningful results when we're searching for transcribed words via a search tool such as Glossa, see the picture below where Glossa gives the search word as a result together with the segment where the word belongs:

[pic]

All in all, an “ideal segment” probably lasts approximately 10 seconds and contains a longer meaningful entity. In the real word, here are some rules of thumb:

- A segment can contain one word only if the speaker only gives replies like yes, no, hm.

- If more than one speaker speaks at the same time: treat each speaker individually and segment/transcribe on each tier separately.

- If a speaker talks for a long time: try to divide the speech flow into smaller segments. Use

o Smaller pauses

o Intonation

o Completion of grammatical unit/sentence

[pic]

4 Annotate extra information and non-linguistic sounds

A normal conversation often contains non-linguistic elements like sniffing, coughing, laughing etc. The conversation may also contain information that should not be transcribed like introductory comments or sensitive information. Some parts of the speech can also be unclear and impossible to understand. We have developed a system for annotating such elements, see the chapters below.

1 General tag principles *

+ Dependent tags: The plus character and a letter characterize the following word or word group.

Example:

( “he” is pronounced unclear in the recording, that means that the transcriber is not sure that it is “he” being said

( ) Ordinary brackets are used around a group of words being characterized by a +-tag. Note that there are no spaces between the tag and the bracket.

Example:

( both «he» and «is» are pronounced unclear.

{} We mark comments with curl brackets (AltGr+7 og AltGr+0), see also chapter xx.

Example:

% Independent tags: The percent character is used for independent tags that represent an independent incident in the speech flow like laughter, coughing etc.

Example:

( %l the speaker laughs after she has uttered “that was fun”.

More than one tag can follow a word. The tag order is not important.

Døme:

( the words inside the brackets are unclear and are said with laughter.

Brackets inside brackets are not possible. If more than one word inside a bracket should be tagged, each word must be tagged separately (a) or the problem must be solved like (b).

List of dependent and independent tags *

Below is a list of dependent and independent tags. The tag names should be in English, but each language can choose their own tags with their own code as long as the tag system is kept as described.

|Dependent | |Independent | |

|+x |X | |Dialect word, word from another language, slang – not described in the |

| | | |dictionary |

|+u |Unclear |%u |unintelligible |

|+l |Laughing |%l |Laughter |

|+y |Yawning |%y |Yawn |

|+w |Whispering |%v |Whisper |

|+s |Sighing |%s |Sigh |

| | |%t |throat-clearing |

|+c | |%c |coughing |

| | |%q |(meaningful) clicking sound |

| | |%o |onomatopoeia |

Note that the same tags can be used both independently and dependently. In (a) the whole statement is said yawning, while in (b) the yawn comes afterwards.

3 Sensitive information and other elements from the recording that should not be transcribed *

Do not transkribe:

- Smalltalk about the recording before the conversation or interview has started. Use the comment tag and {preliminary comments} (on the interviewer tier):

[pic]

We also use {closing comments} in the same way about smalltalk in the end of a recording session.

[pic]

Disruptions in the conversation like when someone is coming into the room or someone is answering their phone etc. can be marked with the comment {disruption}.

- Use the comment tag {sensitive information} instead of transcribing sequences containing sensitive personal data. According to Norwegian Social Science Data Services this can be: Information about racial or ethnic origin, political, philosophical or religious beliefs, that one person has been suspected, charged or convicted of a criminal offense, health conditions, sexual relationships, membership in labor unions etc.

[pic]

- Names of friends, family, colleagues etc. There are special rules for names, see chapter xx.

5 Orthographic transcription

For standardized languages we will recommend an orthographic transcription. For other languages we will recommend a shallow phonetic transcription.

In the following chapters we will describe more what this means and what to do with words that are not described in the a dictionary or word list.

?? Here you perhaps need another description.

1 Main rules

We follow three main rules for transcription:

Main rule 1: Write word forms you find in a dictionary or word list.

?? New examples – these are Norwegian!

Main rule 2: Do not change the word order. Transcribe everything the speaker says, word for word, even if you think the word order is wrong according to syntactic rules.

?? New example!!!

Main rule 3: Use approved word forms from the dictionary even if they are used on the «wrong» place according to normative grammar.

?? New examples

1 Variation

To use a dictionary as a norm can in some cases seems brutal. But it is normally easy when you get used to it. Please note that we use the dictionary as a norm both regarding phonological root variation and variation in inflection suffixes.

?? Examples or remove the chapter?

2 Contraction

We do not mark contractions.

?? New examples or remove the chapter?

2 Exception from the main rule

Note that we keep the speakers gender on nouns!

?? ?? New examples or remove the chapter?

3 Words which is not in the dictionary or word list

Many spoken words are not in the dictionary. This may be dialect words, newer words, swearwords or words like eh, mm and the like. These words also have to be transcribed in one way or another and below we describe the guidelines for dealing with them.

1 Abbrevations

?? New examples or remove the chapter?

Abbreviations are written as abbreviations, but with small letters: dvd, cd, pc.

2 Compounds

?? New examples or remove the chapter?

Many compounds are not written in the dictionary, but should nevertheless be normalized in the usual way if you can find the last word of the compound in the dictionary.

3 Dialect

Even if a word is a word not found in the dictionary, we have to try to normalize it. Mark dialect words with +x.

?? New examples or remove the chapter?

4 Words from other languages

Treat words from other languages the same way as dialect words, that means tag them with +x. The words should be normalized according to a dictionary or wordlist on that language.

One exception to this rule is quotations:

?? New examples

5 New words, swearing

Treat new words, slang, swearwords etc not found in the dictionary or word list the same way as dialect words: Try to normalize them and tag with +x. Perhaps a search in a text corpus or on Google can help find a good normalized form.

?? New examples

4 Quotations

Quotations are written in quotation marks.

?? New examples

If a speaker talks about the pronunciation of a word, this is also considered a quotation:

Note that we do not use +x in quotations.

5 Interjections

Spoken language are full of words like eh, aha, em, ææææ, oiiiii etc. Some of these words can be found in the dictionary or wordlist as interjections, but not all of them. Please make your own interjection list with spoken interjections and their intended meaning, and insert it in XX.

Note that we not write vocal length. To letters means two syllables.

Some words are only onomatopoetic without a special meaning like uuuuuuuuu (sound from an ambulance) or uææææ (from a baby). You can use the independent tag %o for such cases.

6 Numbers

We write numbers with letters, not numerals.

?? New examples

Ordinal numbers are treated the same way.

7 Names

Normally, we do not transcribe person names. Instead we write codes:, F for female names (first name or first name plus surname), M for male names (first name or first name plus surname), and S for surname (without first name). To separate the different names, we also number them continuous: F1, F2, F3 etc.

?? New examples

Public persons like politicians, actors, football players etc. are not anonymized.

Name of places, football teams, pets and organisations etc. are not anonymized if there are no other sensitive information involved. If you want to anonymize a name like that, use codes like N1m N2 etc.

Names are normally written with capital initials. Names of books, films etc. are treated like quotations.

8 Non-linguistic sounds*

Beside interjections, speakers often make non-linguistic sounds like coughing, laughing, clicking sounds etc. It's easy to get caught up in individual sounds and spend a lot of time transcribing it accurately. Remember that the sounds we are most interested in are those that potentially mean something. Laughter almost always makes sense, but an uncontrolled cough does not. Still, coughing sometimes means that the speaker wants to get attention. Or perhaps the speaker discretely couches skeptically.

We only mark non-linguistic sounds we think have a meaning in the conversation and which is listed in chapter xx.

9 Breaks, pauses and unclear parts

1 Aborted words*

When a speaker starts saying a word and then stops before the word is completed, we transcribe what she says and write a – character behind: tr- pi- etc.

If a person stutters inside a word, we transcribe the word orthographically:

?? New examples

Another variant is when the first part of a compound is repeated as a strengthening:

Hesitation before a word is uttered is marked if we consider it as meaningful. Use the hesitation word ee.

2 Unfinished statements

If an utterance is not being properly ended and another speaker starts to speak, perhaps as an interruption, the unfinished statement are completed with three periods:

If a speaker interrupts one's self and start with another utterance, we do not transcribe …

?? New examples

Read about pauses in the next chapter.

3 Pauses*

Pauses can occur within or between segments (3.2.2). Within a segment a short pause or break is marked with #, and longer pauses with ##. Pauses between segments are only shown by the time distance between the segments. In the example below we see two pauses, one between two segments and one within a segment:

[pic]

A medium pause should be so long that in practice a pin could fall on the floor or longer (from about 1.5 seconds). A short break is shorter. Often, the pause can be seen as a straight line in the wave form. Below are first a medium long and then a short break:

[pic]

Long pauses are not that common. Perhaps they can be defined by saying that a long pause is a pause where the participants in a conversation will find the silence embarrassing.

Sometimes you can hear the speaker breathing. We consider this a pause and not a non-linguistic statement like laughter. If the breathing is more like a statement, use the tag for +s for sighing.

4 Unclear parts or words*

If there are parts of the speech you cannot understand, use the %u tag. If you think you can guess what the speaker says but is not 100 % sure, transcribe the word and use the +u tag.

?? New example

10 Capital letters and punctuation*

We use capital letters in names like in the written language, but we do not start a sentence/segment with a capital letter. We use question marks in questions and quote characters for citations. We do not use semicolon, colon, comma, exclamation characters or periods.

Proofreading

When proofreading, play the segments, one at a time.

Listen carefully and play unclear passages again. Correct the transcription if you find errors.

Overview: shortcuts, tags and interjections

1 Overview shortcuts

Segmentation Mode

|Function |PC and Mac | |

| | | |

|Merge a selected segment with next segment |Ctrl+A | |

|Merge a selected segment with the former |Ctrl+B | |

|segment | | |

|Divide selected segments |Ctrl+Enter[2] | |

| | | |

|Select a tier as the active tier |Arrow up | |

|Select the tier below as the active tier |Arrow down | |

| | | |

|Play/Pause |Ctrl+Space | |

|Play a selected area |Shift+Space | |

| | | |

Use the mouse to move the cursor or use the shortcuts.

Transcription Mode

|Function |PC and Mac |

| | |

|Go down to the next segment |Enter/Alt+Arrow down |

|Go up to the segment above |Alt+Arrow up |

| | |

|Play/Pause |Tab  |

|Play the seggment once more from the start |Shift+Tab    |

Annotation Mode

|Function |PC and Mac | |

| | | |

|Play/Pause |Ctrl+Space | |

|Play selected area |Shift+Space | |

| | | |

|New annotation here (selected area) |Alt+N | |

|Delete annotation |Alt+D | |

| | | |

|Go to annotation above |Ctrl+Arrow up | |

|Go to annotation below |Ctrl+Arrow down | |

|Go to annotation to the right and edit |Ctrl+Alt+Arrow right | |

|Go to annotation to the left and edit |Ctrl+Alt+Arrow left | |

| | | |

|Move the left segment boundary a little to the left |Ctrl+J | |

|Move the left segment boundary a little to the right |Ctrl+U | |

|Move the right segment boundary a little to the left |Ctrl+Shift+J | |

|Move the right segment boundary a little to the right |Ctrl+Shift+U | |

Moving the cursor

|Funksjon |PC and Mac |

| | |

|Move the cursor one second to the left |Shift+Arrow left | |

|Move the cursor a little to the left |Ctrl+Shift+Arrow left | |

|Move the cursor a little to the right |Ctrl+Shift+Arrow right | |

|Move the cursor one second to the right |Shift+Arrow right | |

2 Tags

|Dependent | |Independent | |

|+x |X | |Dialect word, word from another language, slang – not described in the |

| | | |dictionary |

|+u |unclear |%u |unintelligible |

|+l |laughing |%l |laughter |

|+y |yawning |%y |yawn |

|+w |whispering |%v |whisper |

|+s |sighing |%s |sigh |

| | |%t |throat-clearing |

|+c | |%c |coughing |

| | |%q |(meaningful) clicking sound |

| | |%o |onomatopoeia |

3 Lists of interjections

?? Make your own list of interjections

ELAN on a new computer

Make a shortcut to ELAN on the taskbar

a) Computer -> Local Disk (C:) -> Program File. Open ELAN (choose the newest version if there are more than one).

b) Pull ELAN.exe to the taskbar. 

Remove the possibility in Windows of turning the computer display:

Ctrl + Alt + F12.

Choose advanced mode --> options and support --> cross out Hot Key Functionality

If your language choise changes while using ELAN:

Og to the control panel. Choose «Change keyboards or other input methods» and «Change keyboards». Under «General» you can remove the languages you do not need.

You can also og to «Advanced Key settings» and remove the shortcut (Left Alt+Shift) which changes languages in Windows..

Define own shortcuts:

In ELAN you can make your own shortcuts. Choose Edit -> Preferences -> Edit shortcuts.

Here you can for instance define Ctrl + Enter to split selected segments (used as a shortcut in these guidelines).

-----------------------

[1] Note that this shortcut is not defined in ELAN. You can define your ovn shortcuts, see chapter xx.

[2] Note that this shortcut is not defined in ELAN. You can define your ovn shortcuts, see chapter 7.

-----------------------

+u he

+u(he is)

{break}

{preliminary comments}

{sensitive information}

{knock on the table}

{the speaker has the mouth full of candy}

that was fun %l

+l +u(bla bla)

(a) +l(bla +u bla +u bla)

(b) +l +u(bla bla bla)

(a) +y(i am tired)

(b) i am tired %y

The speaker says: We transcribe:

jei jikk på vægen jeg gikk på vegen

jei mener det ass jeg mener det altså

ska vi sjå skal vi se

hva du sa jeg skulle skylle ferdig før du pussa det?

The speaker says: We transcribe:

henne jikk henne gikk

vi snakka på det vi snakka på det

jei ga det til de jeg ga det til de

jei bruker ei maskin jeg bruker ei maskin

da får døm si det sjøl da får dem si det sjøl

de er sånn sjokoladepulver det er sånn sjokoladepulver

låkk ijen vindu lukk igjen vindu (ikke vinduet)

The speaker says: We transcribe:

itte ikke

søvi sovet

hestær hester

prate (presens) prater

The speaker says: We transcribe:

jei kan’ke gå jeg kan ikke gå

de æ’kke slik det er ikke slik

det kom no gutter det kom noe gutter

The speaker says: We transcribe:

ei maskin ei maskin

maskina maskina

de sa det på NRK

det ble vedtatt i FN

det var to dvd-er

a-endelse

l-lyden

bokstavkombinasjonen

trafikksituasjon

barne- og ungdomsarbeider

den fisken ser gøllei ut (= ekkel, dissete)

+x yes

+x (good choise)

jeg elsker ”Blow with the whistle”

da kjørte jeg den "jeg? hæ?" da ljuger jeg

jeg sier ikke ”sne” jeg jeg sier ”snø”

kjuefire

fir å kjue

hunndre å kjuefire

hunndre å fir å kjue

tre tus'n to hunndre å kjuefire

kjuefjære

hunndre å kjuefjære

The speaker says:

jæi så att Kari Hanns'n ga Nina dokumennt'ne till Nills'n

We transcribe:

jæi så att F1 ga F2 dokumennt'ne till E1

The speaker says:

inne-ba-bane, inne- m -bane innebane

We transcribe:

innebane

The speaker says:

kjemmpe-kjemmpe-kjemmpedæili

We transcribe:

Kjempe-kjempe-kjempedeilig

høres ut såmm sånn hær ...

høres ut såmm sånn hæ- ...

menn takk for att du %u

Han sa at (n visste)+u at di skulle kåmme +u

Interjeksjonar vi ikkje endrar stavemåten på:

ee (nøling – uansett lengde på een)

eh (avstandsindikerande)

ehe (”Eg forstår” – to stavingar)

em (nøling)

heh (imponert)

hm (spørjande, undrande. I Nynorskordboka i tydinga kremting

m (nøling, ta til etterretning, nam)

m-m (benektande)

mhm (”Eg forstår” – to stavingar)

mm (bekreftande – to stavingar)

åh (utrop)

Interjeksjonar vi kan tilpasse informanten sin uttale:

aha (overraska) Nynorskordboka

gud a meg (overraska, utrop)

huff a meg (beklaging) Nynorskordboka

hæ (spørjande) Nynorskordboka

jaha (forsterkande ”ja”) Nynorskordboka

næ (overraska, undrande)

nja (tvilande) Nynorskordboka

næhei (forsterkande ”nei”)

ops (oi då)

u (imponert)

uff a meg (beklaging) Nynorskordboka

ææ (konstaterande – to stavingar)

å-å (”oj”)

å ja (overraska)

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download