
Problems of Transcribing Oral Armenian Texts for Corpus Research [Проблемы транскрибирования устных армянских текстов для корпусных исследований]


Academic year: 2021


Khurshudyan V. 2007. Problems of transcribing oral Armenian texts for corpus research [Проблемы транскрибирования устных армянских текстов для корпусных исследований] // NTI 2. Moscow. pp. 37-46.

[…] 1 […] [1, 2, 3]. […] [4, 5, 6, 7, 8], […] [9, 10]. […] 10 […] 20-30 […] 2003-2004 […] (6-8 […]) […] 6-8 […] 160 […] 20.000 […]

1 […] (№ 06-06-80470)

[…] 2006 […] 5 […] 20-30 […] 20.000 […] [11]. […] [12, 13]. 2 […]

[…]:
1. […],
2. […],
3. […].

1. […]

[…] Sylfaen ([…] [14]). […] 1 […] [15], […] [16], […]:
§ […] ‘[…]ʰ’, ‘tʰ’ - /tʰ/, ‘pʰ’ - /pʰ/ […]
§ […] ‘ʁ’, ‘c’ […]

2 […] (INALCO, […]) ([…] 1,5 […]) […] (http://a.donabedian.free.fr/textes/Noojarmenien.pdf). […] (www.digilib.am, http://titus.uni-frankfurt.de, http://www.indo-europees.leidenuniv.nl/index.php3?c=64).

[…] /ł/ […] /x/ […] /ł/ […]
§ […] ‘j’ […] [16], […] ʒ […]
§ […] /u/ […] /u/ […] ‘u’.
§ […] ‘ye/e’, ‘vo/o’, ‘yev/ev’ […] 1 […] 3 […]

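Table 1. […]

Upper | Lower | -³ | LC⁴ | BGN/PCGN⁵ | ISO⁶ | KNAB⁷
A | a | a | a | a | a | a
B | b | b | b (p) | b | b | b
G | g | g | g (k) | g | g | g
D | d | d | d (t) | d | d | d
Ye, E | ye, e | e | e, y | e, ye | e | e, ye
Z | z | z | z | z | z | z
[…] | […] | ē | ē | e | ë | è
Ə | ə | ə | ĕ | y | ə | y
Tʰ | tʰ | t’ | t’ | t’ | t’ | th
Ž | ž | ž | zh | zh | ž | ž
I | i | i | i | i | i | i
L | l | l | l | l | l | l
c | c | x | kh | kh | x | x
C | c | c | ts (dz) | ts | ç | ts
K | k | k | k (g) | k | k | k
H | h | h | h | h | h | h
ʒ | ʒ | j | dz (ts) | dz | j | dz
ʁ | ʁ | ł | gh | gh | ġ | ğ
Č | č | č | ch (j) | ch | č | č
M | m | m | m | m | m | m
Y | y | y | y, h | y | y | j
N | n | n | n | n | n | n
Š | š | š | sh | sh | š | š
Vo, O | vo, o | o | o | o, vo | o | o
Čʰ | čʰ | č’ | ch’ | ch’ | č | čh
P | p | p | p (b) | p | p | p
[…] | […] | […] | j (ch) | j | ĵ | dž
[…] | […] | […] | […] | rr | […] | ř
S | s | s | s | s | s | s
V | v | v | v | v | v | v
T | t | t | t (d) | t | t | t
R | r | r | r | r | r | r
Cʰ | cʰ | c’ | ts | ts | c’ | ch
U | u | w | w | - | w | w
Pʰ | pʰ | p’ | p’ | p’ | p’ | ph
Kʰ | kʰ | k’ | k’ | k’ | k’ | kh
Yev | yev, ev | ev | ew | ev, yev | ew | ew
[…] | […] | ō | ō | o | ò | ò
f | f | f | f | f | f | f

3 […], […] [15]
4 American Library Association – Library of Congress, 1997.
5 United States Board on Geographic Names, 1981.
6 International Standards Organization (www.iso.ch), 1996.
7 […]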
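The correspondences listed in Table 1 lend themselves to a simple lookup structure. The sketch below is only an illustration: it copies four rows of the table (a, t, š, ž) for three of the systems, and the names `TABLE_1_SAMPLE` and `convert` are hypothetical. It assumes the input has already been split into transcription symbols, since digraphs and superscripts make character-by-character conversion unreliable.

```python
# Sample of Table 1: lowercase transcription symbol -> rendering per system.
# Only four rows are copied here; a full table would follow the same shape.
TABLE_1_SAMPLE = {
    "a": {"BGN/PCGN": "a", "ISO": "a", "KNAB": "a"},
    "t": {"BGN/PCGN": "t", "ISO": "t", "KNAB": "t"},
    "š": {"BGN/PCGN": "sh", "ISO": "š", "KNAB": "š"},
    "ž": {"BGN/PCGN": "zh", "ISO": "ž", "KNAB": "ž"},
}


def convert(symbols, system):
    """Convert a pre-tokenized list of symbols into the given system."""
    out = []
    for s in symbols:
        row = TABLE_1_SAMPLE.get(s)
        if row is None:
            # Fail loudly rather than pass an unknown symbol through.
            raise KeyError(f"symbol {s!r} is not in the sample table")
        out.append(row[system])
    return "".join(out)


print(convert(["š", "a", "t"], "BGN/PCGN"))  # prints "shat"
```

Raising on unknown symbols is a deliberate choice for corpus work: a silent pass-through would hide gaps in the table.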

2. […]

[…] ([…] [17]). […] (SG) / […] (NOM/ACC) […] ([…] 3 […] 2, 3, 5, 6, 7, 9, 10 […]). […]

3. […]

[…] «[…]» […]

[…] [4] ([…]). […] [4], […] ([…] ‘a’ = […]:PRS:3SG […] ‘ē’, […] 3 […] 4, 5, 6, 8), ēs ‘[…]’, ēd ‘[…]’, ēn ‘[…]’ […] ays, ayd, ayn ([…] 3 […] 3) […]

[…] (Э[…]), […] Э[…]:
§ […] Э[…]
§ […] Э[…]

[…] Э[…] ([…]) […] ([…] [18]). […] 8 […]

1. […] (INF) […] (RES) […] (CVB) […] (SBJ) […] (NOM) ([…] 3 […] 9, 17, 18) […] (GEN/DAT) ([…] 3 […] 5, 20) […] (ABL) […] (INS)

8 […] 4 […]

[…] ([…] [18]). […] Э[…] Э[…] ([…] 3 […] 6, 8, 9). […] ([…] [4]):

0.1 […]
0.1 […] 0.4 ([…]) ..(0.3)
0.5 […] 0.9 ([…]) ...(0.7)
1 […] ....(1.2)
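The pause notation above (two, three or four dots followed by the measured duration in parentheses) can be sketched as a small formatting function. This is a minimal sketch under stated assumptions: the function name `format_pause` is hypothetical, the band edges are read off the surviving figures (0.1 to 0.4, 0.5 to 0.9, 1 and above), and pauses below 0.1 s are assumed to be left unmarked because the description of that first row did not survive.

```python
def format_pause(seconds: float) -> str:
    """Render a silent pause as dots plus its duration, e.g. '..(0.3)'."""
    s = round(seconds, 1)  # durations in the transcripts have one decimal
    if s < 0.1:
        return ""          # assumption: very short pauses are not marked
    if s < 0.5:
        dots = ".."        # 0.1 to 0.4
    elif s < 1.0:
        dots = "..."       # 0.5 to 0.9
    else:
        dots = "...."      # 1 and above
    return f"{dots}({s:.1f})"


for value in (0.3, 0.7, 1.2):
    print(format_pause(value))
```

Rounding before classifying keeps the number of dots consistent with the printed duration.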

[…]:
1. […]:
[…]) […] (Э[…])
[…]) […] (Э[…])
2. […]:
[…]) […]
[…]) […]
3. […]:
[…]) […]
[…]) […]

[…]:
§ […] + […],
§ […] + […],
§ […] + […] + […],
§ […] + […] + […].

[…] (d) […] h ([…]):

(1) КК-ARM R1-69 […] ([…] + […] + […])
..(1.3)h əm(0.7) ...(0.8) čhkaroʁachav /gnel,
čh-karoʁach-av gn-el
NEG-[…]-AOR:3 […]-INF
‘[…],
khani vor ayn == ..(0.2) ginə šat \barʒr ēr.
khani vor ayn gin-ə šat barʒr ēr
[…]-DEF […]:AUX:PST:3SG
[…].’

[…] ([…] [19], […]), […] [21, 20, 19, 22, 23], […] ([…] [24], […] [25, 19, 26], […] [27]). […] 9 […]

9 […]

[…] [25, 19, 6, 24, 28], […]

Table 2. […]

[…]: 1. əə; 2. əh; 3. ah
[…]: 1. əʁ; 2. […]; 3. əm; 4. əhm; 5. aəm; 6. Сəh; 7. Сəm
[…]: 1. əСəm; 2. əЦə; 3. əЦəh; 4. əЦəʁ; 5. əʁh; 6. əʁm; 7. əʁə; 8. əʁəm; 9. əʁəЦəʁ

[…] ([…] [19], […]). […]

(2) КК-ARM T2-7
....(1.3)h əʁəməʁ(1.2) ..(0.1) De-e(0.1) sra pathosə čhaphazanch /parz ē,
de sra pathos-ə čhaphazanch parz ē
[…]:GEN […]-DEF […]:COP:PRS:3SG
‘[…]’

[…] [6, 29, 30].
1. […]
a. […] [a], [ə], [aə], [ə:]
2. […]
a. […] [m]
b. […] [h]
c. […] [ʁ],

[…]

(3) КК-ARM T1-1
...(0.6) /–yev-v(0.3) ..(0.2) əə(0.4) yeritasard /tʁamardə /–vorošum ē-ē(0.2) ...(0.5) əə(0.6) /gnel kno hamar ..(0.1) /phokhrik ..(0.2) /ba-aych(0.2) ..(0.1) /harmaravet-t(0.2) əə(0.3) \avtomekhena.
yev yeritasard tʁamard-ə voroš-um ē gn-el kn-o hamar phokhrik baych harmaravet avtomekhena
[…]-DEF […]-IPFV […]:AUX:PRS:3SG […]-INF […]-GEN/DAT […]
‘[…].’

[…] Speech Analyzer 10 […]: / […], \ […], /– […], \– […], /\ […], \/ […].

[…]: ‘.’ […], ‘_’ […], ‘¡’ […], ‘?’ […], ‘~’ […], ‘@’ […]. […] (J), […] (☻) ([…] 3 […] 14), […]

10 Speech Analyzer 1.5, Speech Manager 1.5, IPA Help 2.0, and AudioCon 3.2. Copyright (c) 2000 SIL International.

[…]

Table 3. КК-ARM R1-5

t     N    TEXT

1.7   1.   ..(0.4)h Tak /uremn,
           tak uremn

2.7   2.   əə(0.5) ...(0.5)h mi garnanayi-in(0.2) ..(0.1) paycaŕ arevot /ōr ēr,
           mi garnanayin paycaŕ arevot ōr ēr
           :AUX:PST:3SG

5.9   3.   u ēd /amusinneri-i(0.3) ..(0.3)h əm(0.3) ...(0.6)h amusnakan /taredarʒn ēr,
           u ēd amusin-ner-i amusnakan taredarʒ-n ēr
           -PL-GEN/DAT -DEF :AUX:PST:3SG

9.6   4.   u inčhpes-s(0.5) ..(0.4)h petkh a /liner,
           u inčhpes petkh a lin-er
           :AUX:PRS:3SG -SBJV:PRS:3SG

11.5  5.   /amusinə ...(0.5)h /gnum a ..(0.1) /nver ..(0.1) phntrelu ir kno \hamar.
           amusin-ə gn-um a nver phntr-el-u ir kn-o hamar
           -DEF -IPFV :AUX:PRS:3SG -INF-GEN/DAT :GEN -GEN/DAT

14.7  6.   ..(0.4)h Gnum a tarbercanuthner a man /galis,
           gn-um a tarber canuth-ner a man gal-is
           -IPFV :AUX:PRS:3SG -PL :AUX:PRS:3SG :NVP -IPFV:VVP

16.9  7.   ..(0.1)h vočh mi ban čhi /gtnum,
           vočh mi ban čh-i gtn-um
           :INDF NEG-:AUX:PRS:3SG -IPFV

18.2  8.   ....(1.2)h heto galis a ērexekhi het /corhrdakchum a,
           heto gal-is a ērex-ekh-i het corhrdakch-um a
           -IPFV :AUX:PRS:3SG

21.4  9.   ..(0.7)h \–yerecanerə-ə(0.3) ..(0.2) corhurd en talis gnal mekhena /aŕnel,
           yereca-ner-ə corhurd en tal-is gn-al mekhena aŕn-el
           -PL-DEF :AUX:PRS:3PL -IPFV -INF -INF

25.0  10.  khani vor ..(0.4)h ayd /mekhenan ==
           khani vor ayd mekhena-n
           -DEF

26.5  11.  yev-v(0.3) ..(0.1) /patčaŕabanum en,
           yev patčaŕaban-um en
           -IPFV :AUX:PRS:3PL

27.5  12.  vor yev-v(0.2) anhražešt ēr yev irench tan /hamar,
           vor yev anhražešt ēr yev irench tan hamar
           :AUX:PST:3SG :GEN :GEN

29.4  13.  yev ..(0.1) mayrikə šat /kuracana,
           yev mayrik-ə šat k-uracan-a
           -DEF COND- -PRS:3SG

31.0  14.  ...(0.5)h \ayspisov ...(0.7) əntanikhi hayrə gnum ē \mekhenan= ..(0.1) /canuth,
           ayspisov əntanikh-i hayr-ə gn-um ē mekhena-n canuth
           -DEF -IPFV :AUX:PRS:3SG -DEF

34.4  15.  vorteʁ mekhenaner en /vačaŕum,
           vorteʁ mekhena-ner en vačaŕ-um
           -PL :AUX:PRS:3PL -IPFV

36.0  16.  ...(0.6)h aəm(0.6) canothanum ē /gnerin,
           canothan-um ē gn-er-i-n,
           -IPFV :AUX:PRS:3SG -PL-GEN/DAT-DEF

38.4  17.  yev khani vor čhi karoʁanum ..(0.4)h gnel /ayn,
           yev khani vor čh-i karoʁan-um gn-el ayn
           NEG-:AUX:PRS:3SG -IPFV -INF

40.5  18.  vorošum ē gnelcaʁalikh /mekhena,
           voroš-um ē gn-el caʁalikh mekhena
           -IPFV :AUX:PRS:3SG -INF

42.2  19.  ...(0.9)h əh(0.2)caʁalikh mekhenan tun ē /berum,
           caʁalikh […]
           :AUX:PRS:3SG -IPFV

44.6  20.  yev-v(0.2) yerb vor ..(0.2)h nvernerə phocanakelu /žamanak ..(0.1) talis ē ir /mekhenan,
           yev yerb vor nver-ner-ə phocanak-el-u žamanak tal-is ē ir mekhena-n
           -PL-DEF -INF-GEN/DAT -IPFV :AUX:PRS:3SG :GEN -DEF

48.2  21.  kinə \šat ==
           kin-ə šat
           -DEF

48.6  22.  ...(0.7)h əm(0.4) ..(0.3)h aydkhan durə čhi galis /kno ə,
           aydkhan el dur-ə čh-i gal-is kn-o -ə
           -DEF:NVP NEG-:AUX:PRS:3SG -IPFV:VVP -GEN/DAT-DEF

51.5  23.  sakayn \yerecanerə šat en \uracanum_
           sakayn yereca-ner-ə šat en uracan-um
           -PL-DEF :AUX:PRS:3PL -IPFV
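Because every timed element in the TEXT column carries its duration in parentheses, a transcript line can be scanned mechanically. The sketch below is an illustration, not the author's tooling: the names `TIMED`, `timed_segments` and `silent_total` are hypothetical, and the two-way split into "silent" (dots only) and "voiced" (anything else, such as əə or a lengthened word ending) is an assumption made for the example.

```python
import re

# A label with no spaces or parentheses, followed by a duration in parentheses.
TIMED = re.compile(r"(?P<label>[^\s()]+)\((?P<sec>\d+(?:\.\d+)?)\)")


def timed_segments(line):
    """Return (kind, label, seconds) for each timed element in a line."""
    segments = []
    for m in TIMED.finditer(line):
        label = m.group("label")
        # Dots (or the single-character ellipsis) mark a silent pause.
        kind = "silent" if set(label) <= set(".…") else "voiced"
        segments.append((kind, label, float(m.group("sec"))))
    return segments


def silent_total(line):
    """Sum the durations of the silent pauses in a line."""
    return sum(sec for kind, _, sec in timed_segments(line) if kind == "silent")


# Line 2 of text КК-ARM R1-5.
line = "əə(0.5) ...(0.5)h mi garnanayi-in(0.2) ..(0.1) paycaŕ arevot /ōr ēr,"
for segment in timed_segments(line):
    print(segment)
```

Keeping the label alongside the duration lets later steps count hesitation-marker types or measure pauses without re-reading the raw line.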

1. Khurshudyan V. Hesitation Markers in Oral Narratives in Armenian and Italian. // VII International Conference on Cognitive Modeling in Linguistics, Varna, Bulgaria, 2004, pp. 125-135.
2. […] // […], 2005.
3. […] // […] "[…]'2005". […], 2005.
4. К[…] A.A., […] // […] 2: […], 2003, № 10, pp. 5-11.
5. DuBois J. W., Schuetze-Coburn S., Cumming S., Paolino D. Outline of discourse transcription // Talking data: Transcription and coding in discourse research / Hillsdale, N. J., 1993, pp. 45-89.
6. Edwards J. A. The transcription of discourse // The handbook of discourse analysis. Oxford: Blackwell, 2001, pp. 321-348.
7. MacWhinney B. The CHILDES Project: tools for analyzing talk, v. I-II (third edition), Lawrence Erlbaum Associates, New Jersey, 2000.
8. MacWhinney B. & Snow C. The Child Language Data Exchange System. // Journal of Child Language, # 12, 1985, pp. 271-296.
9. К[…] А.А. […]. 2003.
10. […]
11. […] // […] "[…]'2006", […] 2006, pp. 429-439.
12. […] 1983.
13. […] (1981). amanakakich hayereni cosakchakan lezun […].
14. Sanjian A. K. The Armenian alphabet. // The world's writing systems. New York & Oxford: Oxford University Press. 1996, pp. 356-363.
15. […], Hayagitakan usumnasiruthyunner […]. 2004, pp. 56-75.
16. […], 2001.
17. The Leipzig Glossing Rules, version of September 2004, http://www.eva.mpg.de/lingua/pdf/LGR04.09.21.pdf.
18. К[…] А.А. […] // […] 1. […], 2006, pp. 138-158.
19. Clark H. H. & Jean E. Fox Tree. Using uh and um in Spontaneous Dialog. // Cognition, 2002, #84, pp. 73-111.
20. Godfrey J.J., Holliman E.C., McDaniel J., Switchboard: Telephone speech corpus for research and development. // Conference on Acoustics, Speech and Signal Processing. San Francisco, 1992, pp. 517-520.
21. James D. Some aspects of the syntax and semantics of interjections. // Eighth Regional Meeting of the Chicago Linguistic Society, Chicago: Linguistics Department, University of Chicago, 1972, pp. 162-172.
22. Clark H. H. Pragmatics of language performance. // Handbook of pragmatics. Oxford: Blackwell, 2004, pp. 365-382.
23. Engel D., Charniak E., Johnson M. Parsing and Disfluency Placement, // Conference on Empirical Methods in Natural Language Processing, Philadelphia, July 2002, pp. 49-54.
24. […], 1981.
25. DuBois, J. W., Cumming S., Schuetze-Coburn S., Paolino D. Discourse Transcription. UCSB, 1992.
26. Ward N. Issues in the Transcription of English Conversational Grunts. // First (ACL) SIGdial Workshop on Discourse and Dialog, 2000, pp. 29-35.
27. Cresti E. Corpus di italiano parlato, v. I-II, Accademia della Crusca, Firenze, 2000.
28. Shriberg E.E. Preliminaries to a Theory of Speech Disfluencies. Doctoral dissertation. University of California at Berkeley. 1994.
29. Kawamori M., Kawabata T., Shimazu A. Discourse markers in spontaneous dialogue: a corpus based study of Japanese and English. // ACL98 Workshop of Discourse Relations and Discourse Markers, 1998, pp. 93-99.
30. Swerts M., Wichmann A., Beun R.-J. Filled Pauses as Markers of Discourse Structure. // ICSLP, Philadelphia, USA, 1996.
