Атты І халықаралық конференция ЕҢбектері

Table 1. The distribution of the speakers

жүктеу/скачать 8,57 Mb.

Pdf көрінісі

бет	233/326
Дата	07.01.2022
өлшемі	8,57 Mb.
	#19269

1 ... 229 230 231 232 233 234 235 236 ... 326

Байланысты:
Болатбек М. (1)

Total 30 28 23 20 22 12 21 13 169 I II

Table 1. The distribution of the speakers.
Age group
Region
F1
M1
F2
M2
F3
M3
F4
M4
Sum
1
3
3
2
1
2
1
2
1
15
2
2
3
2
1
2
1
11
3
1
1
2
3
2
1
1
11
4
3
2
1
1
7
5
2
2
2
1
2
2
2
1
14
6
2
2
2
2
2
1
2
13
7
2
2
1
2
2
2
1
12
8
2
1
1
2
1
1
2
1
11
9
3
2
2
1
3
1
1
1
14
10
1
1
2
2
1
1
2
1
11
11
2
1
2
1
1
2
9
12
2
2
2
2
1
2
1
12
13
2
2
2
1
1
1
1
1
11
14
2
1
1
1
1
2
1
2
11
15
1
3
1
2
7
Total
30
28
23
20
22
12
21
13
169
I
II
III
IV
34%
25%
20%
20%

Recording setup
The  actual  recording  sessions  took  place  in  a  sound-proof  studio  of  the  university  with  the
assistance of a sound operator. Before the recordings, the speakers were instructed, documented and
given some time to prepare as well as asked to fill in the copyright transfer form for the audio data
with their voice. They were not constrained on the manner, speed or time except for the correctness
of reading. The average time for a recording session per speaker was about 40-45 minutes, though
there were cases that last up to 2 hours.
Audio  data  were  captured  using  the  professional  vocal  microphone  Neumann  TLM  49  and
digitized  by  LEXICON  I-ONIX  U82S  sound  card.  The  format  of  the  recorded  audio  files  is  44.1
kHz  16-bit  PCM-encoded  mono  WAVE  file  format.  All  the  recorded  audio  files  were  manually
post-processed  to  have  each  utterance  (sentences  and  stories)  in  a  separate  file  and  in  the
corresponding directories. The size of the speech corpus is about 8.5 GB on disk. The total duration
of  the  audio  files  is  about  28  hours  with  23  hours  of  “sentences”  and  5  hours  of  “stories”  parts,
respectively.

жүктеу/скачать 8,57 Mb.

Достарыңызбен бөлісу:

1 ... 229 230 231 232 233 234 235 236 ... 326