NLP1000_Praat
This note has not been edited yet. Content may be subject to change.
20250905 NLP1000 Praat
Announcements
- there is a conference in California
- sept 10-12, 25-27 (travel period 9-29)
- there is a conference in Yogyakarta
- nov 12-14
- no class on sept 12, 19, 26
- makeup: October (3 invited talks)
- notifications: the instructor does not get notified right away when someone comments; if you need something, send it to the inbox in Canvas
- networking events are important to me because I get to help others
- do activity 1
questions to ask yourself
- how do i grow as a person
- what happens when i come home
- how do i contribute to society
PSA: courage (lakas ng loob) and thick skin (kapal ng mukha) → use these to ask for a deadline extension
International Decade of Indigenous Languages
- 2022-2032
- indigenous language - native to a region
- tagalog - indigenous language
- cebuano, hiligaynon, etc. - are not dialects
- language vs dialect
- dialect: variety of a language
- conyo is a sociolect (the variety of a social group)
- an idiolect is the variety of a single individual
- gayspeak is a sociolect
- bulacan variety of tagalog is a dialect; dialects are varieties of a language

what is a similarity matrix?
- tells you how similar or dissimilar each pair of items is: the entry at row i, column j compares item i with item j

we're also going to produce a cluster

- once we have a word list, we apply clustering algorithms
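
A minimal sketch of the word list → similarity matrix → cluster idea above. Everything here is an assumption for illustration, not the method used in class: the tiny word list is a placeholder, similarity is the average string similarity from Python's `difflib`, and the clustering is a plain average-linkage merge written by hand.

```python
from difflib import SequenceMatcher

# Placeholder word list (illustrative only): the same five concepts
# (house, water, night, dog, fire) in three languages.
word_list = {
    "Tagalog":    ["bahay", "tubig", "gabi",  "aso", "apoy"],
    "Cebuano":    ["balay", "tubig", "gabii", "iro", "kalayo"],
    "Hiligaynon": ["balay", "tubig", "gab-i", "ido", "kalayo"],
}
names = list(word_list)


def language_similarity(words_a, words_b):
    """Average string similarity (0 to 1) over aligned word pairs."""
    scores = [SequenceMatcher(None, a, b).ratio() for a, b in zip(words_a, words_b)]
    return sum(scores) / len(scores)


def similarity_matrix(names, word_list):
    """Square matrix: entry [i][j] compares language i with language j."""
    n = len(names)
    matrix = [[1.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            score = language_similarity(word_list[names[i]], word_list[names[j]])
            matrix[i][j] = matrix[j][i] = score  # mirror so the matrix is symmetric
    return matrix


def cluster(names, matrix):
    """Repeatedly merge the two most similar clusters (average linkage)."""
    index = {name: i for i, name in enumerate(names)}
    clusters = [(name,) for name in names]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                pairs = [matrix[index[a]][index[b]]
                         for a in clusters[i] for b in clusters[j]]
                score = sum(pairs) / len(pairs)
                if best is None or score > best[0]:
                    best = (score, i, j)
        score, i, j = best
        merged = clusters[i] + clusters[j]
        merges.append((merged, score))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges


matrix = similarity_matrix(names, word_list)
merges = cluster(names, matrix)

for name, row in zip(names, matrix):
    print(f"{name:<11}", " ".join(f"{value:.2f}" for value in row))
for members, score in merges:
    print("merge:", " + ".join(members), f"(similarity {score:.2f})")
```

With this placeholder list, the first merge groups the two languages whose words overlap the most; a real study would need a much longer word list and a similarity measure chosen on linguistic grounds.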
Learning Plan

The study of language
when we deal with languages we deal with:
- phonology: sounds
- morphology: affixes
- syntax: grammar (how words combine into sentences)
- semantics: meaning
- discourse and dialogue
Phonology
the science of (speech) sounds
if you're interested..
- ronald pascual - goat for audio processing in dlsu
- jocelyn cu - empathic computing, speech sounds like laughter
- thomas tiam-lee - advisee working on speech recognition
How many vowels are there in Tagalog?

- pre-hispanic period: 3 vowels via baybayin system
- baybayin wrote only 3 vowel sounds (a, e/i, o/u), which is why there are several spelling variants in the language today
but in modern tagalog:
slowly pronounce: a, e, i, o, u, basa, biik, lulu
- guide questions: where is the tongue? is it closer to the bottom or the top of the mouth? is it moving forward or backward?
- is it closer to the palate (above) or to the floor of the mouth (below)?
kambal-patinig (diphthongs)
tongue:
a: the tongue is usually low
e: the tongue is usually closer to the palate
u: the tongue moves toward the back
Formants

- tagalog vowels: a, e, i, o, u
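- F1 tracks tongue height (closer to the palate or to the floor of the mouth): the higher the tongue, the lower F1
- F2 tracks how far forward or back the tongue is: the more forward the tongue, the higher F2
- in terms of phonology, there are more foreign experts than local experts working on tagalog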
you can check vowel space in other languages
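
A small sketch of how F1 and F2 relate to tongue position, assuming the usual convention that a higher tongue gives a lower F1 and a more forward tongue gives a higher F2. The numbers below are rough placeholders in Hz, not measurements from the class audio; real values should come from the formant readings in Praat.

```python
# Placeholder (F1, F2) values in Hz for the five vowels.
# These are illustrative only, NOT measured values.
formants = {
    "i": (300, 2300),
    "e": (450, 2000),
    "a": (750, 1300),
    "o": (500, 900),
    "u": (350, 800),
}

# Low F1 = tongue high in the mouth, so sort F1 ascending (highest tongue first).
by_height = sorted(formants, key=lambda vowel: formants[vowel][0])

# High F2 = tongue toward the front, so sort F2 descending (most front first).
by_frontness = sorted(formants, key=lambda vowel: formants[vowel][1], reverse=True)

print("tongue high -> low  :", " ".join(by_height))
print("tongue front -> back:", " ".join(by_frontness))
```

Vowel space charts use the same idea: F2 goes on the horizontal axis and F1 on the vertical axis, both reversed, so the plot looks like the inside of the mouth.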
Activity 2


In Praat:
- waveform: visualization of the voice
- spectrogram: frequency content of the sound over time
- duration

- monotonic view in Praat (most are flat)
- Open > Read from file... > week 1 audio in Praat
- zoom in by pressing "in" at the bottom left
- intensity: loudness, measured in decibels (dB)
recording sound
- Praat Objects window > New > Record mono Sound... (not stereo)
- mono: a single channel, so the sound on the left is the same as on the right
- stereo: two channels, so left and right can differ
advanced:
praat objects > annotate > textgrid (need to download)


praat objects > query
if you want to get something from the entire file like minimum, maximum, etc.
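
A rough sketch of the same kind of whole-file query outside Praat, using only Python's standard library. It assumes a 16-bit PCM WAV file; the in-memory test tone stands in for a real recording, and the dB figure is relative to full scale (dBFS), which is not the same scale as Praat's intensity readings.

```python
import io
import math
import struct
import wave


def make_test_tone(freq=220.0, seconds=0.5, rate=16000, amplitude=0.5):
    """Build a mono 16-bit WAV in memory (a stand-in for a real recording)."""
    n = int(seconds * rate)
    samples = [int(amplitude * 32767 * math.sin(2 * math.pi * freq * t / rate))
               for t in range(n)]
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as out:
        out.setnchannels(1)   # mono: one channel
        out.setsampwidth(2)   # 2 bytes = 16-bit samples
        out.setframerate(rate)
        out.writeframes(struct.pack(f"<{n}h", *samples))
    buffer.seek(0)
    return buffer


def query(wav_file):
    """Return duration, minimum, maximum, and RMS level for the whole file."""
    with wave.open(wav_file, "rb") as wav:
        channels = wav.getnchannels()
        rate = wav.getframerate()
        frames = wav.getnframes()
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM audio")
        raw = wav.readframes(frames)
    values = struct.unpack(f"<{frames * channels}h", raw)
    scaled = [value / 32768 for value in values]  # scale to the range -1..1
    rms = math.sqrt(sum(x * x for x in scaled) / len(scaled))
    return {
        "channels": channels,
        "duration": frames / rate,  # seconds
        "minimum": min(scaled),
        "maximum": max(scaled),
        "rms": rms,
        "level_dbfs": 20 * math.log10(rms) if rms > 0 else float("-inf"),
    }


stats = query(make_test_tone())
for key, value in stats.items():
    print(f"{key}: {value}")
```

To try it on a real recording, pass a file path such as `query("recording.wav")` (a hypothetical file name) instead of the test tone.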
thesis topics

- spectrogram
- speech generation: if i put a sad speech in tagalog, how can i make it happy?
- what is the emotion of the speaker from timestamp 1 to 4?