NLP1000_Praat

RAW FILE

This note has not been edited yet. Content may be subject to change.

20250905 NLP1000 Praat

Announcements

  • may conference sa california
    • sept 10-12, 25-27 (travel period 9-29)
  • may conference sa yogyakarta
    • nov 12-14
  • walang klase sept 12, 19, 26
  • makeup: oktubre (3 invited talks)
  • notifications: hindi kaagad nakakatanggap ng notification kapag nag-comment; kapag may kailangan, ipadala sa inbox sa Canvas
  • importante sa akin ang mga networking events dahil nakakatulong ako sa iba
  • do activity 1

questions to ask yourself

  • how do i grow as a person
  • what happens when i come home
  • how do i contribute to society

PSA: lakas ng loob; kapal ng mukha → use this to get deadline extension

International Decade of Indigenous Languages

  • 2022-2032
  • indigenous language - native to a region
  • tagalog - indigenous language
  • cebuano, hiligaynon, etc. - are not dialects
  • language vs dialect
    • dialect: variety of a language
  • conyo is a socialect
  • idiolect a smaller group of individuals
  • gayspeak is a socialect
  • bulacan variety of tagalog is a dialect; dialects are varieties of a language
    _attachments/Pasted image 20250914235947.png

what is a similarity matrix?

  • tells you how similar and how dissimilar two indices (?) are
    _attachments/Pasted image 20250915000022.png

we're also going to produce a cluster
_attachments/Pasted image 20250915000049.png

  • once we have a word list, we apply clustering algorithms

Learning Plan
_attachments/Pasted image 20250915000214.png

Pag-aaral ng wika

when we deal with languages we deal with:

  • phonology: sounds
  • morphology: affixes
  • syntax
  • semantics: grammar
  • discourse and dialogue

Phonology

the science of (speech) sounds

if you're interested..

  • ronald pascual - goat for audio processing in dlsu
  • jocelyn cu - empathic computing, speech sounds like laughter
  • thomas tiam-lee - advisee working on speech recognition

Ilan ang vowels sa Tagalog?

_attachments/Pasted image 20250915000543.png

  • pre-hispanic period: 3 vowels via baybayin system
  • that's why there are only 3 vowel sounds in the tagalog language - that's why there are several spelling variants in the language today

but in modern tagalog:
slowly pronounce: a, e, i, o u, basa, biik, lulu

  • guide questions: nasaan ang dila? mas malapit sa baba o sa taas ng bibig? paabante o paurong?
    • touching the palate (above)? or touching oral cavity (below)?

kambal patinig

tongue:
a is usually downward
e is usually closer
u is moving towards

Formants

_attachments/Pasted image 20250915001023.png

  • tagalog vowels: a, e, i, o, u
  • F1 refers to the tongue location (closer to palette or oral cavity)
  • F2 refers to the tongue
  • in terms of phonology, there are more foreign experts working on tagalog more than local experts

you can check vowel space in other languages

Activity 2

_attachments/Pasted image 20250905183730.png
_attachments/Pasted image 20250905183916.png

In Praat:

  • waveform: visualization of the voice
  • spectrogram:
  • duration

_attachments/Pasted image 20250905184337.png

  • monotonic view in praat (most are flat)

  • open > open from file > week 1 audio in Praat

  • (zoom in) by pressing "in" on bottom left
    _attachments/Pasted image 20250905184708.png_attachments/Pasted image 20250905185127.png
    _attachments/Pasted image 20250905185339.png

  • intensity: loudness/decibels

recording sound

  • praat objects window > record mono (not stereo)
  • mono: sound on left is the same on the right
  • stereo: different

advanced:

praat objects > annotate > textgrid (need to download)
_attachments/Pasted image 20250905190334.png
_attachments/Pasted image 20250905190422.png

praat objects > query

if you want to get something from the entire file like minimum, maximum, etc.

thesis topics

_attachments/Pasted image 20250905190613.png

  • spectrogram
  • speech generation: if i put a sad speech in tagalog, how can i make it happy?
  • what is the emotion of the speaker from timestamp 1 to 4?