Spectrograms - University Of Texas At Austin

3y ago

16 Views

2 Downloads

1.63 MB

11 Pages

Last View : 1m ago

Last Download : 3m ago

Upload by : Jenson Heredia

Report this link

Download PDF

Transcription

Lecture 3SpectrogramsIt is tough to get timing info from a FFT: we saw that back in the Week 2 lecture, on Power and Phase. In fact,the FFT had a hard time telling whether things were going forward or backward. The information is completelylost when we look at power alone; it is kept in the phase, but is coded in ways that are hard to read.The spectrogram is a clever trick to get time info. Say you have some data sampled at 1024Hz, and you have3min of data -- 3072 data points. Chunk the data into pieces of size 128, and take a 128-point FFT. Each FFTwill represent what has happened over an eighth-of-a-second. So you can track how the frequency changes overtime!Unfortunately, it ain’t so easy: the Uncertainty Principle also has a word or two to say. A 128 point FFT of datasampled at 1024Hz can only see 8 frequency chunks. So you lose a good sense of what frequencies happenwhen.That’s uncertainty. The hope is, that by adjusting the size of the chunk, you can detect the frequencies you want.The technical term for “chunk” is “window”, and you’ll want to adjust window size.There’s a second issue: say your window is 128 points long. Win1 covers points 1 to 128; Win2 points 129 to256, etc. But what if the event you want to see lasts from point 120 to 175? It’s a long event, so it’s no good trying to change window length. The problem is with where the windows are placed, not how long they are. If onlythe original window could be slid back eight points. But then it’d overlap eight points with the data in Win1.So, while we’re thinking of programming this, we also ought to implement overlap. Just in case.Finally, we’re getting a bit too much data: we have frequency, and power -- but now we have time. That’s a 3Dgraph!Fortunately, there’s a Matlab function that does it all; it’s called spectrogram. Let’s try it. x linspace(0, 1, 1024);% I’m taking the interval from zero to one as my basic unit of time, and pretending that means% “one second”. I chunk it into 1024 pieces, so this gives me a sampling rate of 1024Hz. s1 sin(2*pi*80*x); s2 sin(2*pi*160*x);% Two sine waves, at different frequencies. I want to see how uncertainty affects my% ability to see the two frequencies. s [s1 zeros(1, 128) s2 zeros(1, 128) s1 s2];% A signal starting with s1, pausing for 128/1024 sec, then s2, another pause, then% the two frequencies together.

Now comes the big one: spectrogram(s, ones(1, 256),0,256, 1024, ‘yaxis’)Nice picture . what’s it mean?Let’s start with spectrogram(S, B, C, D, E, ‘yaxis’)S is the signal.B tells the window to use. ones(1, 256) gives bunch of zeros, followed by 256 ones, then back to zeros. Asimple on/off window of length 256 points. You could taper at the ends, do anything you wanted, by changingones(1, 256) to a different set of numbers.C tells the overlap between windows; here it’s set to zero. This will make the graph look chunky: chunks oflength 256, to be exact!D tells me how long a FFT to use. 256 seems to make sense with a length 256 window.E tells me the sampling frequency. That allows Matlas to label the x-axis with the correct time, and the y-axiswith the correct frequency.And speaking of: ‘yaxis’ tells Matlab to put the frequency along the y-axis.One second here: if x-axis is time, y-axis frequency, then z-axis is power. But -- spectrogram uses a trick: it usesfalse color to indicate the z-axis. (Actually the z-axis is still there, but for now think: color height)

For comparison: spectrogram(s, ones(1, 128),127,128, 1024, ‘yaxis’)Here we have a shorter window, which makes things look less chunky, and we have an overlap of 127 points-- which produces a very smooth looking graph. With much amazing and interesting detail.Very pretty, and one can spend hours, harmlessly playing with spectrograms. Let’s look at the information wecan get from it. Start by magnifying the picture until you can see the scales. The y, or frequency, axis runs from0 to 512. That’s because the sampling rate is 1024Hz, so the Nyquist limit is 512Hz. Similarly, the x or timeaxis is a bit over 3 seconds. That’s because we took three one second sounds with two eighth-second pauses.Now look at the picture itself. The color bar at the right tells us that the deeper reds represent the frequencieswhere the power spectrum is highest. In the first second of time, that’s concentrated between 80 and 90 Hz. Wein fact know that the s1 signal is a sine with 80Hz. We’ve lost 10Hz of resolution! That’s about right, with aFFT of length 128: 1024/128 8Hz chunks. If you look at the spectrogram on the previous page, the power ismuch more clearly centered mear 80 -- in fact, the chunks are size 4Hz.The next feature to note is the transition between s1 and s2. The spectrogram on the previous page has a widebar, of length about .25 second, that represents the gap between s1 and s2. Let’s check that: the spectrogram onthat page has length 256 points. 256/1024 is indeed 1/4 of a second. This FFT cannot discriminate any smallertime interval, so the color of the bar, representing the power in that section, is constant across the whole .25sec.For the picture on this page, that transitional interval is about 1/8 of a second long, which is exactly correct.However, there’s something troubling in the computation: the interval between s1 and s2 is filled with 1/8 second of silence. There should be NO spectral power in that region. Yet both the spectrograms show some deepred. What’s this about?

64 (that factor of 2). Here . . . 58? What’s *that* about?Weird III) There are values of z71, besides the value at the peak. Values, for example, to the left and right of theLet’smorecloselyfalseat thegap: I’vereadings,turned it reportedon it’s side.peak.lookTheserepresentfrequencyat 32hz, 96 hz, and a bunch others. That didn’t happen for z64! For z64, there was a large value at the peak frequency, then zeroes everywhere else. So, what is thedeal? Why does z71 give false frequencies?In stead of an immediate answer, right away, we want to explain where the numbers came from. So that wecould at least reproduce the false values and false frequencies, independent of Matlab. Later will come a realexplanation.The color is deepest red right at 160Hz, the frequency of s2. But there’s an additional regular pattern of highpowerbars, ewillredsremember,and greensisbothThe explanationit’sthedueto ryonewhatfadewe asgetwegofurtherfromthecenterfrequencyof160Hz.when I take an exponential, sampled at frequency M, and then take the FFT at N points. We did that a few pagesback, and got the formula asThis pattern is the color of a Dirichlet kernel: from Week 2, The bars are there because we chose a simple window -- instant on, instant off. The FFT of the window is aDirichlet function. In short: the bars represent the FFT of the window, not the FFT of the empty space. Thisprovides a powerful reason to take windows that aren’t instant on/instant off.There’s one more lesson to learn from the spectrogram:

spectrogram(s, ones(1, 64),0,64, 1024, ‘yaxis’)The gap between s1 and s2 is represented by a thick blue bar. The color chart shows that blue represents thelowest possible power. This spectrogram gets it right! How does that happen? First, because the window size is64, which means that the window fits neatly into the gap: in this window, there truly is no power in the signal,and the FFT reports this accurately. Second: there’s no overlapping-- we set the overlap to zero -- so the windowdoesn’t slop over into portions where there’s s1 or s2. Compare with spectrogram(s, ones(1, 64),63,64, 1024, ‘yaxis’)

The window size is still 64, so it still sits inside the gap. But the overlap of windows is about 98%, and so theempty space is contaminated.Lab Project: Let’s start with hayat.wav, Hayat Sana Tesekkur Ederim, from the album Deliveren by the Turkishsinger sezen Aksu. It’s a nice song to do a spectrogram on; it starts with voice, then a simple and finally a morecomplex orchestral accompaniment. You get to see where the spectra of voice & instruments lie. It is sampledat industry standard 44100Hz, which may be too much data for your computer. In that case, downsample by afactor of eight or sixteen!Next, Isilongo.wav, from the Mahotella Queen’s song Isilongo Sesoka. In Isilongo, the singer uses one of theclicks in an African click language. Many linguists believe that the click languages are the ‘oldest’ humanlanguages (what can *that* mean?). Your task is to use a spectrogram to locate the click. Then fiddle with thewindow length until you can locate the beginning of the click as accurately as possible. Now plot the sound itself, and magnify it using Matlab graphing tools, to locate the beginning of the click. How do the two estimatesof the click onset compare?For the next application, we’re going to look closely at the first four phonemes of the song Isilongo.

Of course the click ! looks very different from the other sounds. Here’s what the sound -- not the spectrum-looks like:It’s reminiscent of the artificial clicks we constructedin the Uncertainty supplement of Chapter Three.Those were characterized by a fundamental frequency-- which you can see as a dark band in the spectrogram -- followed by high frequency harmonics, butfalling off slowly in power.We’ll be focusing, however, on the vowels O, AA. EE, which have very different spectra than the consonants.Therein lies a story, about speech production in humans.In the vowels O and AH, you see very dark bands, regularly spaced. Isolating the sound, it looks like:And the spectrum confirms what the graph suggests:The spectrum consists of a fundamental and some fairly noisy harmonics. The reason for this has to do with theway the vocal tract produces vowels. Air is forced from the lungs, through the vocal folds, which vibrate and are

re-inforced through the vocal tract. It isn’t very different from our examples from the Exponentials supplementto Chapter Two, where we looked at spectra of a chime and a didgeridoo. Vowels are produced by the same kindof open tube, forced at one end.Contrast the SS sound:The last thing it looks like is a fundamental and one or two harmonics; it looks more like random noise. And, ifthe spectrogram is any evidence, it has much more high frequency content than a vowel does.How are consonants like SS produced?Finally, a point from the baby boomer set. As humans age, their perception of high frequencies deteriorates.This makes it difficult to detect consonants, and because of this, speech can sound like a collection of vowelsall blurred together. Any system that can boost consonant volume, while leaving vowel volumes alone, mightbe used as a hearing aid. Here’s a paper on the basics of using wavelets for detecting and altering the differentcomponents of human speech.Finally, we’ll look a little more closely at the spectrograms. Compare spectrogram(s, ones(1, 256),0,256, 1024, ‘yaxis’) spectrogram(s, ones(1, 256),255,256, 1024, ‘yaxis’)The spectrogram with the massive overlap seems to have all kinds of very fine detail -- even in the first third ofthe graph. What is that?We’ll take a simple model: x linspace(0, 1, 1024); s sin(2*pi*8*x); plot(x, 100*s) hold spectrogram(s, ones(1, 16),15,16, 1024, ‘yaxis’)Here’s two views of the same spectrogram:

This is the “fine detail”, in close-up, and the blue curve is my sine curve. Looking at it, it’s tempting to think,‘wow! the spectrogram is picking out the individual peaks of the sine curve. Groovy! Unfortunately, red indicates power; yellow indicates significantly less power.This would be easier in 3D. And Matlab has this amazing tool, the ‘rotate’ tool, that allows just that. Here’s thesame graph, but rotated so the high and low are clearly visible:

This is a good time to ask, ‘what is going on here?’ Start by following one frequency in the FFT. We’ve forced itinto a window of length 16, so its real and imaginary components look like tiny sine or cosine waves.In contrast, the signal s itself has frequency 8Hz, so it takes 128 points to go through a cycle. In contrast to theFFT, it is a long slow wave. Moreover, as the window overlap is a full 16 points, the tiny wave is moved, pointby point, against the large wave. The FFT computation looks something like the (exaggerated) view below:Here the little blue wave represents the exponential used in computing the power in the FFT. So: which of thetwo blue waves above give the greatest power? The way to compute the FFT is to multiply the exponential bythe function, and integrate.The ﬁrst little wave is in a region of the sine curve that looks approximately like f (x) x thesecond, f (x) 1 x2 ( Taylor series computation would show that in fact, f (x) 1 12 x2 ). Thenthe contributions to the energy at each of the two locations would be π πx · sin(5x) dx; 11(1 x2 ) · sin(5x) dx 22 ππ π πx2 · sin(5x) dxWe could integrate by hand, or have Matlab do the integral; instead we’ve brought in another program, Mathematica, to do the integration.In[2]: Integrate[x*Sin[5x], {x, -Pi, Pi}]Out[2] 2 Pi---5

In contrast,In[3]: Integrate[x 2*Sin[5x], {x, -Pi, Pi}]Out[3] 222 - 25 Pi -2 25 Pi---------- ---------- 0125125Even if you’re a bit off from being exactly centered at zero, the contribution is small:In[5]: Integrate[(x-.01) 2*Sin[5x], {x, -Pi, Pi}]Out[5] 20.0799 - 0.02 Pi - Pi---------------------- 52-0.0799 - 0.02 Pi Pi----------------------5Whoops! What is that as a number?In[6]: NIntegrate[(x-.01) 2*Sin[5x], {x, -Pi, Pi}]Out[6] -0.0251327The little blue wave sees a much larger contribution from the straight-line portion of the curve; very roughly, thelittle blue wave picks out the slope of the curve. And that slope is near zero near where the large function peaks.So that’s what the fine detail in the spectrogram represents, very roughly.By the way -- must we say ‘little blue wave’? It sounds like the wave in the big blue house.There is a fancier name: “wavelet”. Which is where we go next.Finally -- thought! -- we might want to use the spectrogram to try to understand the signals we already have. Trydoing spectrograms of the signalsz1 y. 2.*sin(2*pi*8*y cos(2*pi*3*y));z2 sin(2*pi*8*y 3);In z2, can you detect the instantaneous frequency?

lost when we look at power alone; it is kept in the phase, but is coded in ways that are hard to read. The spectrogram is a clever trick to get time info. Say you have some data sampled at 1024Hz, and you have 3min of data -- 3072 data points. Chunk the data into pieces of size 128, and take a 128-point FFT. Each FFT

Related Documents:

Seabed type and source parameters predictions using ship spectrograms ...

Seabed type and source parameters predictions using ship spectrograms in convolutional neural networksa) David F. Van Komen,1,b) Tracianne B. Neilsen,1,c) Daniel B. Mortenson,1 Mason C. Acree,1 David P. Knobles,2 Mohsen Badiey,3 and William S. Hodgkiss4 1Physics and Astronomy, Brigham Young University, Provo, Utah, 84604, USA 2Knobles Scientiﬁc and Analysis, Austin, Texas, 78731, USA

5 Views

1y ago

Topic: Spectrogram, Cepstrum and Mel-Frequency Analysis

Some Real Spectrograms Dark regions indicate peaks (formants) in the spectrum. Speech Technology - Kishore Prahallad (skishore@cs.cmu.edu) 12 Why we are bothered about spectrograms Phones and their properties are better observed in spectrogram. Speech Technology - Kishore Prahallad (skishore@cs.cmu.edu) 13

16 Views

2y ago

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer ...

into a Mel-spectrogram. Most speech synthesis systems are designed in a two-step manner: generation of Mel-spectrograms from input texts (i.e., a feature prediction module), followed by synthesis of waveforms with a pre-trained neural vocoder given the Mel-spectrograms (i.e., a waveform generation module) [8-13]. Al-

8 Views

5m ago

The University of Texas System Board of Regents Accountability and ...

The University of Texas at Arlington z The University of Texas at Austin The University of Texas at Brownsville z The University of Texas at Dallas The University of Texas at El Paso z The University of Texas - Pan American The University of T exas of the Permian Basin z The University of Texas . Graduation rates of medical, dental, nursing .

24 Views

1y ago

How Do I Access My … MS Science e-Book?

Texas Math Course 1 (Grade 6) Texas Math Course 2 (Grade 7) Texas Math Course 3 (Grade 8) Texas Grade 6 iScience Texas Grade 7 iScience Texas Grade 8 iScience Texas Biology Texas Chemistry Texas Integrated Physics and Chemistry Texas Physics MHEtexas.com MK14M03416

106 Views

3y ago

THE ECNL REGIONAL LEAGUE: THE PROVING GROUND

Missouri City, Texas San Antonio City San Antonio, Texas San Antonio Surf Kyle, Texas SG1 Soccer Club Katy, Texas Sting Austin Austin, Texas Sting Corpus Corpus Christi, Texas Sting San Antonio San Antonio, Texas TEXAS Ajax SC New Braunfels, Texas Alamo City SC San Antonio, Texas Albion Hurr

87 Views

2y ago

SEVP-Certified Schools in AL, AR, FL, GA, KY, MS, NC, TN ...

TEXAS . Brown Mackie College Dallas/Fort Worth . TEXAS . Salon Boutique Academy . TEXAS . Cornerstone Christian Academy . TEXAS . ProFlight Aviation Services LLC . TEXAS . Central Texas Christian School . TEXAS . East Texas Christian School . TEXAS . JAMIE'S HOUSE CHARTER SCHOOL . TEXAS . Wharton County Junior College . Lee-Scott Academy .

76 Views

2y ago

2019 Program Report

Prairie View A&M University 1 The University of Texas at Austin 22 Sam Houston State University 6 The University of Texas at Dallas 11 Stephen F. Austin State University 2 The University of Texas at El Paso 6 Tarleton State University 1 The University of Texas at San Antonio 7 Texas A&M International University 4 The University of Texas at Tyler 1

47 Views

1y ago

Recent Views

WHAT LAW IS ? An Introduction to Law

common law system civil law system!! sources of law in civil law !! a1. primary: statutes (written law) enacted by legislative power are the principal source of law. ! a2. two subsidiary sources of law: ! a2.1 administrative regulations a.2.2 customs!! ! sources of law in common law !!! b1. two primary sources of

2y ago

385 Views

12 PUBLIC LAW AND PRIVATE LAW - Home: The National .

INTRODUCTION TO LAW MODULE - 3 Public Law and Private Law Classification of Law 164 Notes z define Criminal Law; z list the differences between Public and Private Law; and z discuss the role of Judges in shaping Law 12.1 MEANING AND NATURE OF PUBLIC LAW Public Law is that part of law, which governs relationship between the State

3y ago

745 Views

Dr. Ram Manohar Lohiya National Law University, Lucknow

2. Health and Medicine Law 3. Int. Commercial Arbitration 4. Law and Agriculture IXth SEMESTER 1. Consumer Protection Law 2. Law, Science and Technology 3. Women and Law 4. Land Law (UP) Xth SEMESTER 1. Real Estate Law 2. Law and Economics 3. Sports Law 4. Law and Education **Seminar Courses Xth SEMESTER (i) Law and Morality (ii) Legislative .

3y ago

496 Views

Companies Law - Cayman Islands dollar

Law 1 of 1971-15th December, 1970 Law 7 of 2000- 20th July, 2000 Law 7 of 1973-28th June, 1973 Law 5 of 2001-20th April, 2001 Law 24 of 1974-22nd November, 1974 Law 10 of 2001-25th May, 2001 Law 25 of 1975-9th December, 1975 Law 29 of 2001-26th September, 2001 Law 19 of 1977-10th November, 1977 Law 46 of 2001-14th January, 2002

3y ago

454 Views

It’s the Law!

ciples stated in Boyle’s Law, Charles’ Law, Gay-Lussac’s Law, Henry’s Law, and Dalton’s Law. Students will be able to explain the application of Boyle’s Law, Charles’ Law, Gay-Lussac’s Law, Henry’s Law, and Dalton’s Law to observations or events related to SCUBA diving. MateriaLs None audio/visuaL MateriaLs None teachinG tiMe

2y ago

378 Views

Common-Law Courts in a Civil-Law System: The Role of United Stat-es .

He learns the law, not by reading statutes that promulgate it or treatises that summarize it, but rather by studying the judicial opinions that invented it. This is the famous case-law method, 1 Oliver Wendell Holmes, Jr., The Common Law (1881). · : .·· ' COMMON-LAW COURTS IN A CIVIL-LAW SYSTEM pioneered by Harvard Law School in the last .

1y ago

197 Views

Faculty of Juridical, Social and Political Sciences Year .

Law L Law IV 8 Drept procesual civil II / Civil Procedure Law II 5 Law L Law IV 8 Dreptul comerțului internațional / International ommercial Law 4 Law L Law IV 8 riminalistică / Forensics 4 Law L Law IV 8 Practică de cercetare pentru elaborarea lucrării de lincență(3 săptămân

2y ago

384 Views

Ohm ’s Law

Ohm ’s Law Ohm's law states that, in an electrical circuit, the current passing through most materials is directly proportional to the potential difference applied across them. 3-1—3-3: Ohm ’s Law Formulas There are three forms of Ohm’s Law: I V/R V IR R V/I where:File Size: 1MBPage Count: 40Explore furtherOhm's Law Quiz MCQs with Answers Ohm Lawohmlaw.comOhm’s Law Worksheet - Basic Electricity - All About omohms law worksheet - eering.orgOhm’s Law Worksheet - Richmond County School Systemwww.rcboe.orgOhm's Law with Examples - Physics Problems with Solutions ended to you b

2y ago

295 Views

Intermediate Law Law and You Worksheet 3: Australian law - Home Affairs

4. There are different kinds of law to deal with different kinds of problems. Four important kinds of law are civil law, criminal law, family law and administrative law. Civil law deals with disputes between individuals; for example, if someone sells you goods that are faulty, or that cause you injury or damage, you can take that person to court.

4m ago

110 Views

PRINCIPLES OF BUSINESS LAW - DPHU

ABE Diploma in Business Administration Study Manual PRINCIPLES OF BUSINESS LAW Contents Study Unit Title Page Syllabus i 1 Nature and Sources of Law 1 Nature of Law 3 Historical Origins 6 Sources of Law 9 The European Community and UK Law: An Overview 13 2 Common Law, Equity and Statute Law 23 Custom 25 Case Law 26 Nature of Equity 32

3y ago

285 Views

Principles of Common Law Public Law – Part 1 The British .

Institute of Law-The UK Constitution-Separation of Powers-Rule of Law-Sovereignty of Parliament-Royal Prerogative-Judicial Review-Human RightsIntroduction to UK Public Law-Court system-The Trial-Common law –judge made law-Doctrine of precedent-Challenges of judge made law-Statutory

2y ago

130 Views

A Trail Guide to Careers in Environmental Law

law, constitutional law, property law, bankruptcy law, criminal law, food and drug law, land use planning law, and international law. A distinctive aspect of environmental practice is the role of science in advocacy efforts.

3y ago

241 Views

Accounting Technicians Diploma (ATD) Examination Syllabus

Apply law of contract and tort in various scenarios Apply general principles of business law in practice. CONTENT 2.1 Elements of the legal system 2.1.1 Nature, purpose and classification of law - Meaning of law - Nature of law - Purpose of law - Classification of law - Law and morality 2.1.2 Sources of law - The Constitution

3y ago

216 Views

MsEffie’s List of Poetry Essay Prompts for Advanced .

15 Law is as I’ve told you before, Law is as you know I suppose, Law is but let me explain it once more, Law is The Law. Yet law-abiding scholars write: 20 Law is neither wrong nor right, Law is only crimes Punished by places and by times, Law is the clothes men wear

2y ago

181 Views

An Introduction to Kansas Law Impacting the Oil and Gas Industry .

property law and contract law to the oil and gas subject matter, or it is an adaptation of property law or contract law to create a unique rule that we label "oil and gas" law. a. "Adaptation" will in many cases be, at most, a charitable way of describing what courts do to property law and contract law to develop a new rule of "oil and gas" law. b.

1y ago

143 Views

Spectrograms - University Of Texas At Austin

It looks like you're using an ad-blocker