Clustering Bach Chorales: Insights into SATB and Bach's Style

Diego Hernandez, Hope Casey-Allen, Jian Yang Lum
Stanford University, California, United States

1 Introduction and Literature Review

Few composers have been as influential in the Western musical tradition as Johann Sebastian Bach (1685-1750), whose four-part chorales for soprano, alto, tenor, and bass (abbreviated SATB, and listed in order from highest to lowest) have become well-known distillations of important music-theoretical principles. Such principles, dealing with both the construction of individual parts and the interactions between those parts, came to dominate Western music for about 150 years after Bach's time and continue to inform musical understanding today.

Some research in machine learning has been devoted to computerized harmonization (i.e., given a soprano melody, generate the other three voices in Bach's style). However, previous projects have encountered obstacles due to a lack of rigorous understanding of the nuances in Bach's style. For instance, Allan and Williams (2005) attempted to generate chorale harmonizations in Bach's style using a Hidden Markov Model [1]. The results were suboptimal: although the relationships between voices in the resulting chorales were consistent with harmonic rules seen in textbooks, the individual voices themselves sometimes jumped erratically in pitch instead of flowing in natural, intentional contours.

It is clear that research is needed to characterize SATB voices in Bach chorales more precisely; the resulting insights would not only improve algorithms that seek to generate harmonizations but would also help music theorists and performers understand and interpret this music.

Our first goal is thus to determine features (independent of relative positions) that distinguish individual SATB voices in Bach chorales. Our second goal is to find and interpret clusterings of chorales that may be artistically useful (e.g., helpful in identifying musically similar chorales for purposes of inspiration, comparison, etc.).

We have two inputs, corresponding to the two goals. The inputs for the first goal are the features extracted from the voice parts of these chorales, 183 × 4 SATB parts in total. We used k-means clustering with k = 4, in an attempt to yield clusters that divide the voices into their natural SATB taxonomy and reveal the defining characteristics of each melody, and softmax (multinomial) regression for voice prediction, where the output is one of the four SATB labels.

The inputs to our second goal are features we extracted from the scores of 183 of Bach's chorales. We used the elbow method to determine k = 4 for k-means clustering on this input, and obtained distinct clusters of chorales that reveal insights into Bach's style; we also used EM in an attempt to reveal traits of Bach's style.

We believe our task has never been attempted in the literature. The closest task we could find to ours is in [3], in which Quinn and Mavromatis clustered every chord transition in a combined corpus of chorales from two categories: chorales by Bach and modal chorales from about a century before Bach. They then compared each chorale category's coverage of each cluster as a means of identifying distinguishing harmonic characteristics of each category. Although this research provides a window into the overall harmonic landscape of Bach's chorales, it says nothing about the characteristics of SATB voices.

2 Dataset and Features

Our dataset contains 183 Bach chorales and 183 × 4 SATB voice parts, all accessed from "http://kern.humdrum.org/cgi-bin/ksdata?l=musedata/bach/chorales&file=chor-01.krn&f=musedata", where 01 is the number of the chorale. The dataset is encoded in the Humdrum **kern data format; Humdrum was developed by CCARH (the Center for Computer Assisted Research in the Humanities) at CCRMA (within Stanford).

(a) Sample of a Bach chorale opening. S is the top line, A the second highest, T the third, B the lowest. (b) The same score, translated into **kern.

As seen above, musical information (pitches, metronome markings, measures, metadata, etc.) is encoded to represent a full musical score. We used Music21, a toolkit developed at MIT, to separate the chorales into parts and extract their features. We categorized the relevant features into 9 buckets. For some buckets with multiple features, we used PCA and extracted the first principal component (which in all cases explained at least 96% of the variation), while for others we added or subtracted the feature values. All variables were normalized to have mean 0 and variance 1, to aid k-means clustering by assigning equal importance to each predictor.

3 Methods

The two goals required different approaches: determining the differences between individual voices is fundamentally a classification problem, while attempting to observe similarities among chorales that have no observable grouping is an unsupervised problem. To that end, we employed a mixture of parametric and nonparametric methods for differentiating SATB voices (namely softmax/multinomial logistic regression and k-means clustering) and multiple unsupervised methods (k-means and EM) for clustering chorales.

The softmax, or multinomial regression, model posits k different outcomes/voice types (in this case 4, one per voice type), with the total probability summing to one (with a constraint on the last variable). Maximization of the log-likelihood of the model,

\ell(\theta) = \sum_{i=1}^{m} \ln \prod_{l=1}^{k} \left( \frac{e^{\theta_l^T x^{(i)}}}{\sum_{j=1}^{k} e^{\theta_j^T x^{(i)}}} \right)^{1\{y^{(i)} = l\}},

was done using a single-hidden-layer neural network, in which the hidden layer is estimated using a vector of weights w of length nk (n being the number of features, k = 4). The neural net then used backpropagation of errors with gradient descent to update the parameters. The softmax model was chosen because it does not require breaking the task down into binary classification problems.

k-means clustering iteratively seeks the cluster centroids that minimize the aggregate residual sum of squares over all points, \sum_{i=1}^{m} \| x^{(i)} - \mu_{c(i)} \|^2, by first reassigning each point to its nearest centroid and then moving each centroid to the mean of all points assigned to it. k-means was used as an exploratory data analysis mechanism, as well as a way of understanding the properties of each group by examining its centroid.
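To make the two alternating k-means steps concrete, here is a minimal sketch of the procedure in Python/NumPy. It is illustrative only (the project's analysis was carried out in R), and the array X is assumed to be the normalized feature matrix described in Section 2.

```python
import numpy as np

def kmeans(X, k=4, n_iter=100, seed=0):
    """Minimal Lloyd's algorithm: alternate the assignment and update steps,
    reducing the residual sum of squares sum_i ||x_i - mu_{c(i)}||^2."""
    rng = np.random.default_rng(seed)
    # Initialize centroids as k distinct randomly chosen data points.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Assignment step: send each point to its nearest centroid.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update step: move each centroid to the mean of its assigned points.
        new_centroids = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                                  else centroids[j] for j in range(k)])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    rss = ((X - centroids[labels]) ** 2).sum()
    return labels, centroids, rss
```

In practice a library routine such as sklearn.cluster.KMeans (or R's kmeans) would be used; the explicit loop simply mirrors the two steps described above.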

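As a worked illustration of the softmax log-likelihood given in Section 3, the sketch below evaluates it for a given parameter matrix. The paper fit the model through a single-hidden-layer network in R's nnet, so this NumPy function is only a reference for the objective itself; the variable names are ours.

```python
import numpy as np

def softmax_log_likelihood(theta, X, y):
    """log L(theta) = sum_i log prod_l (e^{theta_l.x_i} / sum_j e^{theta_j.x_i})^{1{y_i = l}}.

    theta: (k, n) matrix whose rows are the per-class parameter vectors theta_l
    X:     (m, n) feature matrix;  y: (m,) integer labels in {0, ..., k-1}
    """
    scores = X @ theta.T                                 # theta_l^T x^(i) for every i, l
    scores = scores - scores.max(axis=1, keepdims=True)  # stabilize the exponentials
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    # The indicator 1{y_i = l} keeps only the log-probability of the true class.
    return log_probs[np.arange(len(y)), y].sum()
```

Maximizing this quantity (for instance by gradient ascent, with one class pinned as the baseline) yields the multinomial fit; only the relative values of the theta_l are identified, which is why the constraint on the last class mentioned above is needed.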
The EM algorithm, used when a latent random variable is suspected (in this case, the different styles Bach may have used), performs maximum likelihood estimation in a two-step mechanism: it first constructs a lower bound on the likelihood via Jensen's inequality, by setting the distribution over the latent variables equal to their posterior distribution given the observed variables and the current parameters (E-step), and then optimizes that lower bound with respect to the parameters (M-step). It offered an alternative to the k-means algorithm, in that it allowed us to observe potential distributional properties of the latent variables rather than only looking at centroids. All data analysis was done in R, with the packages nnet and mclust for softmax and EM respectively.

4 Results and Discussion

4.1 Individual Voice Clustering/Classification

For clustering, besides attempting to lower the misclassification rate, we hoped to identify variables of interest that strongly influence the classification of voice lines into their voice types, as the goal was to study the properties of the different melodic lines.

The k-means approach (k = 4, corresponding to the SATB voices) yielded a misclassification rate of 32.1%, with most errors occurring among the soprano, alto, and tenor voices (only 6% of Type I or Type II errors involved the bass line); this implies that the bass line is the most distinct, followed by the soprano line.

Table 1: k-means cluster results versus actual voice type. (a) Cluster results, plotted against the first two principal components.

The softmax approach had a 10-fold cross-validation error of 25.5%. p-values were calculated for each estimator using the test statistic Z = E(θ_i) / sqrt(Var(θ_i)); only the estimates that were statistically significant (i.e., p-value less than 0.05 under the null hypothesis that θ_i = 0) were considered for analysis.

Across the model as a whole, only averageMelodicInterval and rangeIndivVoices were statistically significant for all three voice types (one of the four categories, the bass, was arbitrarily used as the baseline, so coefficients were estimated for only three voice types). For both predictors, the bass had the highest average melodic interval and the largest individual-voice range, agreeing with music-theoretical accounts of wide movements in the bass line. Among the other statistically significant variables, the alto and tenor both showed increased percentages of repeated notes, while the soprano generally had faster melodic tempi and considerably more diatonic (as opposed to chromatic) movement. Across all the data, the duration of melodic intervals was not a useful predictor.
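A sketch of the significance screening just described: given coefficient estimates and their standard errors from the fitted multinomial model, two-sided Wald p-values can be computed as below. The numeric values are hypothetical placeholders, not results from the paper.

```python
import numpy as np
from scipy.stats import norm

def wald_p_values(coef, std_err):
    """Two-sided p-values for H0: theta_i = 0, using Z = theta_i / sqrt(Var(theta_i))."""
    z = np.asarray(coef) / np.asarray(std_err)
    return 2 * norm.sf(np.abs(z))          # upper-tail area of the standard normal, doubled

# Hypothetical coefficients and standard errors; keep predictors with p < 0.05.
coef = np.array([0.84, -0.12, 1.95])
std_err = np.array([0.30, 0.25, 0.60])
significant = wald_p_values(coef, std_err) < 0.05
```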

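For the 10-fold cross-validation error reported above, a pipeline along the following lines could be used. The paper's analysis was in R; this scikit-learn version, with randomly generated stand-in data of the right shape, is only a sketch.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Stand-in data with the rough shape of the real inputs (183 x 4 = 732 voice parts,
# 9 feature buckets); the actual features come from the Music21 extraction step.
rng = np.random.default_rng(0)
X_voices = rng.normal(size=(732, 9))
y_satb = rng.integers(0, 4, size=732)        # 0..3 for soprano, alto, tenor, bass

clf = LogisticRegression(max_iter=1000)      # multinomial softmax for the 4-class problem
scores = cross_val_score(clf, X_voices, y_satb, cv=10)
cv_error = 1.0 - scores.mean()               # the paper reports 25.5% on the real features
print(f"10-fold CV error: {cv_error:.3f}")
```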
4.2 Chorale Clustering: Unsupervised Learning

In picking the appropriate k for the initial k-means exploration, we elected to use the elbow method (b), a visual inspection method that looks for the smallest k beyond which adding another cluster yields only minimal improvement (in this case, the sharpest bend in the plot of residual sum of squares, with respect to the centroids, against k). While there was no clear-cut change in gradient, an observable bend at k = 4 was noted, and as such k was initialized to 4.

(b) Elbow method plot: RSS against k. We pick the point with the sharpest change of gradient, here k = 4. (c) Cluster visualization, plotted on the first two principal components.

The initial round of clustering (c) produced roughly equal-sized clusters. Examining the cluster centroids, the key finding was that the four groups had properties corresponding to:

1. arpeggiated, repeated notes
2. small range
3. big melodic intervals, few repeated notes
4. long melodic arcs, highly consonant, big range

The EM algorithm (implemented as Gaussian mixture modelling), by contrast, returned 2 clusters, based on the Bayesian Information Criterion (BIC), -2 ln(L̂) + k ln(n); the lowest BIC across all models was achieved by a model with ellipsoidal distributions of equal volume and orientation and 2 clusters.

(d) Classification boundary, 2-cluster model. (e) Biplot of variables, 2-cluster model.

However, the EM solution has one cluster with only 9% of the data points (16 chorales), while the other cluster holds 91% of the data points. Furthermore, the classification decision boundary (on a two-dimensional feature space comprising the first two principal components of the data) (d) reveals a circular inner cluster coupled with an outer, scattered cluster.
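The EM clustering in the paper was carried out with R's mclust, which searches over covariance structures and numbers of clusters by BIC. A rough scikit-learn analogue of that selection (an illustration with stand-in data, not the original code) is sketched below.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def best_gmm_by_bic(X, max_k=6):
    """Fit Gaussian mixtures by EM for k = 1..max_k and several covariance
    structures, keeping the model with the lowest BIC = -2 ln(L) + p ln(n)."""
    best, best_bic = None, np.inf
    for k in range(1, max_k + 1):
        for cov in ("full", "tied", "diag", "spherical"):
            gmm = GaussianMixture(n_components=k, covariance_type=cov,
                                  random_state=0).fit(X)
            bic = gmm.bic(X)
            if bic < best_bic:
                best, best_bic = gmm, bic
    return best, best_bic

# Stand-in chorale feature matrix (183 chorales x 9 feature buckets) for illustration.
X = np.random.default_rng(0).normal(size=(183, 9))
gmm, bic = best_gmm_by_bic(X)
clusters = gmm.predict(X)        # hard assignments from the fitted EM posteriors
```

mclust additionally distinguishes equal versus varying volume, shape, and orientation across components, which is how the paper arrives at the "ellipsoidal, equal volume and orientation" description; scikit-learn's covariance types only approximate that family.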

Similarly, a biplot of all the variables (e) does not reveal two distinct cluster groups, while density plots of the two latent cluster groups (f) show that the two means overlap within a high-density region, making it hard to separate the clusters into linearly separable groups.

(f) Density plots, 2-cluster model. (g) Classification boundary, 4-cluster model.

Another round of EM was conducted, this time with k = 4 (in conformance with the elbow result). The results were not as clear as those of k-means, featuring a substantial proportion of outliers (g). The EM findings suggest that more variables might be required to take a more conclusive stance on cluster assignments, and that perhaps Bach did not actually write in multiple distinct styles. Furthermore, the EM algorithm's failure to detect any clear-cut clustering may stem from its required Gaussian assumption; some of the variables (e.g., dissonance) were more likely than not drawn from a Bernoulli model or some other non-Gaussian distribution.

5 Conclusions and Future Work

Our first goal, investigating the differences between musical lines through structural elements, proved useful in constructing a typology of voice type; the conclusions (mostly from the softmax model), which include bass lines having the widest jumps and largest range and soprano lines having faster melodic tempi, conform to theoretical part-writing rules.

However, our second objective, observing clustering within different Bach chorales, produced mixed results. A potential failing of the EM algorithm, as noted, could have been the Gaussian assumption, while the k-means centroid analysis invites further research, not only to verify that such groupings exist but also to see how they correlate with Bach's other works across time (his cantatas and large-scale masses).

It follows that future work would include investigating our second objective further. We could gather data for other composers and run EM or other algorithms to clarify the potential failing of our EM approach and to see whether style differences manifest. We would also perform further contextual analysis by extracting features for the date of composition (available only for some pieces). This would enable us to investigate not only whether a composer's style changed over time, but also how their compositional style corresponds to the liturgical calendar, which was a major influence on much of the Western musical tradition. Overall, the insights gleaned from more research in this area would shape musicians' understanding of these pieces and the subtleties that accompany them.

References

[1] Allan, Moray, and Christopher K. I. Williams. "Harmonising chorales by probabilistic inference." Advances in Neural Information Processing Systems 17 (2005): 25-32.
[2] Morris, Robert D. "New directions in the theory and analysis of musical contour." Music Theory Spectrum 15.2 (1993): 205-228.
[3] Quinn, Ian, and Panayotis Mavromatis. "Voice-Leading Prototypes and Harmonic Function in Two Chorale Corpora." MCM. 2011.
[4] Temperley, David. "Probabilistic Models of Melodic Interval." Music Perception: An Interdisciplinary Journal 32.1 (2014): 85-99.
[5] Witten, Ian H., Leonard C. Manzara, and Darrell Conklin. "Comparing human and computational models of music prediction." Computer Music Journal (1994): 70-80.
