A Tale Of Two Cities: Battle Of The Neighborhoods Capstone .

2y ago

13 Views

2 Downloads

1.05 MB

10 Pages

Last View : 28d ago

Last Download : 3m ago

Upload by : Matteo Vollmer

Report this link

Download PDF

Transcription

A Tale of Two Cities: Battle of the Neighborhoods Capstone Project ReportOxford, UK versus Atlanta, USA (Round 1)Author: Theresa K FosterDate: 15 May 2020The link to my Jupyter Notebook with the coding for this data analysis is available below:A Tale of Two Cities: Battle of the Neighborhoods Capstone Project Jupyter Notebook1. IntroductionOxford is a city in central southern England with a population size of around 155,000 people. The city is known for itsUniversity, which was established in the 12th century, but is also a hub for manufacturing, publishing and science basedindustries and research, as well as education and tourism. Atlanta is the capitol of the US state of Georgia and is themost populous city in the state with an estimated 498,044 residents. Atlanta is a culturally and economically diverse citywith dominant economic sectors including aerospace, transportation, professional and business services, media andmedical operations, and information technology.The aim of this project is to explore the neighborhoods in both cities and group them by common nearby venues. Thiswill assist anyone visiting or relocating between the cities to consider which areas are most similar to their currentneighborhood and therefore might offer their preferred range of amenities. This information is very useful when movingto an unknown city and will help narrow down the list of areas to search for a new home, thus speeding up therelocation process and avoiding overly long and potentially pricey stays in hotels or other temporary livingarrangements. Alternatively for those visiting between the cities, this information could be useful in deciding the bestlocation for a vacation rental or hotel booking, based on the interests and priorities of the traveler(s).2. DataThe following data sources were used to complete this project:1.2.3.4.5.Oxford postcode data from Doogal.co.uk updated 2020Atlanta zip code and neighborhood data from local real estate company (The Keen Team) 2020Cross referenced Atlanta zip code and neighborhood data from US Map Guide 2020US Longitude and Latitude data by zip code from Open Data Soft.com 2020Foursquare API

2.1. Oxford, UK Neighborhood Data Sourcing and CleaningThe data set (1) for Oxford was the most complete and included postal code data, ward (neighborhood) names, and thecorresponding latitude and longitude coordinates for all OX postcodes, which covers the entire county of Oxfordshire.The data was in the form of a downloadable excel spreadsheet, which I then cleaned and formatted to include onlyOxford city postcodes, ward (neighborhood) names and map coordinates. Finally, I reduced the list of wards byremoving duplicate values so that there would only be one occurrence of each neighborhood and corresponding data. Itshould be noted that this method randomly dropped duplicates so the remaining full postal codes corresponding to eachneighborhood were one of many possible options. Different post code choices would have had slightly differing latitudeand longitude coordinates. This may have affected the resulting venue data sourced from Foursquare and skewed theresults. I then uploaded this data set to my Jupyter notebook and used the insert to code function to transform it intoa pandas data frame.2.2 Atlanta, USA Neighborhood Data Sourcing and CleaningThe data sets (2)(3) used to source a list of Atlanta neighborhoods and corresponding postal codes (zip codes) weresimply lists from an Atlanta real estate website and a US map guide website respectively. I manually copied and inputthis data into an excel spreadsheet and added any differences between the data (missing or additional neighborhoods orzip codes) to ensure a more complete breakdown. Unfortunately data available from local city government sources wasnot in the required format so I could not use more authoritative sources. Therefore the breakdown of neighborhoods tozip codes in this data set should be taken as advisory only and may differ between data sets. Initially I was going to useGeopy Nominatum to find the map coordinates for each zip code. However the results were wildly inaccurate. As analternative I found and downloaded a spreadsheet of all US zip codes and corresponding latitude and longitudecoordinates (4) from the Open Soft Data website. I manually filtered this excel spreadsheet to list only Atlanta zip codesand map coordinates. I uploaded both excel sheets to my Jupyter notebook using the insert to code function totransform them into Pandas data frames, dropping any unnecessary columns. Finally, I combined the separate Atlantadata sets using a Pandas join function on the common column value of zip codes.

The resulting dataset for Oxford had 24 neighborhoods and the dataset for Atlanta had 28 neighborhoods.2.3 Final List of Neighborhoods Used for this ProjectOxford (24)Atlanta (28)3. MethodologyBefore sourcing the venue data, I completed initial visual analysis of the neighborhood data for both cities to view thelayout of the neighborhoods on a map. This was to ensure the coordinates were initially generally correct and to see thespread of the neighborhoods across each city, as they vary significantly in geographical size.Using the Nominatum tool in Geopy, I calculated the latitude and longitude coordinates of both cities.

I then used Folium to create maps of the two cities using the above generated coordinates. Finally, I was able to codemarkers onto each city map of the corresponding neighborhood coordinates using the data from the previously createddata frames.3.1 Oxford, UK Neighborhoods Map3.2 Atlanta, USA Neighborhoods Map

3.3 Foursquare API: Venue DataUsing Foursquare, I was able to generate a list of venues by category in each neighborhood based on the correspondingmap coordinates in the data sets for both cities. I set the radius to 500 and limited the venue results to 100 perneighborhood or set of coordinates. I then transformed this venue data into Pandas data frames (see below example ofOxford neighborhood venue data generated using Foursquare API.) The process was repeated for Atlantaneighborhoods.Finally, I created a new data frame for each city listing the top 10 most common venues in each neighborhood based onfrequency.3.4 Oxford Top 10 Venues by Neighborhood3.5 Atlanta Top 10 Venues by Neighborhood

3.6 Finding the best K for K-Means ClusteringK-Means is one of the most common methods of unsupervised machine learning for clustering. Using one hot encodingand mean frequency on the new data frames, I was able to then apply algorithms from the SciKit Learn library tocalculate the best K value for K-means clustering of the neighborhoods in each city. I initially used the Silhouette methodbut the results were inconclusive. I therefore tried the Elbow method (sum of squared distances) and achieved slightlybetter results in both cases. I used Matplotlib to plot the results.3.7 Finding K for Oxford ClusteringI determined the best K would be either 5 or 6 for the Oxford venue data. However after implementing both, it was clearthe neighborhood clustering stopped at 5.3.8 Finding K for Atlanta ClusteringI determined the best K could be 4 for the Atlanta venue data. However I felt that was a bit low for clustering 28neighborhoods and wanted there to be at least as many clusters in Atlanta as in Oxford. I implemented clustering using5 and 6 and ultimately choose 6 as a good option for K in this case.

3.9 K-Means Clustering NeighborhoodsUsing the K-means algorithm, I clustered the neighborhoods in both cities and merged this data with the Top 10 Venuedata frames. I also cleaned the data to ensure the clusters were integers and not floats, as otherwise they would notshow up properly on the maps using Folium.Oxford Clustered Neighborhoods Pandas Data frameAtlanta Clustered Neighborhoods Pandas Data frame

4. Results and Discussion4a. Mapping the Neighborhoods by ClustersUsing Folium once again and the new data frame including the top 10 venues in each neighborhood and the Clusterlabels, I mapped out the neighborhoods in both cities. The neighborhoods are color coded by cluster to show the clustergroupings visually.Map of Oxford Neighborhoods (Color Coded by Cluster)4b. Labelling and Initial Analysis by Cluster: OxfordCluster 4 (Orange): Pubs, Shopping Mall, Restaurants, Museums and BarsThis cluster is the largest by a significant margin and includes 17 of the 24 Oxford Neighborhoods. This could be due to anumber of factors including the range of venue types returned by Foursquare. As mentioned in the data section of thisreport, the venue list generated relies on the latitude and longitude coordinates provided for each neighborhood. Ifthese coordinates are not the optimal choice then the venue data may be inaccurate and this could have skewed thecluster results.Cluster 3 (Light Green): Cafes and ParksThis cluster is the second largest with 3 neighborhoods.Cluster 2 (Light Blue): Small Shops and FoodCluster 1 (Purple): Pubs and GymsCluster 0 (Red): Bus Transport, Boutiques and FoodThe remaining clusters were assigned one neighborhood each. It may be that these areas did not have enough venues toproperly cluster them or there were very distinctive venues. However looking at the top three venues listed for clusters0, 1 and 2, this does not seem likely. It is also possible they are heavily residential or zoned for business.

Map of Atlanta Neighborhoods (Color Coded by Cluster)4c. Labelling and Initial Analysis by Cluster: AtlantaCluster 0 (Red): Restaurants, Businesses, Tourist Attractions, Hotels, Breweries, Music Venues, BarsThis is by far the largest cluster of neighborhoods and we can see that neighborhoods across all areas of Atlanta havebeen included in this group. 21 of the 28 neighborhoods in Atlanta have been assigned to this cluster. As with cluster 4from the Oxford data, it may be that the neighborhoods in this cluster have too wide a range of venue results to be veryuseful as a measure of similarity. Clustering based on other data or a subsection of the venue data could be required tobetter categorize these neighborhoods and break them down into smaller and more distinct clusters. It may also be thatthe radius needs to be changed when generating the venue lists from Foursquare.Cluster 1 (Purple): Event Venues, Zoo Exhibits, and Fish MarketCluster 2 (Light Blue): Gyms, Fast Food and Sports StadiumsCluster 3 (Teal): Nature/Parks, Zoo and Fast FoodCluster 4 (Lime green): Residential Apartments, Gay Bars, and Smoke shopsCluster 5 (Orange): Discount shops, Playgrounds and Southern/Soul Food RestaurantsThe remaining clusters have only one neighborhood each. Again this may be due to inaccurate or incomplete venue dataor it may be the result of better clustering than the above Cluster 0.

4d. Comparing Neighborhood Clusters Between CitiesFor both cities we see a similar results pattern in the clustering of neighborhoods. Both have returned one clustercomprising the majority of the neighborhoods, with the remaining clusters generally having one neighborhood each.The most similar clusters between the two cities are these large clusters, Cluster 0 in Atlanta and Cluster 4 in Oxford.However it is clear that more clustering analysis on the basis of other data beyond nearby venues will be required tomore accurately group similar neighborhoods in each city. Even if this is accomplished, the results may still show thatthere are many neighborhood clusters that do not have direct comparison between these two cities. This could be dueto a number of factors, such as the geographical size and layout of the neighborhoods and differences in culture andlifestyle between the US and the UK. Further analysis and investigation is required.It may also be necessary to better clean the venue data returned by Foursquare API. As we can see below, some of thetop venues listed and used in the clustering analysis include uninformative categories such as ‘Bus Stop’ or‘Miscellaneous Shop’ or ‘Discount Store’. This may or may not be a significant venue and could be excluded for morestatistically significant venues. This is something to consider if this project were to be replicated.Top Five Venues in each Cluster: Oxford and Atlanta5. ConclusionsThis project has given us some insight into the amenities in the selected neighborhoods in both Oxford and Atlanta,which partially fulfills the intended purpose of the exercise. The information garnered provides a useful, albeit cursoryand broad, snapshot of each neighborhood. However based on the results it is clear we need more holistic data toimprove the accuracy and usefulness of our neighborhood clustering. If I were to redo this project, I would considerincluding data on population, cost of living, demographics, schools and transportation. I would also better clean thevenue data and ensure that the best map coordinates were being used to represent each neighborhood in order toimprove the accuracy of venue results. Finally, I would consider whether factors such as culture or geographical size andspread are impacting the results and how these could be minimized to better standardize the data and subsequentresults to ensure more accurate comparison.Thank you for reading! This project was created for my Coursera capstone course to complete my IBMProfessional Certificate in Data Science.

Cluster 2 (Light Blue): Gyms, Fast Food and Sports Stadiums Cluster 3 (Teal): Nature/Parks, Zoo and Fast Food Cluster 4 (Lime green): Residential Apartments, Gay Bars, and Smoke shops Cluster 5 (Orange): Discount shops, Playgrounds and Southern/Soul Food Restaurants The remaining clusters have only one neighborhood each.

Related Documents:

The Original Peter Rabbit Books By BEATRIX POTTER

*The Tale of Benjamin Bunny *The Tale of Mrs. Tiggy-Winkle *The Tale of Mr. Jeremy Fisher *The Tale of Jemima Puddle-Duck *The Tale of the Flopsy Bunnies *The Tale of Two Bad Mice *The Tale of Timmy Tiptoes *The Tale of Mr. Tod *The Tale of Pigling Bland *The Roly Poly Pudding *The Pie and the Patty-pan *Ginger and Pickles *The Story of Miss Moppet

49 Views

2y ago

Fairy Tale Printable Pack

Fairy Tale Printable Pack for reading, writing, and storytelling Included in this download are: *Fairy Tale Features Organizer- Displays the qualities of a fairy tale, organized by story elements (pg. 2) *Fairy Tale Features Recording Sheet- Students can jot down the features of a fairy tale as different ones are read to him/her (pg. 3)

40 Views

2y ago

0713 bioinformatics overview

"You are a little -- 168896 A Tale of Two Cities he detected all around him, walked from one to another. The first was the best room, and in it were -- 182459 A Tale of Two Cities g, that chateau of Monsieur the Marquis, with a large stone courtyard before it, and two stone sweep -- 232648 A Tale of Two Cities

12 Views

2y ago

The Chicago Manual of Style - 17th Edition - Alyve

Use headline-style capitalization for titles. Ex. A Tale of Two Cities Italicize titles of longer works such as books and journals. Put quotation marks around the titles of shorter works such as journal articles or essays in edited collections. Ex. A Tale of Two Cities . vs."An Essay on Dickens' A Tale of Two Cities "

15 Views

1y ago

CENTRAL TEXAS COLLEGE SYLLABUS FOR ENGL 2322 British ...

f. Chaucer - “Prologue” to Canterbury Tales, Interlinear translation (current edition in print) AND at least two of the following tales: (1) “The Miller’s Tale” (2) “The Nun’s Priest’s Tale” (3) “The Prioress’s Tale” (4) “The Pardoner’s Tale” (5) “The Franklin’s Tale

20 Views

2y ago

A Tale of Two Cities Study Guide Questions

A Tale of Two Cities Study Guide Questions Book the First: Recalled to Life Chapter 1: The Period & Chapter 2: The Mail & Chapter 3: The Night Shadows 1. What are the two cities of the novel’s title? 2. What purpose does the comparison of England and France serve? 3. What further comparison is implied by the connection of England and France? 4.

19 Views

3y ago

A of Two Cities

A Tale of Two Cities A Tale of Two Cities occupies a central place in the canon of Charles Dickens's works. This novel of the French Revolution was originally serialized in the author's own periodical All the Year Round. Weekly publication of chapters 1-3 of Book 1 be- gan on April 30, 1859. In an innovative move, Dickens simultaneously

15 Views

2y ago

A tale of two cities: Jerusalem and Tel Aviv in an age of ...

The tale of the two cities: the capital and the metropolis A short historical background of the two cities Jerusalem,locatedintheJudeaMountain,issacredto the three main monotheistic religions and is the capi-talcityofIsrael.1 Throughoutitshistory,ithasbee

5 Views

2y ago

Recent Views

Legal Proceedings and Legal Privilege Exemptions: Myth-busting - ICO

If asking for legal advice, say so, and start new email chain If giving legal advice, say so Involve lawyers (before litigation contemplated) Maintain confidentiality of legal advice documents Limit dissemination of legal advice (need to know; original only) Make internal communications re legal advice factual

1y ago

240 Views

Smart People Ask for (My) Advice: Seeking Advice Boosts .

advice strategically is likely to be a different experi-ence for the advice seeker than seeking advice with the intention of using it, from the advisor’s perspec-tive, strategic advice seeking may elicit the same per-ceptual effects as authentic advice seeking because the advice seeker’s intentions (and her reliance on advice)

3y ago

177 Views

Legal Action Group The Role of Advice Services in Health Outcomes

The Role of Advice Services in Health Outcomes Evidence Review and Mapping Study June 2015 The Role of Advice Services in Health Outcomes . tor.!Our! r,!

1y ago

170 Views

Legal Information vs Legal Advice Guidelines - TMCEC

giving legal advice. Legal advice is a written or oral statement that: o Interprets some aspect of the law, court rules, or court procedures; o Recommends a specific course of conduct a person should take in an actual or potential legal proceeding; or o Applies the law to the individual person's specific factual circumstances. What is Legal .

1y ago

225 Views

ProQual L2 Certificate Supporting Access to Legal Advice

R/502/7657 Communicating with legal advice clients 2 3 D/503/0822 Supporting clients to make use of the legal advice service 2 3 R/502/7660 Enabling legal advice clients to access signposting and referral opportunities 2 3 Optional Units - a minimum of 6 credits Unit Reference Number Unit Title Unit Level Credit Value

1y ago

173 Views

Guidance for opponents in civil legal aid cases - Scottish Legal Aid Board

injury case - may apply for civil legal aid (since this leaﬂet deals only with civil legal aid, where we refer to "legal aid" we mean "civil legal aid"). Legal aid is ﬁnancial help from public funds. It helps people who qualify to get legal advice and the help of a solicitor to put their case in court.

4m ago

110 Views

Priority Banking Tariff - Standard Chartered

Foreign exchange rate Free Free Free Free Free Free Free Free Free Free Free Free Free Free Free SMS Banking Daily Weekly Monthly. in USD or in other foreign currencies in VND . IDD rates min. VND 85,000 Annual Rental Fee12 Locker size Small Locker size Medium Locker size Large Rental Deposit12,13 Lock replacement

2y ago

206 Views

legal and ethical dimensions of practice - Dovetail

Material in this Guide should never be taken as providing you or any other person with legal advice. Legal advice regarding the application of the law to a particular circumstance or situation can only come from a legal practitioner. A range of sources for legal advice can be found in the Guide.

1y ago

167 Views

How Social Welfare Legal Advice and Social Prescribing can work .

The position of social welfare legal advice and its role in London's recovery The Mayor of London and partners should position social welfare legal advice as a core pillar of Londons recovery from the OVID-19 pandemic, with a core focus on ensuring adequate funding and practical support for advice agencies to ensure ongoing viability.

1y ago

172 Views

WHAT TO DO IF YOU ARE SEXUALLY HARASSED

There are many legal clinics or legal information centres you can contact to obtain legal information, educational resources or legal referrals. Alberta Central Alberta Community Legal Clinic (Red Deer) Centre for Public Legal Education Alberta Pro Bono Law Alberta Women's Centre Legal Advice Clinic (Calgary)

3y ago

245 Views

Legal Advocacy Essentials

Legal Advocacy Essentials: a core training for legal advocates Presented by the Washington State Coalition Against Domestic Violence, 2008. This information is not intended as a substitute for legal advice. 1 Legal Advocacy Essentials . A core training for legal advocates . Table of Contents . What is a legal advocate?

1y ago

249 Views

Legal & Corporate Services: Strategic Plan - CP6

the provision of legal advice, managing legal risk and managing the legal supply chain. By doing this well, the team will move towards its vision. Legal Services is made up of 4 teams, each serving different customers with a dedicated legal resource. This is summarised in the figure right. Although Legal Services has customerdistinct, -focussed .

1y ago

171 Views

Regulatory Guide RG 90 Example Statement of Advice: Scaled advice for a .

representatives and advisers who give personal advice to retail clients. It explains how and why we have developed an example Statement of Advice (SOA) for scaled advice (i.e. personal advice that is limited in scope) on personal insurance for a new retail client. The example SOA was developed in consultation with stakeholders, and we

1y ago

186 Views

Removal of licence disqualification - Legal Aid WA

agencies, permission must first be obtained from Legal Aid Western Australia. This Kit provides information about the law only and does not constitute legal advice. You should seek legal advice if you have a specific legal problem. Every effort is made to ensure that the information contai

2y ago

253 Views

Legal Information vs - txcourts.gov

giving legal advice. Legal advice is a written or oral statement that: Inter p rets some as ect of th elaw, courtles, or du s; Recomme nd s a pecific c ourse of ndu ters h ld k ein an actual or ntial legal proceeding; or 'sApplies th elaw to individu alperso n seci fic actu circums a . What is Legal Information?

1y ago

174 Views

A Tale Of Two Cities: Battle Of The Neighborhoods Capstone .

It looks like you're using an ad-blocker