November 3, 2017

Introduction

The New York city hotline receives non-urgent community concerns, which are made public by the city through NYC Open Data portal.

The dataset that we've chosen focuses on Noise complaints that were collected in 2016 and indicate ongoing party in a given neighborhood.

In this report we will show few different plots around this data set.

Getting the data

I've chosen this data set beause it includes time series (creation/close date), grouped information (incidents by city/borough) and geoposition of the incidents.

The data has been obtained from the Kaggle dataset 'partyntc'.

Data explanatory analysis

The data set contains 225,414 incidents (observations) with this structure:

## 'data.frame':    225414 obs. of  8 variables:
##  $ Created.Date : POSIXct, format: "2015-12-31 00:01:15" "2015-12-31 00:02:48" ...
##  $ Closed.Date  : POSIXct, format: "2015-12-31 03:48:04" "2015-12-31 04:36:13" ...
##  $ Location.Type: Factor w/ 6 levels "Club/Bar/Restaurant",..: 5 5 4 4 4 4 4 4 4 5 ...
##  $ Incident.Zip : num  10034 10040 10026 11231 10033 ...
##  $ City         : Factor w/ 49 levels "","ARVERNE","ASTORIA",..: 31 31 31 8 31 7 8 8 7 23 ...
##  $ Borough      : Factor w/ 6 levels "BRONX","BROOKLYN",..: 3 3 3 2 3 1 2 2 1 4 ...
##  $ Latitude     : num  40.9 40.9 40.8 40.7 40.9 ...
##  $ Longitude    : num  -73.9 -73.9 -74 -74 -73.9 ...

Incidents by month

This time series plot show the number of noise complains aggregated by month in 2016.

Incidents by Borough

This bar plot shows the incidents per Borough.

Incidents by location type

Pie chart with aggregated incidents by location type.