Merged
19 changes: 13 additions & 6 deletions README.md
@@ -24,18 +24,24 @@ Support multiple data-sources.
![Covid-19 Recovered](https://covid19-badges.herokuapp.com/recovered/latest)
![Covid-19 Deaths](https://covid19-badges.herokuapp.com/deaths/latest)

## New York Times is now available as a source!

**Specify the source parameter with `?source=nyt`. NYT also provides a time series! To view timelines of cases by US counties, use `?source=nyt&timelines=true`.**
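The usage described above can be sketched as a request URL. The base URL and endpoint below are placeholder assumptions for illustration; only the query-string parameters come from this README:

```python
from urllib.parse import urlencode

# Placeholder deployment URL -- an assumption for illustration, not part of this diff.
BASE = "https://example-covid-api.herokuapp.com/v2/locations"

# Query parameters documented above: NYT source with county timelines enabled.
params = {"source": "nyt", "timelines": "true"}
url = f"{BASE}?{urlencode(params)}"
print(url)  # → https://example-covid-api.herokuapp.com/v2/locations?source=nyt&timelines=true
```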

## Recovered cases showing 0

**JHU (our main data provider) [no longer provides data for amount of recoveries](https://github.com/CSSEGISandData/COVID-19/issues/1250), and as a result, the API will be showing 0 for this statistic. Apolegies for any inconvenience. Hopefully we'll be able to find an alternative data-source that offers this.**
**JHU (our main data provider) [no longer provides data for amount of recoveries](https://github.com/CSSEGISandData/COVID-19/issues/1250), and as a result, the API will be showing 0 for this statistic. Apologies for any inconvenience. Hopefully we'll be able to find an alternative data-source that offers this.**

## Available data-sources:

Currently 2 different data-sources are available to retrieve the data:
Currently 3 different data-sources are available to retrieve the data:

* **jhu** - https://github.com/CSSEGISandData/COVID-19 - Worldwide Data repository operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE).

* **csbs** - https://www.csbs.org/information-covid-19-coronavirus - U.S. County data that comes from the Conference of State Bank Supervisors.

* **nyt** - https://github.com/nytimes/covid-19-data - The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time.

__jhu__ data-source will be used as a default source if you don't specify a *source parameter* in your request.

## API Reference
@@ -71,7 +77,8 @@ __Sample response__
{
"sources": [
"jhu",
"csbs"
"csbs",
"nyt"
]
}
```
@@ -87,7 +94,7 @@ GET /v2/latest
__Query String Parameters__
| __Query string parameter__ | __Description__ | __Type__ |
| -------------------------- | -------------------------------------------------------------------------------- | -------- |
| source | The data-source where data will be retrieved from *(jhu/csbs)*. Default is *jhu* | String |
| source | The data-source where data will be retrieved from *(jhu/csbs/nyt)*. Default is *jhu* | String |

__Sample response__
```json
@@ -117,7 +124,7 @@ __Path Parameters__
__Query String Parameters__
| __Query string parameter__ | __Description__ | __Type__ |
| -------------------------- | -------------------------------------------------------------------------------- | -------- |
| source | The data-source where data will be retrieved from *(jhu/csbs)*. Default is *jhu* | String |
| source | The data-source where data will be retrieved from *(jhu/csbs/nyt)*. Default is *jhu* | String |

#### Example Request
```http
@@ -160,7 +167,7 @@ GET /v2/locations
__Query String Parameters__
| __Query string parameter__ | __Description__ | __Type__ |
| -------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------ | -------- |
| source | The data-source where data will be retrieved from.<br>__Value__ can be: *jhu/csbs*. __Default__ is *jhu* | String |
| source | The data-source where data will be retrieved from.<br>__Value__ can be: *jhu/csbs/nyt*. __Default__ is *jhu* | String |
| country_code | The ISO ([alpha-2 country_code](https://en.wikipedia.org/wiki/ISO_3166-1_alpha-2)) to the Country/Province for which you're calling the Endpoint | String |
| timelines | To set the visibility of timelines (*daily tracking*).<br>__Value__ can be: *0/1*. __Default__ is *0* (timelines are not visible) | Integer |

3 changes: 2 additions & 1 deletion app/data/__init__.py
@@ -1,9 +1,10 @@
"""app.data"""
from ..services.location.csbs import CSBSLocationService
from ..services.location.jhu import JhuLocationService
from ..services.location.nyt import NYTLocationService

# Mapping of services to data-sources.
DATA_SOURCES = {"jhu": JhuLocationService(), "csbs": CSBSLocationService()}
DATA_SOURCES = {"jhu": JhuLocationService(), "csbs": CSBSLocationService(), "nyt": NYTLocationService()}


def data_source(source):
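The body of `data_source` is collapsed in this view. Judging from the `DATA_SOURCES` mapping above, it is presumably a plain dict lookup; here is a minimal sketch under that assumption (the stub service classes are stand-ins, not the real implementations):

```python
# Stand-in service classes -- the real ones live under app/services/location.
class JhuLocationService: ...
class CSBSLocationService: ...
class NYTLocationService: ...

# Mapping of services to data-sources, as in the diff above.
DATA_SOURCES = {"jhu": JhuLocationService(), "csbs": CSBSLocationService(), "nyt": NYTLocationService()}

def data_source(source):
    """Return the service for the given data-source name, or None if unknown."""
    return DATA_SOURCES.get(source.lower())
```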
1 change: 1 addition & 0 deletions app/enums/sources.py
@@ -8,3 +8,4 @@ class Sources(str, Enum):

jhu = "jhu"
csbs = "csbs"
nyt = "nyt"
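Because `Sources` mixes in `str`, members compare equal to their plain string values, so a raw query-string value can be validated simply by constructing the enum. A small self-contained demonstration:

```python
from enum import Enum

class Sources(str, Enum):
    """Available data-sources, mirroring app/enums/sources.py."""
    jhu = "jhu"
    csbs = "csbs"
    nyt = "nyt"

# A raw query-string value round-trips through the enum...
assert Sources("nyt") is Sources.nyt
# ...and the str mixin keeps members equal to plain strings.
assert Sources.nyt == "nyt"
```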
32 changes: 32 additions & 0 deletions app/location/nyt.py
@@ -0,0 +1,32 @@
"""app.location.nyt.py"""
from . import TimelinedLocation


class NYTLocation(TimelinedLocation):
"""
A NYT (county) TimelinedLocation.
"""

# pylint: disable=too-many-arguments,redefined-builtin
def __init__(self, id, state, county, coordinates, last_updated, timelines):
super().__init__(id, "US", state, coordinates, last_updated, timelines)

self.state = state
self.county = county

def serialize(self, timelines=False): # pylint: disable=arguments-differ,unused-argument
"""
Serializes the location into a dict.

:returns: The serialized location.
:rtype: dict
"""
serialized = super().serialize(timelines)

# Update with new fields.
serialized.update(
{"state": self.state, "county": self.county,}
)

# Return the serialized location.
return serialized
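To see the `serialize()` override in action, here is a runnable sketch with a minimal stand-in for `TimelinedLocation` (the real base class carries more fields; only the update pattern is taken from the diff above):

```python
# Minimal stand-in for TimelinedLocation -- just enough to exercise the override.
class TimelinedLocation:
    # pylint: disable=too-many-arguments,redefined-builtin
    def __init__(self, id, country, province, coordinates, last_updated, timelines):
        self.id = id
        self.country = country
        self.province = province
        self.coordinates = coordinates
        self.last_updated = last_updated
        self.timelines = timelines

    def serialize(self, timelines=False):
        """Serialize the base fields into a dict."""
        return {"id": self.id, "country": self.country, "last_updated": self.last_updated}


class NYTLocation(TimelinedLocation):
    """A NYT (county) TimelinedLocation."""

    def __init__(self, id, state, county, coordinates, last_updated, timelines):
        super().__init__(id, "US", state, coordinates, last_updated, timelines)
        self.state = state
        self.county = county

    def serialize(self, timelines=False):
        serialized = super().serialize(timelines)
        # Extend the base dict with the county-level fields.
        serialized.update({"state": self.state, "county": self.county})
        return serialized


loc = NYTLocation(0, "Washington", "Snohomish", None, "2020-03-20T00:00:00Z", {})
print(loc.serialize())
```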
123 changes: 123 additions & 0 deletions app/services/location/nyt.py
@@ -0,0 +1,123 @@
"""app.services.location.nyt.py"""
import csv
from datetime import datetime

from asyncache import cached
from cachetools import TTLCache

from ...coordinates import Coordinates
from ...location.nyt import NYTLocation
from ...timeline import Timeline
from ...utils import httputils
from . import LocationService


class NYTLocationService(LocationService):
"""
Service for retrieving locations from New York Times (https://github.com/nytimes/covid-19-data).
"""

async def get_all(self):
# Get the locations.
locations = await get_locations()
return locations

async def get(self, loc_id): # pylint: disable=arguments-differ
# Get location at the index equal to provided id.
locations = await self.get_all()
return locations[loc_id]


# ---------------------------------------------------------------


# Base URL for fetching category.
BASE_URL = "https://raw.githubusercontent.com/nytimes/covid-19-data/master/us-counties.csv"


def get_grouped_locations_dict(data):
"""
Helper function to group history for locations into one dict.

:returns: The complete data for each unique US county
:rtype: dict
"""
grouped_locations = {}

# in increasing order of dates
for row in data:
county_state = (row["county"], row["state"])
date = row["date"]
confirmed = row["cases"]
deaths = row["deaths"]

# initialize if not existing
if county_state not in grouped_locations:
grouped_locations[county_state] = {"confirmed": [], "deaths": []}

# append confirmed tuple to county_state (date, # confirmed)
grouped_locations[county_state]["confirmed"].append((date, confirmed))
# append deaths tuple to county_state (date, # deaths)
grouped_locations[county_state]["deaths"].append((date, deaths))

return grouped_locations


@cached(cache=TTLCache(maxsize=1024, ttl=3600))
async def get_locations():
"""
Returns a list containing parsed NYT data by US county. The data is cached for 1 hour.

:returns: The complete data for US Counties.
:rtype: list
"""

# Request the data.
async with httputils.CLIENT_SESSION.get(BASE_URL) as response:
text = await response.text()

# Parse the CSV.
data = list(csv.DictReader(text.splitlines()))

# Group together locations (NYT data ordered by dates not location).
grouped_locations = get_grouped_locations_dict(data)

# The normalized locations.
locations = []

for idx, (county_state, histories) in enumerate(grouped_locations.items()):
# Make location history for confirmed and deaths from dates.
# List is tuples of (date, amount) in order of increasing dates.
confirmed_list = histories["confirmed"]
confirmed_history = {date: int(amount or 0) for date, amount in confirmed_list}

deaths_list = histories["deaths"]
deaths_history = {date: int(amount or 0) for date, amount in deaths_list}

# Normalize the item and append to locations.
locations.append(
NYTLocation(
id=idx,
state=county_state[1],
county=county_state[0],
coordinates=Coordinates(None, None), # NYT does not provide coordinates
last_updated=datetime.utcnow().isoformat() + "Z", # since last request
timelines={
"confirmed": Timeline(
{
datetime.strptime(date, "%Y-%m-%d").isoformat() + "Z": amount
for date, amount in confirmed_history.items()
}
),
"deaths": Timeline(
{
datetime.strptime(date, "%Y-%m-%d").isoformat() + "Z": amount
for date, amount in deaths_history.items()
}
),
"recovered": Timeline({}),
},
)
)

return locations
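The grouping and date-normalization steps above can be exercised in isolation with a few rows in the same shape as the sample CSV (a standalone sketch, not the project's test suite):

```python
import csv
from datetime import datetime

ROWS = """date,county,state,fips,cases,deaths
2020-03-11,Snohomish,Washington,53061,69,1
2020-03-12,Snohomish,Washington,53061,107,3
2020-01-24,Cook,Illinois,17031,1,0"""

def get_grouped_locations_dict(data):
    """Group per-date rows into one history dict per (county, state)."""
    grouped_locations = {}
    for row in data:
        key = (row["county"], row["state"])
        grouped_locations.setdefault(key, {"confirmed": [], "deaths": []})
        grouped_locations[key]["confirmed"].append((row["date"], row["cases"]))
        grouped_locations[key]["deaths"].append((row["date"], row["deaths"]))
    return grouped_locations

grouped = get_grouped_locations_dict(csv.DictReader(ROWS.splitlines()))

# Each county keeps its histories in date order (NYT data is ordered by date).
print(grouped[("Snohomish", "Washington")]["confirmed"])  # → [('2020-03-11', '69'), ('2020-03-12', '107')]

# Dates are then normalized into the ISO timestamps used as Timeline keys.
print(datetime.strptime("2020-03-11", "%Y-%m-%d").isoformat() + "Z")  # → 2020-03-11T00:00:00Z
```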
49 changes: 49 additions & 0 deletions tests/example_data/counties.csv
@@ -0,0 +1,49 @@
date,county,state,fips,cases,deaths
2020-01-21,Snohomish,Washington,53061,1,0
2020-01-22,Snohomish,Washington,53061,1,0
2020-01-23,Snohomish,Washington,53061,1,0
2020-01-24,Cook,Illinois,17031,1,0
2020-01-24,Snohomish,Washington,53061,1,0
2020-01-25,Orange,California,06059,1,0
2020-01-25,Cook,Illinois,17031,1,0
2020-01-25,Snohomish,Washington,53061,1,0
2020-01-26,Maricopa,Arizona,04013,1,0
2020-01-26,Los Angeles,California,06037,1,0
2020-01-26,Orange,California,06059,1,0
2020-01-26,Cook,Illinois,17031,1,0
2020-01-26,Snohomish,Washington,53061,1,0
2020-01-27,Maricopa,Arizona,04013,1,0
2020-01-27,Los Angeles,California,06037,1,0
2020-01-27,Orange,California,06059,1,0
2020-01-27,Cook,Illinois,17031,1,0
2020-01-27,Snohomish,Washington,53061,1,0
2020-01-28,Maricopa,Arizona,04013,1,0
2020-01-28,Los Angeles,California,06037,1,0
2020-01-28,Orange,California,06059,1,0
2020-01-28,Cook,Illinois,17031,1,0
2020-01-28,Snohomish,Washington,53061,1,0
2020-01-29,Maricopa,Arizona,04013,1,0
2020-01-29,Los Angeles,California,06037,1,0
2020-01-29,Orange,California,06059,1,0
2020-01-29,Cook,Illinois,17031,1,0
2020-01-29,Snohomish,Washington,53061,1,0
2020-01-30,Maricopa,Arizona,04013,1,0
2020-01-30,Los Angeles,California,06037,1,0
2020-01-30,Orange,California,06059,1,0
2020-01-30,Cook,Illinois,17031,2,0
2020-01-30,Snohomish,Washington,53061,1,0
2020-01-31,Maricopa,Arizona,04013,1,0
2020-01-31,Los Angeles,California,06037,1,0
2020-01-31,Orange,California,06059,1,0
2020-01-31,Santa Clara,California,06085,1,0
2020-01-31,Cook,Illinois,17031,2,0
2020-01-31,Snohomish,Washington,53061,1,0
2020-02-28,Snohomish,Washington,53061,2,0
2020-03-10,Snohomish,Washington,53061,61,0
2020-03-11,Snohomish,Washington,53061,69,1
2020-03-12,Snohomish,Washington,53061,107,3
2020-03-15,Snohomish,Washington,53061,175,3
2020-03-17,Snohomish,Washington,53061,265,4
2020-03-18,Snohomish,Washington,53061,309,5
2020-03-19,Snohomish,Washington,53061,347,6
2020-03-20,Snohomish,Washington,53061,384,7