Keep track of restaurant openings and closures in the city. https://twitter.com/BLRFoodCensus
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Nemo 5921640da3 Now, although it takes 5GB to run this once, we get a more detailed CSV 1 month ago
.gitignore Move to data/ directory 2 years ago
Gemfile Removes unused gems 2 years ago
Gemfile.lock Upgrade bundler 1 month ago
README.md Adds stats script 1 year ago
bootstrap.sh Run parallel and only download missing files 1 month ago
parse_dir.rb Initial commit 2 years ago
parse_listing.rb Now, although it takes 5GB to run this once, we get a more detailed CSV 1 month ago
parse_restaurants.rb Now, although it takes 5GB to run this once, we get a more detailed CSV 1 month ago
stats.rb Adds stats script 1 year ago

README.md

bengaluru-food-census

Keep track of restaurant openings and closures in the city.

Quirks

  • Zomato does not support HTTP/1.1, so wget can't be used.

Tech

This project uses GNU Parallel, Ruby, Nokogiri, and curl.

Features

  • Keep track of historical data using regularly generated CSV files
  • Does not use the API (since the rate-limit is too low at 1k/day)
    • We need to checkout around 15k restaurant status (closed or not)
  • Keep track of whether restaurant is still alive or not
  • Tweet any restaurant closures (or any new openings)

For now, run the following command to get a diff of new restaurants not in the old listings:

q -d , "SELECT * from ./2018-MM-DD.csv WHERE c1 not in (SELECT c1 from 2018-MM-DD.csv)"