stackshare-dataset

DOI: 10.5281/zenodo.10554437

A dataset from stackshare.io providing lists of packages and various services. While lists of packages for various ecosystems are easily available elsewhere, a list of services is much harder to find.

See tools.csv for the complete list. I'd recommend sorting by popularity and using the top 2.5-3k results, depending on your use case.
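The sorting suggested above can be sketched in Python as follows. This is a minimal sketch, not part of the repository: it assumes tools.csv has a header row with a numeric `popularity` column (the column name is taken from the browse URL's sort parameter), and `top_tools` is a hypothetical helper name.

```python
import csv
import io


def top_tools(csv_text, n=3000):
    """Return the n most popular rows from the text of tools.csv."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Assumption: "popularity" is numeric; blank or missing cells count as 0.
    rows.sort(key=lambda r: float(r["popularity"] or 0), reverse=True)
    return rows[:n]


# Usage (assumes tools.csv is in the current directory):
#   with open("tools.csv", newline="", encoding="utf-8") as f:
#       top = top_tools(f.read(), n=3000)
```

Adjust `n` anywhere in the 2500-3000 range depending on how much of the long tail you want to keep.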

Browse the dataset here: https://flatgithub.com/captn3m0/stackshare-dataset?filename=tools.csv&sort=popularity%2Cdesc&stickyColumnName=url.

License

This stackshare-dataset is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/. Some individual contents of the database are under copyright by Stackshare.

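You are free:

  - To share: to copy, distribute and use the database.
  - To create: to produce works from the database.
  - To adapt: to modify, transform and build upon the database.

As long as you:

  - Attribute: credit the database as specified in the ODbL in any public use of it or of works produced from it.
  - Share-alike: offer any publicly used adapted version of the database under the ODbL as well.
  - Keep open: if you redistribute the database or an adapted version with technological restrictions, also redistribute a version without them.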

Generating

Ensure you have GNU Make, Python, and wget installed, then run:

make tools.csv

The scraper uses the following as sources:

  1. Sitemap (https://stackshare.io/sitemap.xml)
  2. StackShare Search for enriching service results (https://stackshare.io/search)

The package results are not enriched, since much better data for those is available elsewhere.
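To illustrate the first source, here is a minimal sketch of extracting URLs from a sitemap document in Python. It is not the repository's scraper: `sitemap_urls` is a hypothetical helper, and it assumes the sitemap uses the standard sitemaps.org namespace. Because it collects every `<loc>` element, it works on both a plain sitemap and a sitemap index that points to child sitemaps.

```python
import xml.etree.ElementTree as ET

# Standard namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text):
    """Return every <loc> URL found in a sitemap or sitemap index."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.iterfind(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]
```

The input would be the text of a sitemap fetched beforehand, for example with wget.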