microsoft-kb-metadata/README.md

26 lines
978 B
Markdown
Raw Normal View History

2023-01-12 10:12:13 +00:00
# Microsoft Knowledge Base metadata
2023-01-12 09:08:22 +00:00
This repository hosts a small subset of the Microsoft Knowledgebase metadata. The data in the `data.json` contains the following:
1. Date of the KB publication
2. KB UUID
3. KB Slug
2023-01-12 10:12:13 +00:00
4. KB URL
2023-01-12 09:08:22 +00:00
2023-01-12 10:12:13 +00:00
The list of KB IDs in the database is scraped from the URLs in `discovery.txt`. The primary usecase of the dataset is to provide a `KB:DATE`
mapping to other projects.
2024-07-02 08:55:35 +00:00
## wip
Discovery notes. Need to check the sitemaps more thoroughly before i automate this
See https://learn.microsoft.com/_sitemaps/sitemapindex.xml
```
curl --silent https://learn.microsoft.com/_sitemaps/officeupdates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
curl --silent https://learn.microsoft.com/_sitemaps/security-updates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
```
2023-01-12 10:12:13 +00:00
## license
2023-01-12 10:25:16 +00:00
Data and code in this repository is licensed under [Creative Commons Zero v1.0 Universal](https://choosealicense.com/licenses/cc0-1.0/).