microsoft-kb-metadata/README.md
2024-07-02 14:25:35 +05:30

26 lines
978 B
Markdown

# Microsoft Knowledge Base metadata
This repository hosts a small subset of the Microsoft Knowledgebase metadata. The data in the `data.json` contains the following:
1. Date of the KB publication
2. KB UUID
3. KB Slug
4. KB URL
The list of KB IDs in the database is scraped from the URLs in `discovery.txt`. The primary usecase of the dataset is to provide a `KB:DATE`
mapping to other projects.
## wip
Discovery notes. Need to check the sitemaps more thoroughly before i automate this
See https://learn.microsoft.com/_sitemaps/sitemapindex.xml
```
curl --silent https://learn.microsoft.com/_sitemaps/officeupdates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
curl --silent https://learn.microsoft.com/_sitemaps/security-updates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
```
## license
Data and code in this repository is licensed under [Creative Commons Zero v1.0 Universal](https://choosealicense.com/licenses/cc0-1.0/).