mirror of
https://github.com/captn3m0/microsoft-kb-metadata.git
synced 2024-09-19 15:57:01 +00:00
26 lines
978 B
Markdown
# Microsoft Knowledge Base metadata

This repository hosts a small subset of the Microsoft Knowledge Base (KB) metadata. `data.json` contains the following for each KB:
1. Date of the KB publication
2. KB UUID
3. KB Slug
4. KB URL

The list of KB IDs in the database is scraped from the URLs in `discovery.txt`. The primary use case of the dataset is to provide a `KB:DATE` mapping to other projects.
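
As a rough illustration of the `KB:DATE` use case, here is a minimal Python sketch. The layout of `data.json` is not documented here, so the shape below (an object keyed by KB ID, each value carrying a `date` field) is an assumption, and the sample record and the `kb_date_map` helper are hypothetical placeholders rather than real data or code from this repository.

```python
import json

# ASSUMPTION: data.json is an object keyed by KB ID, and each value has a
# "date" field. The record below is a made-up placeholder, not real data.
SAMPLE = """
{
  "KB0000001": {
    "date": "2020-01-01",
    "uuid": "00000000-0000-0000-0000-000000000001",
    "slug": "example-slug",
    "url": "https://example.invalid/kb/1"
  }
}
"""


def kb_date_map(raw_json: str) -> dict[str, str]:
    """Reduce the assumed records to a flat KB -> date mapping."""
    records = json.loads(raw_json)
    # Skip entries without a date instead of failing on them.
    return {kb: entry["date"] for kb, entry in records.items() if "date" in entry}


if __name__ == "__main__":
    for kb, date in sorted(kb_date_map(SAMPLE).items()):
        print(f"{kb}:{date}")
```

To try it against the real file, read `data.json` into a string and pass it to `kb_date_map`, adjusting the key names if the actual schema differs.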
## wip

Discovery notes. I need to check the sitemaps more thoroughly before automating this.

See https://learn.microsoft.com/_sitemaps/sitemapindex.xml
```
curl --silent https://learn.microsoft.com/_sitemaps/officeupdates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
curl --silent https://learn.microsoft.com/_sitemaps/security-updates_en-us_1.xml | yq -p xml -o c '.urlset.url[]|.loc' >> discovery.txt
```
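
If this step gets automated, the `<loc>` extraction could also be done in Python. The sketch below is one possible approach and not the repository's actual tooling: it parses a sitemap string with the standard library and collects each `urlset > url > loc` value, mirroring the `yq` path above. The `extract_locs` helper and the sample document are hypothetical, and the code ignores XML namespaces on the assumption that a sitemap may or may not declare one.

```python
import xml.etree.ElementTree as ET

# Made-up sample sitemap for illustration; the URLs are placeholders.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.invalid/page-one</loc></url>
  <url><loc>https://example.invalid/page-two</loc></url>
</urlset>
"""


def local_name(tag: str) -> str:
    """Drop a leading "{namespace}" prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def extract_locs(sitemap_xml: str) -> list[str]:
    """Return the text of every urlset > url > loc element, in document order."""
    root = ET.fromstring(sitemap_xml)
    locs = []
    for url in root:
        if local_name(url.tag) != "url":
            continue
        for child in url:
            if local_name(child.tag) == "loc" and child.text:
                locs.append(child.text.strip())
    return locs


if __name__ == "__main__":
    # Print one URL per line, the same shape as the lines in discovery.txt.
    print("\n".join(extract_locs(SAMPLE_SITEMAP)))
```

Fetching the sitemap itself is left out on purpose, so the example runs offline.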
## license

Data and code in this repository are licensed under [Creative Commons Zero v1.0 Universal](https://choosealicense.com/licenses/cc0-1.0/).