We've been collecting and mirroring whatever public scrapes we can find of data that has recently gone missing from federal sites, or is likely to in the near future. The repos here include public data from CDC, NIH, and NOAA. Be warned that some of these repos are quite large!
https://git.lsit.ucsb.edu/publicdata
#datascience #cdc #nih #noaa
=> More informations about this toot | More toots from vwbusguy@mastodon.online
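Since some of these repos are large, here is a minimal sketch of a shallow clone driven from Python. It assumes `git` is installed and that the repo page URLs in this thread also work as HTTPS clone URLs, which the thread does not confirm. The helper names `build_clone_command` and `shallow_clone` are hypothetical.

```python
import subprocess
from pathlib import Path


def build_clone_command(repo_url: str, dest: str, depth: int = 1) -> list[str]:
    """Return a git command that fetches only the most recent `depth` commits."""
    if depth < 1:
        raise ValueError("depth must be at least 1")
    return ["git", "clone", "--depth", str(depth), repo_url, dest]


def shallow_clone(repo_url: str, dest: str, depth: int = 1) -> None:
    """Run the shallow clone, skipping it if `dest` already exists."""
    if Path(dest).exists():
        print(f"{dest} already exists, skipping")
        return
    # check=True raises CalledProcessError if git exits with an error.
    subprocess.run(build_clone_command(repo_url, dest, depth), check=True)


if __name__ == "__main__":
    # Assumption: this repo page URL (linked later in the thread) is cloneable as-is.
    shallow_clone(
        "https://git.lsit.ucsb.edu/publicdata/CDC-Data-2025",
        "CDC-Data-2025",
    )
```

A shallow clone only trims history. If the latest commit itself holds many large files, the download will still be big.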
Stay tuned. There's more on the way!
=> More informations about this toot | More toots from vwbusguy@mastodon.online
Now including data from #DeptEd as well. Added more CDC data. More on the way!
https://git.lsit.ucsb.edu/publicdata/DeptEd
=> More informations about this toot | More toots from vwbusguy@mastodon.online
Here's the additional #CDC data that wasn't included in the original dump, with some of these reports going back decades, as well as reports on LGBTQ and HIV/AIDS.
https://git.lsit.ucsb.edu/publicdata/CDC-Data-2025/src/branch/main/other_reports
=> More informations about this toot | More toots from vwbusguy@mastodon.online
And now, a #Climate data archive from #ClimateGov:
https://git.lsit.ucsb.edu/publicdata/climate-gov-data
#Climate #datascience #archive
=> More informations about this toot | More toots from vwbusguy@mastodon.online
Added more DeptEd and CDC data today and added globalchange.gov data.
https://git.lsit.ucsb.edu/publicdata/globalchange-gov
=> More informations about this toot | More toots from vwbusguy@mastodon.online
I do appreciate those who have pointed out data that I've missed or otherwise haven't archived yet. Please do let me know if you see such things. Unfortunately, some data has use restrictions, and I'm only hosting public data here. If it's not public domain or clearly marked Creative Commons, etc., then I can't host it here.
=> More informations about this toot | More toots from vwbusguy@mastodon.online
@vwbusguy @adamhsparks Does this have public data files for the NCES sample surveys? Like Baccalaureate and Beyond?
=> More informations about this toot | More toots from DataAngler@vis.social
@DataAngler @adamhsparks There's a bunch of stuff in a different web layout that I haven't gotten yet. The two sections I got were the ones where the data was most straightforward to find. Some of the other sections of the site use ASP link generators for the downloads, and I haven't tried to fetch those yet.
=> More informations about this toot | More toots from vwbusguy@mastodon.online
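As a first step toward fetching pages like that, one could list the ASP links a page exposes. This is only a sketch using the Python standard library. How the NCES link generators actually work is not described in this thread, so the assumption here is simply that the downloads are reachable through ordinary `<a href>` links whose path ends in `.asp`. `AspLinkCollector` and `find_asp_links` are hypothetical names.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class AspLinkCollector(HTMLParser):
    """Collect absolute URLs of <a href> links whose path ends in .asp."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page URL before inspecting the path.
        absolute = urljoin(self.base_url, href)
        if urlparse(absolute).path.lower().endswith(".asp"):
            self.links.append(absolute)


def find_asp_links(html, base_url):
    """Return every .asp link found in `html`, in document order."""
    collector = AspLinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

Links built by JavaScript or form posts would not show up this way and would need separate handling.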
@DataAngler @adamhsparks I just added surveys and EDGE program data. Does this have what you're looking for?
=> More informations about this toot | More toots from vwbusguy@mastodon.online
@vwbusguy @adamhsparks I actually think those public-use datafiles are no longer available. Maybe NCES changed to only letting people interact with sample surveys through their online tools. (Only academic researchers can apply to use the restricted-use datafiles.) The list of sample surveys is here: https://nces.ed.gov/datalab
=> More informations about this toot | More toots from DataAngler@vis.social
@DataAngler @adamhsparks Some of these appear to be "restricted use", which means I'm not sure if I can legally host it here, but that doesn't mean you can't legally download it yourself.
https://nces.ed.gov/surveys/b&b/
=> More informations about this toot | More toots from vwbusguy@mastodon.online
@DataAngler @adamhsparks It looks like B&B might already be offline?
https://nces.ed.gov/pubsearch/pubsinfo.asp?pubid=2024483
=> More informations about this toot | More toots from vwbusguy@mastodon.online
@DataAngler @adamhsparks But yeah, looks like anything in DataLab isn't going to be something I can host here, given that it's not public domain, creative commons, etc.
https://nces.ed.gov/datalab/membership/login
=> More informations about this toot | More toots from vwbusguy@mastodon.online
@vwbusguy @adamhsparks Some surveys do have public-use datafiles, like the Early Childhood Longitudinal Studies (ECLS) program.
https://nces.ed.gov/ecls/dataproducts.asp
=> More informations about this toot | More toots from DataAngler@vis.social
@DataAngler @adamhsparks Good catch! I'll add the public ECLS data shortly.
=> More informations about this toot | More toots from vwbusguy@mastodon.online
@DataAngler @adamhsparks I added ECLS and the public Longitudinal data (under surveys).
=> More informations about this toot | More toots from vwbusguy@mastodon.online