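This article addresses a common obstacle in learning data analysis: finding suitable datasets to practice on. It collects more than 70 datasets from seven areas (transportation, meteorology, energy, information security, medicine, games, and economics and finance), each with a download link, so that readers can find data for hands-on practice and build their data analysis skills.
Learning data analysis takes steady hands-on practice, but many readers cannot find suitable datasets to practice on. We have put together 70+ datasets across 7 areas, so there is bound to be one that suits you. Bookmark and like this post!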
01. Transportation datasets
1. Pronto bike-share dataset (70.8MB)
https://www.heywhale.com/mw/dataset/58a515c48460306efcce2e96
2. European air passenger transport quarterly dataset (updated through Q2 2019) (63KB)
https://www.heywhale.com/mw/dataset/5d8d7af3037db3002d3a3685
3. 2015 US flight delays and cancellations dataset (192.3MB)
https://www.heywhale.com/mw/dataset/5d7f482a8499bc002c0dd67e
4. Minneapolis traffic volume data (3.1MB)
https://www.heywhale.com/mw/dataset/5d521a3ac143cf002b21ec27
5. Flight status (takeoff and landing) dataset (2GB)
https://www.heywhale.com/mw/dataset/59793a5a0d84640e9b2fedd3
6. Traffic checkpoint vehicle-passing dataset (100GB)
https://www.heywhale.com/mw/dataset/5d8d7af3037db3002d3a3685
7. Uber New York City ride data (109.1MB)
https://www.heywhale.com/mw/dataset/5de9ed47953ca8002c95d1e0
8. Mobike ride dataset (43.7MB)
https://www.heywhale.com/mw/dataset/5eb6787e366f4d002d77c331
9. Bike Share Toronto data, 2017-2020 (201.6MB)
https://www.heywhale.com/mw/dataset/60001de17ed5ab0015ecdaa9
10. Railway station codes for major Chinese cities (4.5KB)
https://www.heywhale.com/mw/dataset/5ffffe433441fd00153d48c1
02. Meteorological datasets
1. China historical typhoon best-track data
https://www.heywhale.com/mw/dataset/5d1f19119f53a9002ce5c812
2. Global surface temperature change data, 1750 to present (84MB)
https://www.heywhale.com/mw/dataset/58d1252b97c4b112cbb82b98
3. Major global earthquakes, 1965-2016 (2.3MB)
https://www.heywhale.com/mw/dataset/58d1172797c4b112cbb826db
4. El Niño dataset (9.6MB)
https://www.heywhale.com/mw/dataset/5d8473418499bc002c0f782a
5. China meteorological data (675.1MB)
https://www.heywhale.com/mw/dataset/5d240466688d36002c562b0c
6. Beijing air quality data (21.5MB)
https://www.heywhale.com/mw/dataset/5d240057688d36002c5623fd
7. China air quality dataset (1.2GB)
https://www.heywhale.com/mw/dataset/5eb6787e366f4d002d77c331
8. Australian bushfire dataset (100+MB)
https://www.heywhale.com/mw/dataset/5e21588a2823a10036b575bf
03. Energy datasets
1. Global energy: wind power forecasting dataset (24.5MB)
https://www.heywhale.com/mw/dataset/5c91b35ab4536a002bcf600e
2. Wind turbine dataset (12.3MB)
https://www.heywhale.com/mw/dataset/5c91ddbfb4536a002bcf7ac4
3. Chicago energy usage, 2010 (26.3MB)
https://www.heywhale.com/mw/dataset/5d8c8e24037db3002d3a0168
4. China water resources dataset (130KB+)
https://www.heywhale.com/mw/dataset/5d6f365d8499bc002c0a564f
5. Zhenjiang electricity data (17.7MB)
https://www.heywhale.com/mw/dataset/5d898e415ca5eb002c7940f6/file
04. Information security datasets
1. CNNVD, China's information security vulnerability database
https://www.heywhale.com/mw/dataset/5d81a3088499bc002c0e7642
2. NVD, the US National Vulnerability Database (335MB+)
https://www.heywhale.com/mw/dataset/5d81ab4c8499bc002c0e7baf
3. NSL_KDD dataset (25.3MB)
https://www.heywhale.com/mw/dataset/5d8325648499bc002c0ef79a
4. KDD-CUP99 network intrusion detection dataset (220MB+)
https://www.heywhale.com/mw/dataset/5d80a0458499bc002c0e3c7d
05. Medical datasets
1. Heart disease diagnosis dataset (17.6KB)
https://www.heywhale.com/mw/dataset/5bbde6233631bc00109c3704
2. Biomechanical features of orthopedic patients dataset (24.4KB)
https://www.heywhale.com/mw/dataset/5bfe52ce954d6e0010682abf
3. Ebola dataset (1.3MB)
https://www.heywhale.com/mw/dataset/5d75cb4c8499bc002c0bc6e2
4. Epileptic seizure recognition dataset (7.3MB)
https://www.heywhale.com/mw/dataset/5d6f83098499bc002c0a6d1c
5. 1,000 12-lead ECG recordings dataset (70.7MB)
https://www.heywhale.com/mw/dataset/5d678efa8499bc002c08c8f4/file
6. Cervical cancer risk factors dataset (99.7KB)
https://www.heywhale.com/mw/dataset/5d5a49ab8499bc002c045bf5
7. Gesture detection dataset (16.9MB)
https://www.heywhale.com/mw/dataset/5d536d25c143cf002b227b2e
8. Parkinson's disease diagnosis dataset (5.1MB)
https://www.heywhale.com/mw/dataset/5d5366e5c143cf002b22792d
9. Heart disease related dataset (11.1KB)
https://www.heywhale.com/mw/dataset/5d303eeacf76a60036e16d00
10. Sepsis prediction from multiple vital signs in the intensive care unit (ICU) (21.1KB)
https://www.heywhale.com/mw/dataset/5daeb6da75df5c002b212e23
11. Indwelling arterial catheter dataset (288.7KB)
https://www.heywhale.com/mw/dataset/5daec20075df5c002b21362a
12. Hepatitis C virus (HCV) dataset for Egyptian patients (158KB)
https://www.heywhale.com/mw/dataset/5da984f3c83fb400420faa72
13. Acute liver failure prediction dataset (848.6KB)
https://www.heywhale.com/mw/dataset/5da18334037db3002d445d79
14. Cardiovascular disease dataset (2.8MB)
https://www.heywhale.com/mw/dataset/5d9ecea5037db3002d3f0a49
15. Novel coronavirus (2019-nCoV) outbreak time series dataset (1.8MB)
https://www.heywhale.com/mw/dataset/5e3a6cf1b8c462002d66c4a6
06. Game datasets
1. League of Legends champion data (253.6KB)
https://www.heywhale.com/mw/dataset/5d22b238688d36002c54c6c1
2. Steam games roundup
https://www.heywhale.com/mw/dataset/5d1f0dde9f53a9002ce5bbad
3. 1 million Sudoku games (156.4MB)
https://www.heywhale.com/mw/dataset/5d8052a18499bc002c0e115d
4. Overwatch hero dataset (53.4KB)
https://www.heywhale.com/mw/dataset/5d6628b98499bc002c085abb
5. Dota2 game results dataset (21.3MB)
https://www.heywhale.com/mw/dataset/5d5a4b578499bc002c045d2c
6. Honor of Kings hero dataset (7.8KB)
https://www.heywhale.com/mw/dataset/5d22a87b688d36002c54bb91
7. Steam game dataset with user behavior (8.5MB)
https://www.heywhale.com/mw/dataset/5da18725037db3002d4468b3
07. Economic and financial datasets
1. PPDai (拍拍贷) internet finance data (320MB)
https://www.heywhale.com/mw/dataset/593ccb4523168e6e8923ab7f
2. P2P lending platform business data (400MB+)
https://www.heywhale.com/mw/dataset/593ccb4523168e6e8923ab7f
3. Anonymized data: 40,000 credit loan records and 4,000 cash loan records
https://www.heywhale.com/mw/dataset/58e4663a9ed26b1e09bfbaaf
4. Shanghai Stock Exchange A-share daily data for individual stocks (187.6MB)
https://www.heywhale.com/mw/dataset/58e4663a9ed26b1e09bfbaaf
5. Data for building a credit card scoring model (7.2MB)
https://www.heywhale.com/mw/dataset/5d0b261ae727f8002c84b156
6. Gold price data, 1978 to present (187.8KB)
https://www.heywhale.com/mw/dataset/5d313423cf76a60036e4bed0
7. Adult census income data (3.9MB)
https://www.heywhale.com/mw/dataset/5d1ebb0b9f53a9002ce5646d
8. Annual data for major Chinese cities: output, population, employment, education, and more (10KB)
https://www.heywhale.com/mw/dataset/5c3bf64ee8dbbb002b7bb589
9. Financial risk prediction dataset (2.1MB)
https://www.heywhale.com/mw/dataset/5d26cf27688d36002c58c790
10. 9,000 records of credit card usage (881.7KB)
https://www.heywhale.com/mw/dataset/5d269d8d688d36002c5896a9
11. Cryptocurrency market prices (39.0MB)
https://www.heywhale.com/mw/dataset/5c6bcda25136ba002b53537b
12. Three years of historical data for common exchange rates (31.6KB)
https://www.heywhale.com/mw/dataset/5c414ef989f4aa002b848642
13. Credit card fraud detection dataset (143.8MB)
https://www.heywhale.com/mw/dataset/5b56a592fc7e9000103c0442
14. Credit default probability prediction | Kaggle (7.2MB)
https://www.heywhale.com/mw/project/5dadfeb675df5c002b20fa45
15. Bank telemarketing dataset (2.1MB)
https://www.heywhale.com/mw/dataset/5caee0fae0ad99002cac0472
16. LendingClub loan data (421.3MB)
https://www.heywhale.com/mw/dataset/58a7fab4fbe7a30f28357645
17. Bitcoin historical trading data (221.1MB)
https://www.heywhale.com/mw/dataset/5ca5bc8f8408c1002b4a1f16
18. Dow Jones daily stock settlement data (1.6MB)
https://www.heywhale.com/mw/dataset/5c662c335136ba002b524be6
19. S&P 500 stock price data (128.1MB)
https://www.heywhale.com/mw/dataset/5bbdc2513631bc00109c29a4/file
20. PyPortfolioOpt stock prices (1.1MB)
https://www.heywhale.com/mw/dataset/5d09ea4be727f8002c7c1231
21. Tesla stock prices (168.7KB)
https://www.heywhale.com/mw/dataset/5d00c4fce727f8002c45a45d
22. Apple stock market historical data (41.2KB)
https://www.heywhale.com/mw/dataset/5d01b6c3e727f8002c4ac22f
23. Acquisition data for 7 top companies (69.4KB)
https://www.heywhale.com/mw/dataset/5d380723cf76a6003602b4d9
24. US health insurance market data (778.8MB)
https://www.heywhale.com/mw/dataset/5d78bc688499bc002c0cb7c1
25. India trade data (19.0MB)
https://www.heywhale.com/mw/dataset/5d6cd7ce8499bc002c09d9b2
26. Dow Jones index data for 30 large US-based companies (2.7MB)
https://www.heywhale.com/mw/dataset/5d5a3d15c143cf002b23fe5f
27. Google stock prices over the last ten years (165.5KB)
https://www.heywhale.com/mw/dataset/5d3fafd9cf76a6003622b209
28. Santander Bank customer transaction prediction data (244.3MB)
https://www.heywhale.com/mw/dataset/5d0add44e727f8002c824b39
29. Daily exchange rates of the euro against major international currencies (updated through 2019-09-26) (2.9MB)
https://www.heywhale.com/mw/dataset/5d8d823c037db3002d3a3af8
30. Kickstarter crowdfunding project dataset (55.3MB)
https://www.heywhale.com/mw/dataset/5def27202823a10036aaa562
31. MT4 History Center forex trading data for currency pairs (905.5MB)
https://www.heywhale.com/mw/dataset/5de0c063ca27f8002c4b1401
32. Santander customer value prediction dataset (31.5MB)
https://www.heywhale.com/mw/dataset/5dedda81953ca8002c96605a
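Further reading:
[1] How do data analysts build a metrics system? Understanding these four models is enough!
https://vip.kingdee.com/article/296943831369590016
[2] How should data analysts make recommendations properly?
https://vip.kingdee.com/article/296937594120846336
[3] Datasets (Part 2) | 70+ datasets across 10 areas, bookmark them now!
https://vip.kingdee.com/article/297369594749271040
[4] Datasets (Part 3) | 100+ datasets for artificial intelligence, bookmark them now!
https://vip.kingdee.com/article/297367760646876416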
Source: WeChat official account 数据万花筒
Published in the 数据智能 (Data Intelligence) community