心慌意乱是什么意思| 什么手机拍照效果最好| 有出息是什么意思| 洋葱为什么会让人流泪| 有氧运动是什么意思| 才美不外见的见是什么意思| 压脚背有什么好处| 玲珑是什么意思| 财星是什么意思| 为什么吃辣的就拉肚子| 打喷嚏代表什么| 殳是什么意思| 申酉是什么时间| 羲什么意思| herb是什么意思| 刮痧不出痧是什么原因| 莴笋不能和什么一起吃| 日前是什么意思| 凿壁偷光告诉我们什么道理| 一个草字头一个氏念什么| 玛丽苏什么意思| 李世民属什么生肖| 什么蔬菜含维生素d| 西洋参和人参有什么区别| 生肖狗和什么生肖相冲| 血小板低吃什么补得快| 四川九寨沟什么时候去最好| 红枣什么时候吃最好| 嗓子发炎吃什么| 一什么绿毯| 步后尘是什么意思| 玫瑰疹是什么病| 鸡蛋壳属于什么垃圾| 虚妄是什么意思| 斐乐是什么档次| 精华液是干什么的| po是什么| 秋葵什么时候播种| 没有味觉是什么病| 前列腺增生吃什么药| 头什么脚什么| 靖康耻指的是什么历史事件| 令尹是什么官职| 心三联是指什么| ny是什么品牌| 筑基是什么意思| 梦到自己流鼻血是什么预兆| 甘油三酯查什么项目| 淼淼是什么意思| 葡萄上的白霜是什么| 二尖瓣反流吃什么药| 人中长痘痘什么原因| 经常想睡觉是什么原因| 梦见两条大蟒蛇是什么征兆| tbc是什么意思| 老匹夫是什么意思| 什么叫空调病| 返流性食管炎用什么药| 尿酸高是什么症状| 苏慧伦为什么不老| 什么的故事填词语| 儿童身高矮小挂什么科| 供血不足吃什么药好| 孔子是什么家| 重色轻友什么意思| amo是什么意思| 离子四项是检查什么的| 吃饭咬到舌头什么原因| 香其酱是什么酱| 肩周炎看什么科| 翡翠是什么材质| 失联是什么意思| 什么样的阳光填形容词| 男生13厘米属于什么水平| 每天都做梦是什么原因| 尿酸低会引发什么症状| 浮肿是什么原因引起的| 耄耋之年是什么意思| 什么人适合吃人参| 打脚是什么意思| 儿童水痘吃什么药| 性格是什么意思| 朝鲜韩国什么时候分开的| 办理健康证需要什么| 眷属是什么意思| 年薪20万算什么水平| 天孤星是什么意思| 关节退行性改变是什么意思| 肾气不足吃什么中成药| 快乐大本营为什么停播| 有什么好看的国漫| 爱屋及乌什么意思| 每天吃鸡蛋有什么好处和坏处| 解落三秋叶的解是什么意思| 乔顿男装属于什么档次| ken是什么意思| 心影饱满是什么意思| 植物都有什么| 枕戈待旦什么意思| 怀孕脚浮肿是什么原因引起的| 什么是预防医学| 飞黄腾达是什么生肖| 造化是什么意思| 大便变细是什么原因| 一什么十什么的成语| 橘红是什么| 流口水是什么病的前兆| 蓝天白云是什么意思| 心慌什么原因引起的| 春分是什么意思| 煊字五行属什么| 地域黑什么意思| 士加一笔是什么字| 人格是什么意思| h型高血压是什么意思| 用什么泡脚能减肥| 蠕动什么意思| 员级职称是什么意思| 2月出生的是什么星座| 脉细滑是什么意思| 翌日是什么意思| 太阳黑子是什么东西| 做爱女生什么感觉| 万能输血者是什么血型| 缠腰蛇是什么原因引起的| 肾尿盐结晶是什么意思| 胎儿胆囊偏小有什么影响| 什么植物和动物很像鸡| 头部出汗多吃什么药| 爱拍马屁的动物是什么生肖| 梦见爸爸去世预兆什么| 什么是软文| 人言轻微是什么意思| 一什么缸| 莫西沙星片主治什么病| 人性是什么意思| 婴儿坐高铁需要什么证件| 什么叫变应性鼻炎| 湿热吃什么中药| 网络诈骗打什么电话| 束带是什么| 蔬菜都有什么| pp材质是什么| 谷草转氨酶是什么意思| 怀孕早期需要注意什么| 为什么会得多囊| 肛门瘙痒涂什么药膏| 乾字五行属什么| 为什么打哈欠会流泪| 槲皮素是什么东西| 缺钙吃什么补钙最快| 李子有什么功效| 为什么会得梅毒| 牙龈有点发黑是什么原因| 关节退行性变是什么意思| 梅子是什么水果| 大龄补贴需要什么条件| 为什么会反胃想吐| 獭尾肝是什么意思| 洗衣机漏水是什么原因| 甲状腺是什么科| 拔火罐有什么好处| 今天什么年| 突然是什么意思| 为什么有些人显老| 嬴政姓什么| 梦到打架是什么意思| 半夜醒是什么原因| 女为读什么| 什么是双向抑郁| 过会是什么意思| 吃什么能生发| 冻豆腐炖什么好吃| 小苏打和食用碱有什么区别| 神神叨叨是什么意思| 大姨妈来了喝什么好| 牛犇是什么意思| 童养媳是什么意思| 5月1号是什么星座| 火鸡面为什么这么贵| 糖尿病人早餐吃什么| 女人吃榴莲有什么好处| 建设性意见是什么意思| 手指甲白是什么原因| 目赤肿痛吃什么药最好| 一个月来两次大姨妈是什么原因| 牙周炎吃什么药好| 子鼠是什么意思| 与五行属什么| 水当当是什么意思| 舌头疼是什么原因| 乳房有溢液是什么原因| 疱疹长什么样子图片| 血压什么时候最高| 碎银子是什么茶| sorona是什么面料| 凋谢是什么意思| 五月十六是什么星座| 老年人全身无力是什么原因| nk是什么意思| 股票融是什么意思| 又什么又什么的花| 肝病有什么反应| 割韭菜是什么意思| 乳腺看什么科室| 两个立念什么| 股票缺口是什么意思| 铁路12306什么时候放票| 意念是什么意思| 日本为什么偷袭珍珠港| 枭印什么意思| 艾灸是什么东西| 考生号是什么| 脚后跟痛是什么问题| 焦虑症吃什么药好得快| 副高是什么级别| 手指甲扁平是什么原因| 助理研究员是什么职称| 孔夫子搬家的歇后语是什么| 爱情和面包是什么意思| 护士学什么专业| 白蛋白是什么意思| 西四命是什么意思| 控制血糖吃什么食物| 什么时候绝经| 死忠粉是什么意思| 遗精是什么原因引起的| 社保指的是什么| 国家为什么不承认鬼神| 肝经湿热吃什么中成药| 孕妇羊水少吃什么补的快| 李子吃多了有什么坏处| 咽喉老有痰是什么原因| 湛江有什么好玩的| 荨麻疹看什么科| 助听器什么牌子好| 梦到考试是什么意思| 契丹族现在是什么族| 多囊卵巢综合症吃什么食物好| 贤者模式是什么意思| 请丧假需要什么证明| 早上出汗是什么原因| 白月光什么意思| 香港车牌号是什么样子| 格物穷理是什么意思| 小腿出汗是什么原因| 同房出血要做什么检查| 牙龈上火吃什么药| 头痛看什么科| 无事不登三宝殿什么意思| 舌根苔白厚腻是什么原因| 机警是什么意思| 消化内科是看什么病的| 乙醇是什么| 衤叫什么偏旁| 科学的尽头是什么| 胃炎什么症状| 一心向阳下一句是什么| 干部是什么意思| 背痛挂什么科| 突然晕厥是什么原因| 手腕长痣代表什么意思| 憨是什么意思| 青柠檬和黄柠檬有什么区别| 百度Jump to content

第十九届晋江“鞋博会”今开幕,注重国际化、创新性

From Wikipedia, the free encyclopedia
百度 今年,随着广东省海上风电进入高速发展期,新增海上风电开工建设容量预计将达365万千瓦。

The existence of Comet NEOWISE (here depicted as a series of red dots) was discovered by analyzing astronomical survey data acquired by a space telescope, the Wide-field Infrared Survey Explorer.

Data science is an interdisciplinary academic field[1] that uses statistics, scientific computing, scientific methods, processing, scientific visualization, algorithms and systems to extract or extrapolate knowledge from potentially noisy, structured, or unstructured data.[2]

Data science also integrates domain knowledge from the underlying application domain (e.g., natural sciences, information technology, and medicine).[3] Data science is multifaceted and can be described as a science, a research paradigm, a research method, a discipline, a workflow, and a profession.[4]

Data science is "a concept to unify statistics, data analysis, informatics, and their related methods" to "understand and analyze actual phenomena" with data.[5] It uses techniques and theories drawn from many fields within the context of mathematics, statistics, computer science, information science, and domain knowledge.[6] However, data science is different from computer science and information science. Turing Award winner Jim Gray imagined data science as a "fourth paradigm" of science (empirical, theoretical, computational, and now data-driven) and asserted that "everything about science is changing because of the impact of information technology" and the data deluge.[7][8]

A data scientist is a professional who creates programming code and combines it with statistical knowledge to summarize data.[9]

Foundations

[edit]

Data science is an interdisciplinary field[10] focused on extracting knowledge from typically large data sets and applying the knowledge from that data to solve problems in other application domains. The field encompasses preparing data for analysis, formulating data science problems, analyzing data, and summarizing these findings. As such, it incorporates skills from computer science, mathematics, data visualization, graphic design, communication, and business.[11]

Vasant Dhar writes that statistics emphasizes quantitative data and description. In contrast, data science deals with quantitative and qualitative data (e.g., from images, text, sensors, transactions, customer information, etc.) and emphasizes prediction and action.[12] Andrew Gelman of Columbia University has described statistics as a non-essential part of data science.[13] Stanford professor David Donoho writes that data science is not distinguished from statistics by the size of datasets or use of computing and that many graduate programs misleadingly advertise their analytics and statistics training as the essence of a data-science program. He describes data science as an applied field growing out of traditional statistics.[14]

Etymology

[edit]

Early usage

[edit]

In 1962, John Tukey described a field he called "data analysis", which resembles modern data science.[14] In 1985, in a lecture given to the Chinese Academy of Sciences in Beijing, C. F. Jeff Wu used the term "data science" for the first time as an alternative name for statistics.[15] Later, attendees at a 1992 statistics symposium at the University of Montpellier  II acknowledged the emergence of a new discipline focused on data of various origins and forms, combining established concepts and principles of statistics and data analysis with computing.[16][17]

The term "data science" has been traced back to 1974, when Peter Naur proposed it as an alternative name to computer science.[6] In 1996, the International Federation of Classification Societies became the first conference to specifically feature data science as a topic.[6] However, the definition was still in flux. After the 1985 lecture at the Chinese Academy of Sciences in Beijing, in 1997 C. F. Jeff Wu again suggested that statistics should be renamed data science. He reasoned that a new name would help statistics shed inaccurate stereotypes, such as being synonymous with accounting or limited to describing data.[18] In 1998, Hayashi Chikio argued for data science as a new, interdisciplinary concept, with three aspects: data design, collection, and analysis.[17]

Modern usage

[edit]

In 2012, technologists Thomas H. Davenport and DJ Patil declared "Data Scientist: The Sexiest Job of the 21st Century",[19] a catchphrase that was picked up even by major-city newspapers like the New York Times[20] and the Boston Globe.[21] A decade later, they reaffirmed it, stating that "the job is more in demand than ever with employers".[22]

The modern conception of data science as an independent discipline is sometimes attributed to William S. Cleveland.[23] In 2014, the American Statistical Association's Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.[24]

The professional title of "data scientist" has been attributed to DJ Patil and Jeff Hammerbacher in 2008.[25] Though it was used by the National Science Board in their 2005 report "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century", it referred broadly to any key role in managing a digital data collection.[26]

Data science and data analysis

[edit]
summary statistics and scatterplots showing the Datasaurus dozen data set
Example for the usefulness of exploratory data analysis as demonstrated using the Datasaurus dozen data set

Data analysis typically involves working with structured datasets to answer specific questions or solve specific problems. This can involve tasks such as data cleaning and data visualization to summarize data and develop hypotheses about relationships between variables. Data analysts typically use statistical methods to test these hypotheses and draw conclusions from the data.[27]

Data science involves working with larger datasets that often require advanced computational and statistical methods to analyze. Data scientists often work with unstructured data such as text or images and use machine learning algorithms to build predictive models. Data science often uses statistical analysis, data preprocessing, and supervised learning.[28][29]

Cloud computing for data science

[edit]
A cloud-based architecture for enabling big data analytics. Data flows from various sources, such as personal computers, laptops, and smart phones, through cloud services for processing and analysis, finally leading to various big data applications.

Cloud computing can offer access to large amounts of computational power and storage.[30] In big data, where volumes of information are continually generated and processed, these platforms can be used to handle complex and resource-intensive analytical tasks.[31]

Some distributed computing frameworks are designed to handle big data workloads. These frameworks can enable data scientists to process and analyze large datasets in parallel, which can reduce processing times.[32]

Ethical consideration in data science

[edit]

Data science involves collecting, processing, and analyzing data which often includes personal and sensitive information. Ethical concerns include potential privacy violations, bias perpetuation, and negative societal impacts.[33][34]

Machine learning models can amplify existing biases present in training data, leading to discriminatory or unfair outcomes.[35][36]

See also

[edit]

References

[edit]
  1. ^ Donoho, David (2017). "50 Years of Data Science". Journal of Computational and Graphical Statistics. 26 (4): 745–766. doi:10.1080/10618600.2017.1384734. S2CID 114558008.
  2. ^ Dhar, V. (2013). "Data science and prediction". Communications of the ACM. 56 (12): 64–73. doi:10.1145/2500499. S2CID 6107147. Archived from the original on 9 November 2014. Retrieved 2 September 2015.
  3. ^ Danyluk, A.; Leidig, P. (2021). Computing Competencies for Undergraduate Data Science Curricula (PDF). ACM Data Science Task Force Final Report (Report).
  4. ^ Mike, Koby; Hazzan, Orit (20 January 2023). "What is Data Science?". Communications of the ACM. 66 (2): 12–13. doi:10.1145/3575663. ISSN 0001-0782.
  5. ^ Hayashi, Chikio (1 January 1998). "What is Data Science ? Fundamental Concepts and a Heuristic Example". In Hayashi, Chikio; Yajima, Keiji; Bock, Hans-Hermann; Ohsumi, Noboru; Tanaka, Yutaka; Baba, Yasumasa (eds.). Data Science, Classification, and Related Methods. Studies in Classification, Data Analysis, and Knowledge Organization. Springer Japan. pp. 40–51. doi:10.1007/978-4-431-65950-1_3. ISBN 9784431702085.
  6. ^ a b c Cao, Longbing (29 June 2017). "Data Science: A Comprehensive Overview". ACM Computing Surveys. 50 (3): 43:1–43:42. arXiv:2007.03606. doi:10.1145/3076253. ISSN 0360-0300. S2CID 207595944.
  7. ^ Tony Hey; Stewart Tansley; Kristin Michele Tolle (2009). The Fourth Paradigm: Data-intensive Scientific Discovery. Microsoft Research. ISBN 978-0-9825442-0-4. Archived from the original on 20 March 2017.
  8. ^ Bell, G.; Hey, T.; Szalay, A. (2009). "Computer Science: Beyond the Data Deluge". Science. 323 (5919): 1297–1298. doi:10.1126/science.1170411. ISSN 0036-8075. PMID 19265007. S2CID 9743327.
  9. ^ Davenport, Thomas H.; Patil, D. J. (October 2012). "Data Scientist: The Sexiest Job of the 21st Century". Harvard Business Review. 90 (10): 70–76, 128. PMID 23074866. Retrieved 18 January 2016.
  10. ^ Emmert-Streib, Frank; Dehmer, Matthias (2018). "Defining data science by a data-driven quantification of the community". Machine Learning and Knowledge Extraction. 1: 235–251. doi:10.3390/make1010015.
  11. ^ "1. Introduction: What Is Data Science?". Doing Data Science [Book]. O’Reilly. Retrieved 3 April 2020.
  12. ^ Vasant Dhar (1 December 2013). "Data science and prediction". Communications of the ACM. 56 (12): 64–73. doi:10.1145/2500499. S2CID 6107147.
  13. ^ "Statistics is the least important part of data science ? Statistical Modeling, Causal Inference, and Social Science". statmodeling.stat.columbia.edu. Retrieved 3 April 2020.
  14. ^ a b Donoho, David (18 September 2015). "50 years of Data Science" (PDF). Retrieved 2 April 2020.
  15. ^ Wu, C. F. Jeff (1986). "Future directions of statistical research in China: a historical perspective" (PDF). Application of Statistics and Management. 1: 1–7. Retrieved 29 November 2020.
  16. ^ Escoufier, Yves; Hayashi, Chikio; Fichet, Bernard, eds. (1995). Data science and its applications. Tokyo: Academic Press/Harcourt Brace. ISBN 0-12-241770-4. OCLC 489990740.
  17. ^ a b Murtagh, Fionn; Devlin, Keith (2018). "The Development of Data Science: Implications for Education, Employment, Research, and the Data Revolution for Sustainable Development". Big Data and Cognitive Computing. 2 (2): 14. doi:10.3390/bdcc2020014.
  18. ^ Wu, C. F. Jeff. "Statistics=Data Science?" (PDF). Retrieved 2 April 2020.
  19. ^ Davenport, Thomas (1 October 2012). "Data Scientist: The Sexiest Job of the 21st Century". Harvard Business Review. Retrieved 10 October 2022.
  20. ^ Miller, Claire (4 April 2013). "Data Science: The Numbers of Our Lives". New York Times. New York City. Retrieved 10 October 2022.
  21. ^ Borchers, Callum (11 November 2015). "Behind the scenes of the 'sexiest job of the 21st century'". Boston Globe. Boston. Retrieved 10 October 2022.
  22. ^ Davenport, Thomas (15 July 2022). "Is Data Scientist Still the Sexiest Job of the 21st Century?". Harvard Business Review. Retrieved 10 October 2022.
  23. ^ William S. Cleveland (April 2001). "Data Science: an Action Plan for Expanding the Technical Areas of the Field of Statistics". International Statistical Review. 69 (1): 21–26. doi:10.1111/J.1751-5823.2001.TB00477.X. ISSN 0306-7734. JSTOR 1403527. S2CID 39680861. Zbl 1213.62003. Wikidata Q134576907.
  24. ^ Talley, Jill (1 June 2016). "ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science". Amstat News. American Statistical Association.. In 2013 the first European Conference on Data Analysis (ECDA2013) started in Luxembourg the process which founded the European Association for Data Science (EuADS) www.euads.org in Luxembourg in 2015.
  25. ^ Davenport, Thomas H.; Patil, D. J. (1 October 2012). "Data Scientist: The Sexiest Job of the 21st Century". Harvard Business Review. No. October 2012. ISSN 0017-8012. Retrieved 3 April 2020.
  26. ^ "US NSF – NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century". www.nsf.gov. Retrieved 3 April 2020.
  27. ^ James, Gareth; Witten, Daniela; Hastie, Trevor; Tibshirani, Robert (29 September 2017). An Introduction to Statistical Learning: with Applications in R. Springer.
  28. ^ Provost, Foster; Tom Fawcett (1 August 2013). "Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking". O'Reilly Media, Inc.
  29. ^ Han, Kamber; Pei (2011). Data Mining: Concepts and Techniques. ISBN 9780123814791.
  30. ^ Hashem, Ibrahim Abaker Targio; Yaqoob, Ibrar; Anuar, Nor Badrul; Mokhtar, Salimah; Gani, Abdullah; Ullah Khan, Samee (2015). "The rise of "big data" on cloud computing: Review and open research issues". Information Systems. 47: 98–115. doi:10.1016/j.is.2014.07.006.
  31. ^ Qiu, Junfei; Wu, Qihui; Ding, Guoru; Xu, Yuhua; Feng, Shuo (2016). "A survey of machine learning for big data processing". EURASIP Journal on Advances in Signal Processing. 2016 (1). doi:10.1186/s13634-016-0355-x. ISSN 1687-6180.
  32. ^ Armbrust, Michael; Xin, Reynold S.; Lian, Cheng; Huai, Yin; Liu, Davies; Bradley, Joseph K.; Meng, Xiangrui; Kaftan, Tomer; Franklin, Michael J.; Ghodsi, Ali; Zaharia, Matei (27 May 2015). "Spark SQL: Relational Data Processing in Spark". Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. ACM. pp. 1383–1394. doi:10.1145/2723372.2742797. ISBN 978-1-4503-2758-9.
  33. ^ Floridi, Luciano; Taddeo, Mariarosaria (28 December 2016). "What is data ethics?". Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 374 (2083): 20160360. Bibcode:2016RSPTA.37460360F. doi:10.1098/rsta.2016.0360. ISSN 1364-503X. PMC 5124072. PMID 28336805.
  34. ^ Mittelstadt, Brent Daniel; Floridi, Luciano (2016). "The Ethics of Big Data: Current and Foreseeable Issues in Biomedical Contexts". Science and Engineering Ethics. 22 (2): 303–341. doi:10.1007/s11948-015-9652-2. ISSN 1353-3452. PMID 26002496.
  35. ^ Barocas, Solon; Selbst, Andrew D (2016). "Big Data's Disparate Impact". California Law Review. doi:10.15779/Z38BG31 – via Berkeley Law Library Catalog.
  36. ^ Caliskan, Aylin; Bryson, Joanna J.; Narayanan, Arvind (14 April 2017). "Semantics derived automatically from language corpora contain human-like biases". Science. 356 (6334): 183–186. arXiv:1608.07187. Bibcode:2017Sci...356..183C. doi:10.1126/science.aal4230. ISSN 0036-8075.
海兔是什么动物 湿热带下是什么意思 真丝丝绒是什么面料 儿童鸡胸挂什么科 bgb是什么意思
孕早期头晕是什么原因 清酒是什么酒 牙龈出血用什么牙膏 二胎政策什么时候开放的 卤水是什么水
什么血型招蚊子咬 虐狗什么意思 骨质疏松有什么症状表现 紫砂壶泡什么茶最好 杨玉环属什么生肖
ts什么意思网络上 缪斯什么意思 七月五号是什么星座 衣原体感染有什么症状 口腔疱疹用什么药
cot是什么hcv9jop2ns2r.cn 梦见狗咬手是什么意思hcv9jop4ns3r.cn 1948年是什么年helloaicloud.com 孤独症是什么hcv8jop5ns3r.cn 疝气吃什么药效果好hcv9jop2ns2r.cn
棉纶是什么面料hcv9jop4ns9r.cn 练字用什么笔好hcv9jop2ns9r.cn 癫痫病吃什么药luyiluode.com opple是什么牌子hcv9jop8ns1r.cn 皮瓣手术是什么意思hcv8jop5ns7r.cn
京东什么时候优惠最大hcv7jop9ns4r.cn 肌酐测定低是什么意思hcv8jop3ns4r.cn 初秋的天冰冷的夜是什么歌hcv8jop3ns3r.cn 什么什么大地hcv9jop8ns0r.cn 33朵玫瑰花代表什么hcv9jop0ns2r.cn
睡醒口干舌燥是什么原因hcv9jop0ns3r.cn 晓五行属什么hcv8jop8ns7r.cn 防蓝光眼镜有什么用hcv9jop6ns3r.cn 朝霞不出门晚霞行千里是什么意思hcv9jop6ns6r.cn 什么贤什么能jinxinzhichuang.com
百度