《数据挖掘与数据分析数据可视化试题.docx》由会员分享,可在线阅读,更多相关《数据挖掘与数据分析数据可视化试题.docx(18页珍藏版)》请在第壹文秘上搜索。
1、数据挖掘与数据分析,数据可视化试题1. DataMiningisalsoreferredtoasdataanalysisdatadiscoverydatarecoveryDatavisualization2. DataMiningisamethodandtechniqueinclusiveofdataanalysis.datadiscoveryDatavisualizationdatarecovery3. InwhichstepofDataScienceconsumeAlmost80%oftheworkperiodoftheprocedure.AccumulatingthedataAnalyz
2、ingthedataWranglingthedataRecapitulationoftheData4. WhichStepofDataScienceallowsthemodeltoconsistentlyimproveandprovidepunctualperformanceanddeliverapproximateresults.WranglingthedataAccumulatingthedataAnalyzingthedata5. WhichtoolofDataScienceisrobustmachinelearninglibrary,whichallowstheimplementati
3、onofdeeplearning7algorithms.STableauD3.jsApacheSparkTensorFlow6. WhatisthemainaimofDataMining?toobtaindatafromalessnumberofsourcesandtotransformitintoamoreusefulversionofitself.toobtaindatafromalessnumberofsourcesandtotransformitintoalessusefulversionofitself.toobtaindatafromagreatnumberofsourcesand
4、totransformitintoalessusefulversionofitself.toobtaindatafromagreatnumberofsourcesandtotransformitintoamoreusefulversionofitself.7. Inwhichstepofdataminingtheirrelevantpatternsareeliminatedtoavoidcluttering?CleaningthedataEvaluatingthedataConversionofthedataIntegrationofdata8. DataSciencetismainlyuse
5、dforpurposes.Dataminingismainlyusedforpurposes.scientific,businessbusiness,scientificscientific,scientificNone9. Pandasisaonedimensionallabeledarraycapableofholdingdataofanytype(integer,string,float,pythonobjects,etc.).SeriesFramePanelNone10. HowmanyprincipalcomponentsPandasDataFrameconsistsof?42131
6、1. Importantdatastructureofpandasis/areSeriesDataFrameBoth.Noneoftheabove12. Whichofthefollowingcommandisusedtoinstallpandas?pipinstallpandasinstallpandaspippandas13. Whichofthefollowingfunction/methodhelptocreateSeries?series()Series()(CreateSeries()Noneoftheabove14. NumPYstandsfor?NumberingPythonN
7、umberInPythonNumericalPythonNoneOftheabove15. Whichofthefollowingisnotcorrectsub-packagesofSciPy?scipy.integratescipy.sourcescipy.interpolatescipy.signal16. HowtoimportConstantsPackageinSciPy?importscipy.constantsfromscipy.constantsimportscipy.constants.packagefromscipy.constants.package17. involves
8、lookingatanddescribingthedatasetfromdifferentanglesandthensummarizingit?DataFrameDataVisualizationEDAiAlloftheabove18. whatinvolvesthepreparationofdatasetsforanalysisbyremovingirregularitiesinthedatasothattheseirregularitiesdonotaffectfurtherstepsintheprocessofdataanalysisandmachinelearningmodelbuil
9、ding?DataAnalysisEDA!DataFrameNoneoftheabove19. WhatisnotUtilityofEDA?MaximizetheinsightinthedatasetDetectoutliersandanomaliesVisualizationofdataTestunderlyingassumptions20. whatcanhamperthefurtherstepsinthemachinelearningmodelbuildingprocessIfnotperformedproperly?RecapitulationoftheDataAccumulating
10、thedataEDA(正确答案)Noneoftheabove21. WhichplotforEDAtocheckthedependencybetweentwovariables?HistogramsScatterplotsMapsTimeseriesplots22. Whatfunctionwilltellyouthetoprecordsinthedataset?shapehead(正确答案)showalloftheaboce23. whattypeofdataisusefulforinternalpolicymakingandbusinessstrategybuildingforanorga
11、nization?publicdataprivatedatabothNoneoftheabove24. Thefunctioncan“fillinNAvalueswithnon-nulldata?headfillnashapealloftheabove25. Ifyouwanttosimplyexcludethemissingvalues,thenwhatfunctionalongwiththeaxisargumentwillbeuse?llnareplacedropnaisnull26. WhichofthefollowingattributeofDataFrameisusedtodispl
12、aydatatypeofeachcolumninDataFrame?DtypesDTypesdlypesdatatypes27. WhichofthefollowingfunctionisusedtoloadthedatafromtheCSVfileintoaDataFrame?read.csv()readcsv()read_csv()(正确涔案)Read_csv()28. howtoDisplayfirstrowofdataframeDF?print(DF.head(1)print(DF0:1)print(DF.iloc0:1)Alloftheabove29. Spreadfunctioni
13、sknownasinspreadsheets?pivotunpivotcastorder30. extractasubsetofrowsfromadataframbasedonlogicalconditions?renamefiltersetsubset31. WccanshifttheDataFrame,sindexbyacertainnumberofperiodsusingtheMethod?melt()merge()tail()shift()俚确答案)32. WecanjoinmeltedDataFramesintooneAnalyticalBaseTableusingthefuncti
14、on.join()append()merge()truncate()33. Whatmethosisusedtoconcatenatedatasetsalonganaxis?concatenate()Oz-*Phi、(正确答案)addOmerge()34. Rowscanbeifthenumberofmissingvaluesisinsignificant,asthiswouldnotimpacttheoverallanalysisresults.deletedupdatedaddedall35. Thereisaspecificreasonbehindthemissingvalue.What
15、standsforMissingnotatrandomMCARMARMNARNoneoftheabove36. Whileplottingdata,somevaluesofonevariablemaynotliebeyondtheexpectedrange,butwhenyouplotthedatawithsomeothervariable,thesevaluesmayliefarfromtheexpectedvalue.Identifythetypeofoutliers?UnivariateoutliersMultivariateoutliers1;)ManyVariateOutlinersNoneoftheabove37. ifnumericvaluesarestoredasstrings,thenitwouldnotbepossibletoCalculatemetricssu