版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報或認(rèn)領(lǐng)
文檔簡介
DataAnalyticsinAccountingandBusinessChapter1Wherewearenow1.DataAnalytics2.DataPreparationandCleaning3.ModelingandEvaluation4.Visualization5.TheModernAudit6.AuditAnalytics7.KeyPerformanceIndicators8.FinancialStatementAnalyticsObjectivesLO1-1Whatisdataanalytics?LO1-2Howdoesdataanalyticsaffectbusiness?LO1-3Whydoesdataanalyticsmattertoaccountants?LO1-4Whatisthedataanalyticsprocess?LO1-5Whatdataanalyticskillsdoaccountantsneed?LO1-6HandsonexampleoftheIMPACTmodel.IntheIMPACTcycle,we’renowgoingtolookatIdentifyingtheQuestions.(We’lldiscussthistoday.)IdentifythequestionsMasterthedataPerformtestplanAddressandrefineresultsCommunicateinsightsTrackoutcomesExhibit1-1TheIMPACTCycleWhatisDataAnalytics?LO1-1DataAnalyticsandBigDataDataAnalyticsistheprocessofevaluatingdatawiththepurposeofdrawingconclusionstoaddressbusinessquestions.EffectiveDataAnalyticsprovidesawaytosearchthroughlargestructuredandunstructureddatatoidentifyunknownpatternsorrelationships.BigDatareferstodatasetswhicharetoolargeandcomplextobeanalyzedtraditionally.Rememberthe3V’s:VolumereferstosizeVelocityreferstofrequencyVarietyreferstodifferenttypesDataAnalyticsandBigDataRememberthe3V’s:VolumereferstosizeVelocityreferstofrequencyVarietyreferstodifferenttypesQ:Howcouldabankusedataanalyticstounderstandcustomercreditworthiness?Howdoesdataanalyticsaffectbusiness?LO1-2Bythenumbers:85%ofCEOsputahighvalueonDataAnalytics.80%ofCEOsplacedataminingandanalysisasthesecond-mostimportantstrategictechnology.BusinessanalyticstopsCEO’slistofpriorities.DataAnalyticscouldgenerateupto$3trillioninvalueperyear.Q.Howcoulddataanalyticsbeusedtoreduceacompany’sovertimecosts?Whydoesdataanalyticsmattertoaccountants?LO1-3Howdoesdataanalyticsaffectauditing?Dataanalyticswillenhanceauditquality.Dataanalyticsenablesenhancedaudits,expandedservices,andaddedvaluetoclients.Externalauditorswillstayengagedbeyondtheaudit.Howdoesdataanalyticsaffectfinancialreporting?Betterestimatesofcollectability,write-downs,etc.ManagerscanbetterunderstandthebusinessenvironmentthroughsocialmediaIdentifyrisksandopportunitiesthroughanalysisofInternetsearchesHowdoesdataanalyticsaffecttaxes?TaxstrategyandplanningUnderstandingoftaxconsequencesofinternationaltransactions,investment,mergersandacquisitionsBetterorganizationoftaxtablesandothertaxdataQ.Whatpatternsmightanauditorfindthroughdataanalytics?Whatisthedataanalyticsprocess?1-4TheIMPACTmodelIdentifythequestionsMasterthedataPerformthetestplanAddressandrefineresultsCommunicateinsightsTrackoutcomesIdentifythequestionsMasterthedataPerformtestplanAddressandrefineresultsCommunicateinsightsTrackoutcomesExhibit1-1TheIMPACTCycleStep1:IdentifytheQuestionsUnderstandthebusinessproblemsthatneedtobeaddressed.Areemployeescircumventinginternalcontrolsoverpayments?Arethereanysuspicioustravelandentertainmentexpenses?Howcanweincreasetheamountofadd-onsalesofadditionalgoodstoourcustomers?Areourcustomerspayingusinatimelymanner?Howcanwepredicttheallowanceforloanlossesforourbankloans?Howcanwefindtransactionsthatareriskyintermsofaccountingissues?Whoauthorizeschecksabove$100,000?Howcanerrorsbeidentified?Step2:MastertheDataKnowwhatdataareavailableandhowtheyrelatetotheproblem.InternalERPsystemsExternalnetworksanddatawarehousesDatadictionariesExtraction,transformation,andloadingDatavalidationandcompletenessDatanormalizationDatapreparationandscrubbingTransactionsTransactionID[PK]CustomerID[FK]DateDescriptionAmountCustomerCustomerID[PK]NameAddressCityStateStep3:PerformtheTestPlanSelectanappropriatemodeltofindatargetvariable.ClassificationRegressionSimilaritymatchingClusteringCo-occurrencegroupingProfilingStep4:AddressandRefineResultsIdentifyissueswiththeanalyses,possibleissues,andrefinethemodelAskfurtherquestionsExplorethedataRerunanalysesStep5:CommunicateInsightsCommunicateeffectivelyusingclearlanguageandvisualizations:DashboardsStaticreportsSummariesStep6:TrackOutcomesFollowupontheresultsoftheanalysis.Howfrequentlyshouldtheanalysisbeperformed?Havetheanalyticschanged?Whatarethetrends?Q.Let’ssaywearetryingtopredicthowmuchmoneycollegestudentsspendonfastfoodeachweek.Whatwouldbetheresponse,ordependent,variable?Whatwouldbeexamplesofindependentvariables?Whatdataanalyticskillsdoaccountantsneed?LO1-5Accountantsneedtobeableto:Articulatebusinessproblems.Communicatewithdatascientists.Drawappropriateconclusions.Presentresultsinanaccessiblemanner.Developananalyticsmindset.Aswellasbecomfortablewith:DatascrubbinganddatapreparationDataqualityDescriptivedataanalysisDataanalysisthroughdatamanipulationDefineandaddressproblemsthroughstatisticalanalysisDatavisualizationanddatareportingQ.WhatotherskillsmightbeusefulinperformingDataAnalytics?Hands-onExampleoftheIMPACTModelLO1-6Step1:IdentifytheQuestionAssumeyouwanttogetaloantopayoffsomecreditcarddebt.LendingClubisapeer-to-peerlenderthatconnectsindividuallenderswithborrowers.UsetheIMPACTmodeltodeterminewhetheryou’relikelytogetaloan.“Givenmyborrowerprofile,canIexpectLendingClubtoextendaloantome?”Whatotherquestionsmightyouask?Step2:MastertheDataLendingClubisaU.S.-based,peer-to-peerlendingcompany,headquarteredinSanFrancisco,California.LendingClubfacilitatesbothborrowingandlendingbyprovidingaplatformforunsecuredpersonalloansbetween$1,000and$35,000.Theloanperiodisforeither3or5years.Dataavailable:Approvedloans(LoanStats)Rejectedloanstats(RejectStats)Step2:MastertheDataPersonalloanshavegrownsince2010.Themajorityareforrefinancing.Exhibit1-4LendingClubStatisticsExhibit1-5LendingClubStatisticsbypurposeStep2:MastertheDataRejectedStatsDataDictionary SampledatafromRejectedStatsAmountRequestedApplicationDateLoanTitleRisk_ScoreDebt-To-IncomeRatioZipCodeStateEmploymentLength10005/26/2007WeddingCoveredbutNoHoneymoon69310%481xxNM4years10005/26/2007ConsolidatingDebt70310%010xxMA<1year110005/27/2007Wanttoconsolidatemydebt71510%212xxMD1year60005/27/2007waksman69838.64%017xxMA<1year15005/27/2007mdrigo5099.43%209xxMD<1year150005/27/2007Trinfiniti6450%105xxNY3years100005/27/2007NOTIFYiInc69310%210xxMD<1year39005/27/2007ForJustin.70010%469xxIN2years30005/28/2007title?69410%808xxCO4years25005/28/2007timgerst57311.76%407xxKY4years39005/28/2007needtoconsolidate71010%705xxLA10+years10005/28/2007sixstrings68010%424xxKY1year30005/28/2007bmoore511068810%190xxPA<1year15005/28/2007MHarkins70410%189xxPA3yearsRejectStatsFileDescriptionAmountRequestedThetotalamountrequestedbytheborrowerApplicationDateThedatewhichtheborrowerappliedLoanTitleTheloantitleprovidedbytheborrowerRisk_ScoreForapplicationspriortoNovember5,2013theriskscoreistheborrower'sFICOscore.ForapplicationsafterNovember5,2013theriskscoreistheborrower'sVantagescore.Debt-To-IncomeRatioAratiocalculatedusingtheborrower’stotalmonthlydebtpaymentsonthetotaldebtobligations,excludingmortgageandtherequestedLCloan,dividedbytheborrower’sself-reportedmonthlyincome.ZipCodeThefirst3numbersofthezipcodeprovidedbytheborrowerintheloanapplication.StateThestateprovidedbytheborrowerintheloanapplicationEmploymentLengthEmploymentlengthinyears.Possiblevaluesarebetween0and10where0meanslessthanoneyearand10meanstenormoreyears.PolicyCodepubliclyavailablepolicy_code=1
newproductsnotpubliclyavailablepolicy_code=2Step3:PerformtheTestPlanPerformthreeanalysestopredictwhetherwereceivealoan:1.Thedebt-to-income(DTI)ratiosandnumberofrejectedloans2.Thelengthofemploymentandnumberofrejectedloans3.Thecredit(orrisk)scoreandnumberofrejectedloansStep3:PerformtheTestPlanForDTI,wesetbuckets:High=debt>20%ofincomeMid=debtis10-20%ofincomeLow=debt<10%ofincomeHereweseeaPivotTablewithresultsonRejectStatsStep3:PerformtheTestPlanForemploymentlength,wesetbucketsonnumberofyears.HereweseeaPivotTablewithresultsonRejectStatsStep3:PerformtheTestPlanForcreditscore,wesetbuckets:Excellent:800-850Verygood:750-799Good:700-749Fair:650-699Poor:600-649Verybad:300-599Step4:AddressandRefineResultsFromthePivotTableanalysis,wefindthatoftherejectedloans:82%haveeitherverybad,poor,orfaircredit48%hadahighDTIratio76%hadacredithistoryofoneyearorlessStep4:AddressandRefineResultsIfwelookatinteractionsofcreditscore
&DTI&employmentlengthinaPivotTable,weseetheyarefairlypredictive.Only89or645,414loanswererejectedwithfromthetopbucketsfromeach.Step5:CommunicateInsightsThePivotTablesprovideasimplevisual.Additionalvisualizationsortoolsmayprovidequickanalysisbythoseevaluatingtheloans.AnothergoalistosharetheresultsinplainEnglish:“IfIhavegoodcredit,lowdebt-to-income,andalongemploymentlength,itisverylikelythatmyloanwillbeaccepted.”Step6:TrackOutcomesExtendingthisanalysistofutureperiodswillhelpusdeterminewhetherthesefactorsholdtrueorifthereissomeshiftinthefuture.Weattempttousepastperformancetopredictfutureresults,butthatmaynotalwaysholdtrue.Whenfactorschange,werepeattheIMPACTcycle.Q.WouldyouexpectloansfromCaliforniatobemoreorlesslikelyapproved?Howcouldyoutestthat?SummaryWithdataallaroundus,businessesandaccountantsarelookingtoDataAnalyticstoextractthevaluethatthedatamightpossess.DataAnalyticsischangingtheauditandthewaythataccountantslookforrisk.Now,auditorscanconsider100percentofthetransactionsintheiraudittesting.Itisalsohelpfulinfindingtheanomalousorunusualtransactions.DataAnalyticsisalsochangingthewayfinancialaccounting,managerialaccounting,andtaxesaredoneatacompany.TheIMPACTcycleisameansofdoingDataAnalyticsthatgoesallthewayfromidentifyingthequestion,tomasteringthedata,toperformingdataanalysesandcommunicatingresults.Itisrecursiveinnature,suggestingthatasquestionsareaddressed,newimportantquestionsmayemergethatcanbeaddressedinasimilarway.Eightdataapproachesaddressdifferentwaysoftestingthedata:classification,regression,similaritymatching,clustering,co-occurrencegrouping,profiling,linkprediction,anddatareduction.Theseareexplainedinmoredetailinchapter3.Dataanalyticskillsneededbyanalytic-mindedaccountantsarespecifiedandareconsistentwiththeIMPACTcycle,includingthefollowing:Developananalyticsmindset.
Datascrubbinganddatapreparation.
Dataquality.
Descriptivedataanalysis.
Dataanalysisthroughdatamanipulation.
Defineandaddressproblemsthroughstatisticaldataanalysis.
Datavisualizationanddatareporting.DataPreparation
andCleaningChapter2Wherewearenow1.DataAnalytics2.DataPreparationandCleaning3.ModelingandEvaluation4.Visualization5.TheModernAudit6.AuditAnalytics7.KeyPerformanceIndicators8.FinancialStatementAnalyticsObjectivesLO2-1Howaredatausedandstoredintheaccountingcycle?LO2-2Howaredatastoredinrelationaldatabases?LO2-3Whatdoesitmeantoextract,transform,andload?IntheIMPACTcycle,we’regoingtolookatMasteringtheData.IdentifythequestionsMasterthedataPerformtestplanAddressandrefineresultsCommunicateinsightsTrackoutcomesExhibit1-1TheIMPACTCycleHowaredatausedandstoredintheaccountingcycle?LO2-1Understandthedatabylookingathowitisorganized.Datacanbefoundthroughoutvarioussystems.Inmostcases,youneedtoknowwhichtablesandattributescontaintherelevantdata.UnifiedModelingLanguage(UML)isonewaytounderstanddatabases.FGI_ProductProduct_Code[PK]Product_Description…Sales_SubsetSales_Order_ID[PK]Product_Code[FK]Customer_ID[FK]…CustomerCustomer_ID[PK]Customer_Name…Howaredatastoredinrelationaldatabases?LO2-2Relationaldatabasesensurethatdata:Arecomplete,orincludealldata.Aren’tredundant,sotheydon’ttakeuptoomuchspace.Followbusinessrulesandinternalcontrols.Aidcommunicationandintegrationofbusinessprocesses.Therearefourtypesofattributes.Primarykeysareuniqueidentifiers.Foreignkeysareattributesthatpointtoaprimarykeyinanothertable.Compositekeysareacombinationoftwoforeignkeysusedforlineitems.Descriptiveattributesincludeeverythingelse.SupplierTableSupplierIDSupplierNameSupplierAddressSupplierType1NorthernBreweryHomebrewSupply6021LyndaleAveS12HopsDirectLLC686GreenValleyRoad13TheHomeBrewery455E.TownshipSt.14ThePayrollCompany408N.WaltonBlvd2Examplesoftwotables,attributes,anddata.NoticethePK-FKrelationship.PurchaseOrderTablePONo.DateCreatedByApprovedBySupplierID(FK)178711/1/2017100110101178811/1/2017100510102178911/8/2017100210101179011/15/2017100510101SupplierTableSupplierID(PK)SupplierNameSupplierAddressSupplierType1NorthernBreweryHomebrewSupply6021LyndaleAveS12HopsDirectLLC686GreenValleyRoad13TheHomeBrewery455E.TownshipSt.14ThePayrollCompany408N.WaltonBlvd2Datadictionariesdefinewhatdataareacceptable.Foreachattribute,welearn:Whattypeofkeyitis.Whatdataarerequired.Whatdatacanbestoredinit.Howmuchdataisstored.SupplierTableDataDictionaryPrimaryorForeignKey?RequiredAttributeNameDescriptionDataTypeDefaultValueFieldSizeNotesPKYSupplierIDUniqueIdentifierforeachSupplier
Numbern/a10
NSupplierNameFirstandLastNameShortTextn/a30
FKNSupplierTypeTypeCodeforDifferentSupplierCategories
NumberNull101:Vendor2:MiscQ.Whatisthepurposeoftheprimarykey?Aforeignkey?Anon-keyattribute?Whatdoesitmeantoextract,transform,andload?LO2-2TheRequestingdataisaniterativepracticeinvolving5steps:Step1:Determinethepurposeandscopeofthedatarequest.Step2:Obtainthedata.Step3:Validatethedataforcompletenessandintegrity.Step4:Cleanthedata.Step5:Loadthedatafordataanalysis.Step1:DeterminethepurposeandscopeofthedatarequestAskafewquestionsbeforebeginningtheprocess:Whatisthepurposeofthedatarequest?Whatdoyouneedthedatatosolve?Whatbusinessproblemwillitaddress?Whatriskexistsindataintegrity(e.g.,reliability,usefulness)?Whatisthemitigationplan?Whatotherinformationwillimpactthenature,timing,andextentofthedataanalysis?Step2:ObtainthedataHowwilldataberequestedand/orobtained?Doyouhaveaccesstothedatayourself,ordoyouneedtorequestadatabaseadministratorortheinformationsystemsdepartmenttoprovidethedataforyou?Ifyouneedtorequestthedata,isthereastandarddatarequestformthatyoushoulduse?Fromwhomdoyourequestthedata?Wherearethedatalocatedinthefinancialorotherrelatedsystems?Whatspecificdataareneeded(tablesandfields)?Whattoolswillbeusedtoperformdataanalytictestsorproceduresandwhy?Step2:ObtainthedataThereareacoupleoptions:ObtaindatathroughadatarequesttotheITdepartment.Obtaindatayourself.ExampleStandardDataRequestFormSECTION1:REQUESTDETAILSRequestorName:RequestorContact
Number:RequestorEmailAddress:Pleaseprovideadescriptionoftheinformationneeded(indicatewhichtablesandwhichfieldsyourequire):Whatwilltheinformationbeusedfor?Frequency(circleone)One-OffAnnuallyTermlyOther:___________Formatyouwishthedatatobedeliveredin(circleone):Spreadsheet
WordDocumentTextFile
Other:____________RequestDate:RequiredDate:IntendedAudience:Customer(ifnotrequestor):ExampleStandardDataRequestFormSECTION2:TOBECOMPLETEDBYINFORMATIONSYSTEMSDEPARTMENTRequestNumberDateReceivedReceivedbyAssignedtoInitialreviewcomments
(discussionwithclient—revisionsrequired?agreementtoproceed?etc.)Workinprogresscomments
(additionalnotesandcommentsduringproductionofdata)SECTION3:COMPLETIONDETAILSDateCompleted
DateProvidedRevisionsRequiredFeedbackfromclient
(ifapplicable)ObtainthedatayourselfIfyouhavedirectaccesstoadatawarehouse,youcanuseSQLandothertoolstopullthedatayourself.Identifythetablesthatcontaintheinformationyouneed.Youcandothisbylookingthroughthedatadictionaryortherelationshipmodel.Identifywhichattributes,specifically,holdtheinformationyouneedineachtable.Identifyhowthosetablesarerelatedtoeachother.Step3:ValidatethedataforcompletenessandintegrityChancesarethedatayourequestisn’tcomplete.Beforeyoubegin,doalittleworktomakesureyourdataarevalid:ComparethenumberofrecordsComparedescriptivestatisticsfornumericfieldsValidateDate/TimefieldsComparestringlimitsfortextfieldsStep4:CleanthedataOnceyouhavevaliddata,thereisstillsomeworkthatneedstobedonetomakesureitisconsistentandreadyforanalysis:RemoveheadingsorsubtotalsCleanleadingzeroesandnonprintablecharactersFormatnegativenumbersCorrectinconsistenciesacrossdata,ingeneralStep5:LoadthedatafordataanalysisFinally,youcannowimportyourdataintothetoolofyourchoiceandexpectthefunctionstoworkproperly.Q.Whatarefourcommonissueswithdatathatmustbefixedbeforeanalysiscantakeplace?SummaryThefirststepintheIMPACTcycleistoidentifythequestionsthatyouintendtoanswerthroughyourdataanalysisproject.Onceadataanalysisproblemorquestionhasbeenidentified,thenextstepintheIMPACTcycleismasteringthedata,whichcanbebrokendowntomeanobtainingthedataneededandpreparingitforanalysis.Inordertoobtaintherightdata,itisimportanttohaveafirmgraspofwhatdataareavailabletoyouandhowthatinformationisstored.Dataareoftenstoredinarelationaldatabase,whichhelpstoensurethatanorganization’sdataarecompleteandtoavoidredundancy.Relationaldatabasesaremadeupoftableswithuniquelyidentifiedrecords(thisisdonethroughprimarykeys)andarerelatedthroughtheusageofforeignkeys.Toobtainthedata,youwilleitherhaveaccesstoextractthedatayourselforyouwillneedtorequestthedatafromadatabaseadministratorortheinformationsystemsteam.Ifthelatteristhecase,youwillcompleteadatarequestform,indicatingexactlywhichdatayouneedandwhy.Onceyouhavethedata,theywillneedtobevalidatedforcompletenessandintegrity—thatis,youwillneedtoensurethatallofthedatayouneedwereextractedandthatalldataarecorrect.Sometimeswhendataareextracted,someformattingorsometimesevenentirerecordswillgetlost,resultingininaccuracies.Correctingtheerrorsandcleaningthedataisanintegralstepinmasteringthedata.Finally,afterthedatahavebeencleaned,theremaybeonelaststepofmasteringthedata,whichistoloadthemintothetoolthatwillbeusedforanalysis.Often,thecleaningandcorrectingofdataoccurinExcelandtheanalysiswillalsobedoneinExcel.Inthiscase,thereisnoneedtoloadthedataelsewhere.However,ifyouintendtodomorerigorousstatisticalanalysisthanExcelprovides,orifyouintendtodomorerobustdatavisualizationthancanbedoneinExcel,itmaybenecessarytoloadthedataintoanothertoolfollowingthetransformationprocess.ModelingandEvaluationChapter3Wherewearenow1.DataAnalytics2.DataPreparationandCleaning3.ModelingandEvaluation4.Visualization5.TheModernAudit6.AuditAnalytics7.KeyPerformanceIndicators8.FinancialStatementAnalyticsObjectivesLO3-1Whatdoyouneedtoknowaboutdatamodels?LO3-2Whataresomeunsupervisedandsupervisedapproaches?LO3-3Howdoyouperformprofiling?LO3-4Howdoyouperformdatareduction?LO3-5Howdoyouperformregression?LO3-6Howdoyouperformclassification?LO3-7Howdoyouperformclustering?IntheIMPACTcycle,we’renowgoingtolookatPerformingtheTestPlanandAddressingtheResults.IdentifythequestionsMasterthedataPerformtestplanAddressandrefineresultsCommunicateinsightsTrackoutcomesExhibit1-1TheIMPACTCycleWhatdoyouneedtoknowaboutdatamodels?LO3-1Whatdoyouneedtoknowaboutdatamodels?Atargetisanexpectedattributeorvaluethatwewanttoevaluate.Example:FraudscoreInterestrateAclassisamanuallyassignedcategoryappliedtoarecordbasedonanevent.Example:Accept/RejectFraud/NotfraudWhatdoyouneedtoknowfirst?Anunsupervisedapproachisusedwhenyoudon’thaveaspecificquestion.Example:“Doourvendorsformnaturalgroupsbasedonsimilarattributes?”Asupervisedapproach
isusedwhenyouaretryingtopredictafutureoutcomebasedonhistoricaldata.Example:“Willanewvendorshipalargeorderontime?”Whataresomeunsupervisedapproaches?Clustering
–findundiscoverednaturalgroupingsinthedataCo-occurrencegrouping–eventsthathappentogetherProfiling
–identifytypicalbehaviorinthedataDatareduction–filterorgroupthedatatosimplifytheanalysisWhataresomesupervisedapproaches?Classification
–predictwhetherdatabelongstooneclassoranotherSimilaritymatching–groupdatabyattributesRegression
–predictaspecificvalueLinkprediction–socialnetworksCausalmodeling–aneventinfluencesanotherUseaflowcharttoidentifyanappropriateapproach.Q.Whatisthemaindifferencebetweensupervisedandunsupervisedmethods?Howdoyouperformprofiling?LO3-2Profilingreliesongatheringsummarystatisticsandidentifyingoutliers.Identifytheobjectsoractivityyouwanttoprofile.Determinethetypesofprofilingyouwanttoperform.Setboundariesorthresholdsfortheactivity.Interprettheresultsandmonitortheactivityand/orgeneratealistofexceptions.Followuponexceptions.Whataresomeexamplesofprofiling?Internalauditorsanalyzetravelandentertainmentexpensesforviolationsofinternalcontrols.Managersuseprofilingtocomparevariancesfromtargetranges.Whataresomeexamplesofprofiling?Inthecontinuousaudit,anauditormayuseBenford’sLawtoevaluatethefrequencydistributionofthefirstdigitsfromalargesetofnumericaldata.Q.Profilingisusedinlawenforcementforoffenderorcriminalprofiling.Howdoesthiscomparewithprofilingaccountingdata?Howdoyouperformdatareduction?LO3-3Datareductionisusedtofilterresults.1.Identifytheattributeyouwouldliketoreduceorfocuson.2.Filtertheresults.3.Interprettheresults.4.Followupontheresults.Whataresomeexamplesofdatareduction?InternalauditorsmaywanttolocatepaymentsmadetoSquarevendors.FinancialstatementanalystswilltakeXBRLinstancedocumentsandfilteronspecifictags.Q.HowmightthedatareductionapproachbeusedtosimplifyT&Eexpenses?Howdoyouperformregression?LO3-4Regressionallowstheaccountanttodevelopmodelstopredictexpectedoutcomes.Identifythevariablesthatmightpredictanoutcome.Determinethefunctionalformoftherelationship.Identifytheparametersofthemodel.Dependentvariable=f(independentvariables)Whataresomeexamplesofregression?Inmanagerialaccounting,regressionmaypredictemployeeturnover:Employeeturnover=f(currentprofessionalsalaries,healthoftheeconomy[GDP],salariesofferedbyotheraccountingfirmsorbycorporateaccounting,etc.)Inauditing,regressionmaybeusedtodeterminetheappropriatenessofallowanceaccounts:Allowanceforloanlosesamount=f(currentagedloans,loantype,customerloanhistory,collectionssuccess)Q.Regressionisusedtopredictstockreturnsfollowingarestatementofpastearnings.Whatfactors(independentvariables)doyouthinkmightpredictthechangeinstockprice(dependentvariable)?Howdoyouperformclassification?LO3-5Thegoalofclassificationistopredictwhetheranindividualweknowverylittleaboutwillbelongtooneclassoranother.Identifytheclassesyouwishtopredict.Manuallyclassifyanexistingsetofrecords.Selectasetofclassificationmodels.Divideyourdataintotrainingandtestingsets.Generateyourmodel.Interprettheresultsandselectthe“best”model.Whatelsedoyouneedtoknowaboutclassification?Trainingdataareexistingdatathathavebeenmanuallyevaluatedandassignedaclass.Testdataareexistingdatausedtoevaluatethemodel.Decisiontreesareusedtodividedataintosmallergroups.Decisionboundariesmarkthesplitbetweenoneclassandanother.Whatelsedoyouneedtoknowaboutclassification?Pruning
removesbranchesfromadecisiontreetoavoidoverfittingthemodel.Whatelsedoyouneedtoknowaboutclassification?Linearclassifiersareusefulforrankingitemsratherthansimplypredictingclassprobability.Theseareusefulfordeterminingthereallyimportantvalues,suchasvaluablecustomers,orwhichtransactionsaremostlikelyfraudulent.Whatelsedoyouneedtoknowaboutclassification?Supportvectormachineisadiscriminatingclassifierthatisdefinedbyaseparatinghyperplanethatworksfirsttofindthewidestmargin(orbiggestpipe)andthenworkstofindthemiddleline.Howdoweevaluateclassifiers?Trytoavoidoverfitting,ormodelsthataretooaccurate.Theyareactuallyprettybadapredictingafutureobservation.L
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 福建師范大學(xué)《景觀設(shè)計一》2022-2023學(xué)年第一學(xué)期期末試卷
- 福建師范大學(xué)《環(huán)境規(guī)劃》2023-2024學(xué)年第一學(xué)期期末試卷
- 火災(zāi)自動報警系統(tǒng)施工及驗收規(guī)范題目
- 幼兒園美術(shù)特色教學(xué)活動總結(jié)
- 2024屆四川成都實驗高級中學(xué)高三第一次調(diào)研考試(數(shù)學(xué)試題理)試卷
- (2024新)人教版道德與法治七年級上冊第10課保持身心健康《愛護(hù)身體》教學(xué)課件
- 回延安課件教學(xué)課件
- 2024年鄭州客運資格證應(yīng)用能力考試內(nèi)容是什么
- 2024年長治汽車客運從業(yè)資格考試
- 2024年長沙客車駕駛員考試答案
- 移動商務(wù)內(nèi)容運營(吳洪貴)項目三 移動商務(wù)運營內(nèi)容的策劃和生產(chǎn)
- 大學(xué)辦公室管理制度公司管理制度
- 妊娠合并癥 妊娠合并心臟?。▼D產(chǎn)科護(hù)理課件)
- 35KV集電線路首次送電啟動方案
- 畫法幾何及水利土建制圖習(xí)題答案
- 臨床流行病學(xué)智慧樹知到答案章節(jié)測試2023年南方醫(yī)科大學(xué)
- 2021年高考浙江卷英語試題(含答案)
- 幼兒園教師職業(yè)道德第七單元
- GB/T 5530-2005動植物油脂酸值和酸度測定
- GB/T 14194-2017壓縮氣體氣瓶充裝規(guī)定
- GB 7947-2006人機(jī)界面標(biāo)志標(biāo)識的基本和安全規(guī)則導(dǎo)體的顏色或數(shù)字標(biāo)識
評論
0/150
提交評論