版權(quán)說(shuō)明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
1、計(jì)量經(jīng)濟(jì)學(xué)導(dǎo)論ch4課件計(jì)量經(jīng)濟(jì)學(xué)導(dǎo)論ch4課件Statistical inference in the regression modelHypothesis tests about population parametersConstruction of confidence intervals Sampling distributions of the OLS estimatorsThe OLS estimators are random variablesWe already know their expected values and their variancesHowever, f
2、or hypothesis tests we need to know their distributionIn order to derive their distribution we need additional assumptionsAssumption about distribution of errors: normal distributionMultiple RegressionAnalysis: InferenceStatistical inference in the rAssumption MLR.6 (Normality of error terms)indepen
3、dently ofIt is assumed that the unobservedfactors are normally distributed around the population regression function.The form and the variance of the distribution does not depend onany of the explanatory variables.It follows that:Multiple RegressionAnalysis: InferenceAssumption MLR.6 (Normality ofDi
4、scussion of the normality assumptionThe error term is the sum of many“ different unobserved factorsSums of independent factors are normally distributed (CLT)Problems:How many different factors? Number large enough?Possibly very heterogenuous distributions of individual factorsHow independent are the
5、 different factors?The normality of the error term is an empirical questionAt least the error distribution should be close“ to normalIn many cases, normality is questionable or impossible by definitionMultiple RegressionAnalysis: InferenceDiscussion of the normality asDiscussion of the normality ass
6、umption (cont.)Examples where normality cannot hold:Wages (nonnegative; also: minimum wage)Number of arrests (takes on a small number of integer values)Unemployment (indicator variable, takes on only 1 or 0)In some cases, normality can be achieved through transformations of the dependent variable (e
7、.g. use log(wage) instead of wage)Under normality, OLS is the best (even nonlinear) unbiased estimatorImportant: For the purposes of statistical inference, the assumption of normality can be replaced by a large sample sizeMultiple RegressionAnalysis: InferenceDiscussion of the normality asTerminolog
8、yTheorem 4.1 (Normal sampling distributions)Under assumptions MLR.1 MLR.6:The estimators are normally distributed around the true parameters with the variance that was derived earlierThe standardized estimators follow a standard normal distributionGauss-Markov assumptions“Classical linear model (CLM
9、) assumptions“Multiple RegressionAnalysis: InferenceTerminologyUnder assumptions MTesting hypotheses about a single population parameterTheorem 4.1 (t-distribution for standardized estimators)Null hypothesis (for more general hypotheses, see below)Under assumptions MLR.1 MLR.6:If the standardization
10、 is done using the estimated standard deviation (= standard error), the normal distribution is replaced by a t-distributionThe population parameter is equal to zero, i.e. after controlling for the other independent variables, there is no effect of xj on y Note: The t-distribution is close to the sta
11、ndard normal distribution if n-k-1 is large.Multiple RegressionAnalysis: InferenceTesting hypotheses about a sint-statistic (or t-ratio)Distribution of the t-statistic if the null hypothesis is trueGoal: Define a rejection rule so that, if it is true, H0 is rejected only with a small probability (=
12、significance level, e.g. 5%)The t-statistic will be used to test the above null hypothesis. The farther the estimated coefficient is away from zero, the less likely it is that the null hypothesis holds true. But what does far“ away from zero mean? This depends on the variability of the estimated coe
13、fficient, i.e. its standard deviation. The t-statistic measures how many estimated standard deviations the estimated coefficient is away from zero. Multiple RegressionAnalysis: Inferencet-statistic (or t-ratio)The t-Test against .Reject the null hypothesis in favour of the alternative hypothesis if
14、the estimated coef-ficient is too large“ (i.e. larger than a criti-cal value).Construct the critical value so that, if the null hypothesis is true, it is rejected in,for example, 5% of the cases.In the given example, this is the point of the t-distribution with 28 degrees of freedom that is exceeded
15、 in 5% of the cases.! Reject if t-statistic greater than 1.701Multiple RegressionAnalysis: InferenceTesting against one-sided alternatives (greater than zero)Test againsExample: Wage equationTest whether, after controlling for education and tenure, higher work experience leads to higher hourly wages
16、Standard errorsTest against .One would either expect a positive effect of experience on hourly wage or no effect at all.Multiple RegressionAnalysis: InferenceExample: Wage equationStandardExample: Wage equation (cont.)The effect of experience on hourly wage is statistically greater than zero at the
17、5% (and even at the 1%) significance level.“t-statisticDegrees of freedom;here the standard normal approximation appliesCritical values for the 5% and the 1% significance level (these are conventional significance levels). The null hypothesis is rejected because the t-statistic exceeds the critical
18、value.Multiple RegressionAnalysis: InferenceExample: Wage equation (cont.)Test against .Reject the null hypothesis in favour of the alternative hypothesis if the estimated coef-ficient is too small“ (i.e. smaller than a criti-cal value).Construct the critical value so that, if the null hypothesis is
19、 true, it is rejected in,for example, 5% of the cases.In the given example, this is the point of the t-distribution with 18 degrees of freedom so that 5% of the cases are below the point.! Reject if t-statistic less than -1.734Multiple RegressionAnalysis: InferenceTesting against one-sided alternati
20、ves (less than zero)Test againsExample: Student performance and school sizeTest whether smaller school size leads to better student performanceTest against .Do larger schools hamper student performance or is there no such effect?Percentage of studentspassing maths testAverage annual tea-cher compens
21、ationStaff per one thou-sand studentsSchool enrollment(= school size)Multiple RegressionAnalysis: InferenceExample: Student performance aExample: Student performance and school size (cont.)One cannot reject the hypothesis that there is no effect of school size on student performance (not even for a
22、lax significance level of 15%).t-statisticDegrees of freedom;here the standard normal approximation appliesCritical values for the 5% and the 15% significance level.The null hypothesis is not rejected because the t-statistic is not smaller than the critical value.Multiple RegressionAnalysis: Inferen
23、ceExample: Student performance aExample: Student performance and school size (cont.)Alternative specification of functional form:Test against .R-squared slightly higher Multiple RegressionAnalysis: InferenceExample: Student performance aExample: Student performance and school size (cont.)The hypothe
24、sis that there is no effect of school size on student performance can be rejected in favor of the hypothesis that the effect is negative.t-statisticCritical value for the 5% significance level ! reject null hypothesisHow large is the effect?+ 10% enrollment ! -0.129 percentage points students pass t
25、est(small effect)Multiple RegressionAnalysis: InferenceExample: Student performance aTesting against two-sided alternativesTest against .Reject the null hypothesis in favour of the alternative hypothesis if the absolute value of the estimated coefficient is too large.Construct the critical value so
26、that, if the null hypothesis is true, it is rejected in,for example, 5% of the cases.In the given example, these are the points of the t-distribution so that 5% of the caseslie in the two tails.! Reject if absolute value of t-statistic is less than -2.06 or greater than 2.06Multiple RegressionAnalys
27、is: InferenceTesting against two-sided alteExample: Determinants of college GPALectures missed per weekThe effects of hsGPA and skipped are significantly different from zero at the 1% significance level. The effect of ACT is not significantly different from zero, not even at the 10% significance lev
28、el. For critical values, use standard normal distributionMultiple RegressionAnalysis: InferenceExample: Determinants of colleStatistically significant“ variables in a regressionIf a regression coefficient is different from zero in a two-sided test, the corresponding variable is said to be statistica
29、lly significant“If the number of degrees of freedom is large enough so that the nor-mal approximation applies, the following rules of thumb apply:statistically significant at 10 % level“statistically significant at 5 % level“statistically significant at 1 % level“Multiple RegressionAnalysis: Inferen
30、ceStatistically significant“ vaGuidelines for discussing economic and statistical significanceIf a variable is statistically significant, discuss the magnitude of the coefficient to get an idea of its economic or practical importanceThe fact that a coefficient is statistically significant does not n
31、ecessa-rily mean it is economically or practically significant!If a variable is statistically and economically important but has the wrong“ sign, the regression model might be misspecified If a variable is statistically insignificant at the usual levels (10%, 5%, 1%), one may think of dropping it fr
32、om the regressionIf the sample size is small, effects might be imprecisely estimated so that the case for dropping insignificant variables is less strongMultiple RegressionAnalysis: InferenceGuidelines for discussing econTesting more general hypotheses about a regression coefficientNull hypothesist-
33、statisticThe test works exactly as before, except that the hypothesized value is substracted from the estimate when forming the statisticHypothesized value of the coefficientMultiple RegressionAnalysis: InferenceTesting more general hypotheseExample: Campus crime and enrollmentAn interesting hypothe
34、sis is whether crime increases by one percent if enrollment is increased by one percentThe hypothesis is rejected at the 5% levelEstimate is different from one but is this difference statistically significant?Multiple RegressionAnalysis: InferenceExample: Campus crime and enroComputing p-values for
35、t-testsIf the significance level is made smaller and smaller, there will be a point where the null hypothesis cannot be rejected anymoreThe reason is that, by lowering the significance level, one wants to avoid more and more to make the error of rejecting a correct H0The smallest significance level
36、at which the null hypothesis is still rejected, is called the p-value of the hypothesis testA small p-value is evidence against the null hypothesis because one would reject the null hypothesis even at small significance levelsA large p-value is evidence in favor of the null hypothesisP-values are mo
37、re informative than tests at fixed significance levelsMultiple RegressionAnalysis: InferenceComputing p-values for t-testsHow the p-value is computed (here: two-sided test)The p-value is the significance level at which one is indifferent between rejecting and not rejecting the null hypothesis. In th
38、e two-sided case, the p-value is thus the probability that the t-distributed variable takes on a larger absolute value than the realized value of the test statistic, e.g.:From this, it is clear that a null hypothesis is rejected if and only if the corresponding p-value is smaller than the significan
39、ce level.For example, for a significance level of 5% the t-statistic would not lie in the rejection region.value of test statisticThese would be the critical values for a 5% significance levelMultiple RegressionAnalysis: InferenceHow the p-value is computed (hCritical value oftwo-sided testConfidenc
40、e intervalsSimple manipulation of the result in Theorem 4.2 implies thatInterpretation of the confidence intervalThe bounds of the interval are randomIn repeated samples, the interval that is constructed in the above way will cover the population regression coefficient in 95% of the cases Lower boun
41、d of the Confidence intervalUpper bound of the Confidence intervalConfidence levelMultiple RegressionAnalysis: InferenceCritical value ofConfidence inConfidence intervals for typical confidence levelsRelationship between confidence intervals and hypotheses tests reject in favor of Use rules of thumb
42、Multiple RegressionAnalysis: InferenceConfidence intervals for typicExample: Model of firms R&D expendituresSpending on R&DAnnual salesProfits as percentage of salesThe effect of sales on R&D is relatively precisely estimated as the interval is narrow. Moreover, the effect is significantly different
43、 from zero because zero is outside the interval.This effect is imprecisely estimated as the in-terval is very wide. It is not even statisticallysignificant because zero lies in the interval.Multiple RegressionAnalysis: InferenceExample: Model of firms R&D eTesting hypotheses about a linear combinati
44、on of parametersExample: Return to education at 2 year vs. at 4 year collegesYears of education at 2 year collegesYears of education at 4 year collegesTest against .A possible test statistic would be:The difference between the estimates is normalized by the estimated standard deviation of the differ
45、ence. The null hypothesis would have to be rejected if the statistic is too negative“ to believe that the true difference between the parameters is equal to zero.Multiple RegressionAnalysis: InferenceTesting hypotheses about a linInsert into original regressionImpossible to compute with standard reg
46、ression output becauseAlternative methodUsually not available in regression outputDefine and test against .a new regressor (= total years of college)Multiple RegressionAnalysis: InferenceInsert into original regressioEstimation resultsThis method works always for single linear hypothesesTotal years
47、of collegeHypothesis is rejected at 10% level but not at 5% levelMultiple RegressionAnalysis: InferenceEstimation resultsTotal years Testing multiple linear restrictions: The F-testTesting exclusion restrictions Years in the leagueAverage number of games per yearSalary of major lea-gue base ball pla
48、yerBatting averageHome runs per yearRuns batted in per yearagainstTest whether performance measures have no effect/can be exluded from regression.Multiple RegressionAnalysis: InferenceTesting multiple linear restriEstimation of the unrestricted modelNone of these variabels is statistically significa
49、nt when tested individuallyIdea: How would the model fit be if these variables were dropped from the regression?Multiple RegressionAnalysis: InferenceEstimation of the unrestrictedEstimation of the restricted modelTest statisticThe sum of squared residuals necessarily increases, but is the increase
50、statistically significant?The relative increase of the sum of squared residuals when going fromH1 to H0 follows a F-distribution (ifthe null hypothesis H0 is correct) Number of restrictionsMultiple RegressionAnalysis: InferenceEstimation of the restricted mRejection rule (Figure 4.7)A F-distributed
51、variable only takes on positive values. This corresponds to the fact that the sum of squared residuals can only increase if one moves from H1 to H0.Choose the critical value so that the null hypo-thesis is rejected in, for example, 5% of the cases, although it is true.Multiple RegressionAnalysis: InferenceRejection rule (Figure 4.7)A FTest decision in exampleDiscussionThe three variables are jointly significant“They were not significant when tested individuallyThe likely reason is multicollinearity between them Number of restrictions to be tested Degrees of freedo
溫馨提示
- 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025年浙江建筑安全員《A證》考試題庫(kù)及答案
- 2025江西省安全員-C證考試(專職安全員)題庫(kù)及答案
- 2025陜西省建筑安全員《B證》考試題庫(kù)及答案
- 2025浙江省安全員A證考試題庫(kù)附答案
- 二十四節(jié)氣之立春課件模板
- 《蛋糕制作》課件
- 工藝管道培訓(xùn)課件
- 《海南??谙母?jìng)》課件
- 單位管理制度收錄大合集【人員管理篇】
- 有機(jī)玻璃操作箱行業(yè)行業(yè)發(fā)展趨勢(shì)及投資戰(zhàn)略研究分析報(bào)告
- 小豆苗產(chǎn)品介紹門(mén)診端
- 空調(diào)安裝和維修的培訓(xùn)
- ??祱?zhí)法記錄儀解決方案
- 液化氣供應(yīng)站安全管理制度和營(yíng)業(yè)制度
- GB/T 21385-2008金屬密封球閥
- GB/T 18994-2003電子工業(yè)用氣體高純氯
- 文言文閱讀訓(xùn)練:《三國(guó)志-武帝紀(jì)》(附答案解析與譯文)
- (完整版)招聘面試方案設(shè)計(jì)與研究畢業(yè)論文設(shè)計(jì)
- 調(diào)休單、加班申請(qǐng)單
- 肉制品生產(chǎn)企業(yè)名錄296家
- 規(guī)劃設(shè)計(jì)收費(fèi)標(biāo)準(zhǔn)
評(píng)論
0/150
提交評(píng)論