Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống
1
/ 21 trang
THÔNG TIN TÀI LIỆU
Thông tin cơ bản
Định dạng
Số trang
21
Dung lượng
681,23 KB
Nội dung
ECON1193 – BUSINESS STATISTIC Assignment – Individual Case Study – Inferential Statistic Topic 2: Age dependency ratio, old (%working-age population) Student’s name Student ID Lecturer: Campus: Mai Minh Ngoc S3730089 Pham Thi Minh Thuy RMIT Hanoi Summarize descriptive statistic In the previous assignment, I used Descriptive Statistic to conclude about the ratio of old-age dependency (% working-age population) of nations that Brazil and Kenya This shows an evidence in the tables: Central Tendency Mean Median Mode Brazil 8.103964641 7.552690108 #N/A Kenya 5.275461043 5.251903511 #N/A Table 1: Central Tendency of Brazil and Kenya Coefficient of Country Range Interquartile Range Variance Standard Deviation Variation Brazil 5.376663755 2.605952993 2.650233071 1.627953645 20.088% Kenya 1.92380989 0.97702102 0.35234497 0.593586531 11.25% Table 2: The measures of Variation of Brazil and Kenya From two tables above about the age dependency ratio, old in Brazil and Kenya from 1980 to 2016 in Assignment This assignment answered the topic question is “Is the age dependency ratio and income are related?”, which the answer is Yes We know that, Brazil have higher income than Kenya, and furthermore, all the number of Brazil much hinger than Kenya about the Mean, Median, Variance, Standard Deviation and CV With the table 3, all the percentage of age dependency and income always depend to each other The higher income, the higher ratio of old age dependency and vice versa P(HI) 0.4 P(HI|LO) P(HI|HO) P(MI) 0.457142857 P(MI|LO) 0.761904762 P(MI|HO) P(LI) 0.142857143 P(LI|LO) 0.238095238 P(LI|HO) Table The probability of the percentage age dependency and GNI Word count: 134 Confidence intervals a Calculate confidence intervals In this section, I’m going to calculate about Confidence Interval of age dependency ratio, old (% working-age population), gross national income (GNI) per capita (current US$) and life expectancy at birth, total (years) of 35 countries The Sample size of Dataset given is 35 countries, n = 35 In this part 2a, all calculation will choose Significant level at 5%, � = 0.05 → confidence level is 95% (1 – � = 0.95) In this part, we know about the information that: - n = 35 - population standard deviation � is unknown → So, I’m going to use t-table o Substitute the sample standard deviation, S o n = 35, degree of freedom, d.F = n - = 35 - = 34 o Lower, Upper tail = → α = 0.05 = 0.025 tn-1, (α)⁄2 = 2.0322 Because of σ is unknown, so confidence interval estimated by: �= ´X ± tn-1 S √n t is the critical value of the t distribution, o d.F = n-1 o Each tail = α Age dependency ratio, old (% working-age population) o n = 35 o d.F = n – = 35 – = 34 o tn-1, (α)⁄2 = 2.0322 → �= ´X + tn-1 �= ´X - tn-1 S √n = 16.121 + (2.0322) 10.280 √35 = 19.652 S √n = 16.121 - (2.0322) 10.280 √35 = 12.590 12.590 ≤ � ≤ 19.6522 As a result, we are 95% confident that the true mean age dependency ratio, old is between 12.590 and 19.652 This suggest that we are 95% sure that the ratio of elder (% working-age population) in the world in 2015 is between 12.590% and 19.652% Gross National Income (GNI) per capita (Current US$) o n = 35 o d.F = n – = 35 – = 34 o tn-1, (α)⁄2 = 2.0322 → �= ´X + tn-1 �= ´X - tn-1 S √n = 19594 + (2.0322) 23429.862 √ 35 = 27642.262 S √n = 19594 - (2.0322) 11545.737 ≤ � ≤ 27642.262 23429.862 √ 35 = 11545.737 We are 95% confident that the true mean of GNI per capita (current US$) is between 11545.737 and 27642.262 This is suggesting that in 2015, we are sure that 95% average person in the world earn US$ 11545.737 and US$ 27642.262 Life expectancy at birth, total (years) o n = 35 o d.F = n – = 35 – = 34 o tn-1, (α)⁄2 = 2.0322 → �= ´X + tn-1 �= ´X - tn-1 S √n S √n = 73.328 + (2.0322) = 73.328 - (2.0322) 9.098 √ 35 9.098 √ 35 = 76.453 = 70.202 70.202 ≤ � ≤ 76.453 As a result, we are 95% confident that the mean of life expectancy at birth of the world in 2015 is between 70.202 and 76.453 This suggest we are 95% sure that an average newborn expectancy in the world in 2015 is between 70.202 and 76.453 years b Discuss, what assumption are required to calculates these confidence intervals The sample size of dataset given is 35, n = 35 is large enough (n ≥ 30), we can apply Central Limit Theorem and sampling distribution for the mean can be approximated by normal distribution Hence, we can use the normal distribution and the t-table without any assumptions of the population c Discuss the possible impact on the confidence intervals, if the sample size increase double According to the formula of confidence interval is �1 = ´X ± tn-1 S (with n1 = n) √n With confidence interval �1 = ´X The new confidence interval is �2 = Width confidence interval �2 = ´X - tn-1 S √n ´X ± tn-1 + tn-1 ´X + tn-1 - S √2 n S √n = 2.(tn-1 S ) √n S (with n2 = 2n) √2 n - ´X + tn-1 S √2 n = 2.(tn-1 S √2 n ) If the sample size (n increases, the standard error S √n will decrease because (n) is a denominator The data is more concentrated around the mean The width confidence interval (± tn-1 S ) will decrease √n √ times Therefore, the confidence intervals will be more accurate Interpretation: Increasing the sample size mean we are studying 70 nations instead of 35 Hence, the result of the world ratio of old-age dependency is represent better of all 195 countries in the world A 95% CI, n=70 will be narrower than a 95%, n=35 Therefore, we can easily predict more accurately about the percentage retired people (% working-age population) in the world Word count: 174 Hypothesis testing a) According to the report from United National the world average old age dependency ratio is 12.9% in 2013 The data from the report of United National shows that the average percentage of elder dependency in 2013 is 12.9% (%working-age population) However, the calculation in part shows that the average of age dependency ratio, old is about 16.121% (%working-age population) It’s much higher than the percentage in 2013, (16.121% ˃ 12.9%) Therefore, the ratio of the world’s elder dependence expected in the future will increase Age dependency ratio, old (%working-age population) Mean 16.12118821 Table 4: The mean of age dependency ratio, old (%working-age population) Word count: 67 b) Hypothesis testing Choosing Confidence level is 95% → level of significant (�) = 0.05 No assumption required because sample size given is 35, n = 35 is large enough (n ≥ 30), so we can apply Central Limit Theorem We can be using Ztable or t-table - We called the Null Hypothesis is H0, and the Alternative Hypothesis is H1 The age dependency ratio, old of the world in 2013 is 12.9% H0: ≤ 12.9 (old age dependency ratio is less than or equal to 12.9%) H1: ˃ 12.9 (old age dependency ratio is more than 12.9%) → This is one tailed test – upper tail Since the population standard deviation, � is unknown → we are using t-table: The test statistic we use is: t = o t= ´ X−μ S √n = 16.121 −12.9 10.280 √ 35 ´ X−μ S √n = 1.853 (1) � = 0.05 because we are using one tailed test, so the � will present for the upper rejection region o d.F = n – = 35 -1 = 34 o � = 0.05 → t n-1, α = 1.6909 (the critical value of t is 1.6909) (2) (1) > (2), as the test statistic falls into in the rejection region (t = 1.853 > 1.6909) → Hence the Null Hypothesis (H0) is rejected Interpretation: As reject H0, hence with 95% level of confidence it can be concluded that the age dependency ratio of old people of the world will not decrease in the future As H0 is rejected, hence type I error might have been committed → � = P(Type I) = 5% o Despite the test result, there are still have 5% of chance in the future, the ratio of elder in the world might decrease more than 12.9% Minimizing type I error, we need to increase the confidence level = (1 – �) to decrease level of significant (�) Critical value will be more extreme and lager than the test statistic (CV > 1.853), hence we not reject H Regression analysis a) Determine the dependence variable (DV) and the independent variables (IV) of the dataset Dependence variable (DV) o Age dependency ratio, old (%working-age population) Independent variable (IV) o Gross Nation Income (GNI) per capita (current US$) o Domestic general government health expenditure per capita, PPP (current international $) o Life expectancy at birth, total (years) o Smoking prevalence, total (age 15+) b) For each independent variables (IV), the relationship between DV and IV b.1) Age dependency ratio, old (%working-age population) & Gross National Income, per capita (current US$) o Expected relationship: I expected to see a positive linear between age dependency ratio, old (DV) with the GNI per capita (IV) Age dependency ratio, old (%working-age population) Age dependency ratio, old & GNI per capita 40 35 f(x) = x + 9.62 R² = 0.57 30 25 20 15 10 0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 GNI per capita (US$) Figure […] The scatter plot of age dependency ratio, old & GNI per capita (US$) o Comment: According to the scatter plot, there is a little week linear relationship between the ratio of elder dependency and GNI per person; (0