If sample size > 30
and the distribution is non-normal then:
- If population variance is known, we use z-statistic
- If population variance is unknown, we use t-statistic. Even z-statistic is acceptable, but t-statistic is more common.
Application in Finance
Let's understand this with stock market returns:
Case 1: Normal Distribution
We can analyze the daily returns of S&P 500 index. The returns approximate a normal distribution:
Case 2: Non-Normal Distribution
Let's take one more example, this time using Bitcoin daily returns, which are typically non-normally distributed (showing high kurtosis and skewness):
-
If our sample size is small, say we're analyzing 20 days of returns (n < 30)
:
- We cannot create reliable confidence intervals
- We need to use non-parametric methods instead
-
If we're analyzing 60 days of returns (n > 30):
- We can create confidence intervals due to Central Limit Theorem
- We will use t-statistic as population variance is unknown
Key Statistical Tests
Before applying these methods, it's crucial to:
-
Test for normality using:
- Jarque-Bera test (common in finance)
- Shapiro-Wilk test
- Visual inspection of Q-Q plots
-
Consider sample size:
- Small samples require stricter assumptions
- Larger samples are more forgiving due to Central Limit Theorem