Pick two stock symbol and download one year’s worth of data. For this exercise, we will discard the time data and only focus on the distribution of the opening prices within the year. Let X and Y be the two random variables denoting the opening prices of the two stocks, respectively. Using Python,
(a) Find the mean, median, and mode for X and Y.
(b) Find the variance for X and Y.
(c) Plot the normal distribution for X and Y (using the mean and variance). Plot the histogram for X and Y. Compare the histogram to the normal distribution.
(d) Find the probability of observing an opening price greater than a constant value of k ?
(e) Find the 2-dimensional mean and the covariance matrix for these two variables.
(f) Find the correlation between X and Y.
(g) Draw a scatterplot to show the relationship between X and Y.
What does each of the above task tell you about the data?