Question 1

A population

Accepted Answer

is the set of all the possible items to be observed.
example: Whilst investigating the height of males in Wales, the population would be the height of all the males in Wales.

Question 2

Random sampling:

Accepted Answer

this method gives every item of the population an equal chance of selection. This can be done in various ways for example by simply picking out of a hat or by using a random number generator on a calculator.

Question 3

Stratified sampling:

Accepted Answer

some populations are naturally split into a number of strata (kind of like sub groups). We can separate the strata and find what proportion of the population is in each stratum. We can then select a random sample from each stratum proportional to its size

Question 4

measure of central tendency

Accepted Answer

is just a mathematical and rather posh way of saying "averages".

Question 5

The Mode

Accepted Answer

It is the piece or pieces of data that occur most often.

Question 6

The Median

Accepted Answer

The median is the middle piece of data when the data is in numerical order.->With 50 pieces of data, even, we must find halfway and the next value. In this case, the 25th and 26th values. The median will be halfway between these values.

Question 7

The Mean

Accepted Answer

The mean of a set of data is the sum of all the values divided by the number of values.
-  Ex
x=-----
    n

Question 8

Ex

Accepted Answer

just means the sum of all the x’s - for instance, add all the bits of data together.

Question 9

grouped frequency table->find MEAN

Accepted Answer

We can however, find an estimate of the mean by assuming each footballer is the height halfway within his interval

Question 10

(f)

Accepted Answer

means frequency

Question 11

"o- "definition

Accepted Answer

standart deviation->gives a measure of how the data is dispersed about the mean->the lower the standard deviation, the more compact our data is around the mean

Question 12

Formula standart deviation

Accepted Answer

square root of 
((the sum of x2 - ((mean of x)squared)) 
divided by the number of units

Question 13

"o- 2" definition

Accepted Answer

The variance is the square of the standard deviation.

Question 14

The variance

Accepted Answer

is the square of the standard deviation.

Question 15

m-and-leaf diagram

Accepted Answer

presenting it in an easy and quick way to help spot patterns in the spread of data->They are best used when we have a relatively small set of data and want to find the median or quartiles

Question 16

Box-and-whisker plots (or boxplots)

Accepted Answer

These are very basic diagrams used to highlight the quartiles and median to give a quick and clear way of presenting the spread of the data.

Question 17

Negatively skewed distribution:

Accepted Answer

There is a greater proportion of the data at the upper end.

Question 18

Positively skewed distribution:

Accepted Answer

There is a greater proportion of the data at the lower end.

Question 19

Outliers

Accepted Answer

Values of data are usually labelled as outliers if they are more than 1.5 times of the inter-quartile range from either quartile.

Question 20

Histograms

Accepted Answer

Histograms are best used for large sets of data, especially when the data has been grouped into classes. They look a little similar to bar charts or frequency diagrams. ->In histograms, the frequency of the data is shown by the area of the bars and not ju

Question 21

frequency density

Accepted Answer

The vertical axis of a histogram is labelled

Question 22

Cumulative frequency

Accepted Answer

is kind of like a running total. We add each frequency to the ones before to get an ‘at least’ total.

Question 23

cumulative frequency curve.

Accepted Answer

cumulative frequencies (‘at least’ totals) are plotted against the upper class boundaries to give us a cumulative frequency curve.

Question 24

P(A)

Accepted Answer

The probability that an event, A, will happen is written as

Question 25

complement of A

Accepted Answer

The probability that the event A, does not happen is called the complement of A and is written as A'

Question 26

mutually exclusive

Accepted Answer

Two events are mutually exclusive if the event of one happening excludes the other from happening->they both cannot happen simultaneously->When a fair die is rolled find the probability of rolling a 4 or a 1.
P(4 u 1) = P(4) + P(1)=>1/6 +1/6=>1/3

Question 27

Independent Events

Accepted Answer

Two events are independent if the occurrence of one happening does not affect the occurrence of the other.->P(A and B) = P(A) ' P(B)
           ->P(A  n B) = P(A) ' P(B)
Independent events will involve ‘and’, ‘both’,"either"->means multiply

Question 28

How do you write Find the probability that given he falls P(F) it was a rainy day P(R).

Accepted Answer

P(R I F)

Question 29

discrete random variable

Accepted Answer

A random variable is a variable which takes numerical values and whose value depends on the outcome of an experiment. It is discrete if it can only take certain values.

Question 30

random variable

Accepted Answer

is a variable which takes numerical values and whose value depends on the outcome of an experiment

Question 31

exclusive events Rewrite -> Sum?

Accepted Answer

E P(X = x) = 1 -> always sum to 1

Question 32

Probability density function

Accepted Answer

Sometimes we are given a formula to calculate probabilities. We call this the probability density function of X or the p.d.f. of X.

Question 33

Cumulative distribution function

Accepted Answer

‘Cumulative’ gives us a kind of running total so a cumulative distribution function gives us a running total of probabilities within our probability table. The cumulative distribution function, F(x) of X is defined as: F(x) = P(X < x)

Question 34

Expectation

Accepted Answer

The expectation is the expected value of X, written as E(X) or sometimes as u->The expectation is what you would expect to get if you were to carry out the experiment a large number of times and calculate the ‘mean’..

Question 35

uniform distribution

Accepted Answer

This is a ‘special’ discrete random variable as all the probabilities are the same.->it is possible to calculate the expectation by using the symmetry of the table. The expectation, E(X) is calculated by finding the halfway point.

Question 36

symmetry of the table

Accepted Answer

With uniform distributions it is possible to calculate the expectation by using the symmetry of the table. The expectation, E(X) is calculated by finding the halfway point.

Question 37

Expectation of any function of x

Accepted Answer

E[f(x)] =  € f(x)P(X = x)

Question 38

E(aX + b)  Equals

Accepted Answer

aE(X) + b

Question 39

E(a) Equals

Accepted Answer

a

Question 40

variance

Accepted Answer

is a measure of how spread out the values of X would be if the experiment leading to X were repeated a number of times.

Question 41

E(X)

Accepted Answer

-> mean -> u -> Example of Calculation->(0 x 0.1) + (1 x 0.2) + (2 x 0.5) + (3 x 0.2)

Question 42

Var(aX) Equals

Accepted Answer

a2Var(X)

Question 43

Var(aX + b) Equals

Accepted Answer

a2Var(X)

This means by knowing just the variance, Var(X), we can calculate other variances quickly.
Example:

Question 44

The Standard Deviation

Accepted Answer

The square root of the Variance is called the Standard Deviation of X. standard deviation is given the symbol  o-

Question 45

convert any normal distribution of X into the normal distribution of Z

Accepted Answer

(X - u) / o-

Question 46

Normal Distribution Graph

Accepted Answer

much of the data is gathered around the mean. The distribution has a characteristic ‘bell shape’ symmetrical about the mean. ->The area of the bell shape = 1.

Question 47

The standard deviation

Accepted Answer

is an important measure of the spread of our data. The greater the standard deviation, the greater our spread of data.

Question 48

§

Accepted Answer

this Greek letter just describes the area under the bell from that point!

Question 49

line of best fit’

Accepted Answer

Any line of best fit must go through the mean of x, and  the mean of y.

Question 50

linear correlation

Accepted Answer

If all (or nearly all) of these points seem to lie in a straight line

Question 51

Equation of regression line

Accepted Answer

(blank)

Question 52

Regression Line x on y->Formula for b:

Accepted Answer

Sxy / Syy

Question 53

Regression Line y on x->Formula for b:

Accepted Answer

Sxy / Sxx

Question 54

Independent/dependent variables

Accepted Answer

With the above data, x looks to be controlled, where y appears to be dependent on an experiment and x. In this case, we say that x is an independent variable and y a dependent variable. As x appears controlled and accurate we only need to calculate the re

Question 55

product moment correlation coefficient

Accepted Answer

r -> is a measure of the degree of scatter.->will lie between -1 and 1.

Question 56

"r"

Accepted Answer

The product moment correlation coefficient, r, is a measure of the degree of scatter.->will lie between -1 and 1.

Question 57

Calculate E(X)

Accepted Answer

€x times P(X = x)  / or € f(x)P(X = x)

Maths

Statistics

"Know" box contains:
Time elapsed:
Retries: