MARKETING II
EDITION
CODIFICATION
TABULATION
ANALYSIS
EDITION: Is the process of reviewing the questionnaires or data collection forms to make sure that they are complete, that answers are consistent and that instructions were followed.
CODIFICATION: Is the process of assigning numbers to the answers, to facilitate the process and analysis of results.
TABULATION: Is the process of classifying the answers in different categories, in a simple or crossed way, to analyze results.
ANALYSIS: Is the process of calculating statistics or applying statistical tests or techniques, to facilitate data interpretation and transform data in useful information for Marketing decision making.
UNIVARIATE ANALYSIS: Analysis of one variable at a time, to describe or estimate something about the target population, on the basis of the sample estimation
BIVARIATE ANALYSIS: Analysis of two variables at a time, to establish the relationship or association between them. MULTIVARIATE ANALYSIS: Analysis of more than two variables at a time, to establish the dependence or interdependence relationship between them.
Interval
Nominal
Ordinal
Mean Standard Deviation or Variance Median Interquantile Range Mode Realative and Absolute Frequences by Category
Scale Nominal
Ordinal Interval
Statistic
Mode, Absolute and Relative Frequency Median, Interquartile Range Arithmetic Variance Mean, Standard Deviation,
2 2X S pp sX X
Parameters and Statistics Population Sample Mean or Average x Proportion or Percentage p Variance s N Size n Standard Error of the Mean x Sx Standard Error of the Proportion p Sp
Descriptive
Interval
Dispersion: Standard Deviation, Variance
Descriptive Nominal
Dispersion: Frequency
3. Arihtmetic Mean: It is the sum of the values of the variable, divided by the size of the sample (average). Appropriate for data measured in interval scale or superior.
(relative frequency) of elements that are placed in each value or category of the variable. Appropriate for data measured in nominal or superior scale.
2. Interquartile Range: Is the difference between the value of the variable in Percentile 75 (third quartile) and the value of the variable in Percentile 25 (first quartile).Appropriate for data measured in ordinal or superior scale. It is less sensitive than the standard deviation in the presence of outliers or extreme values. 3. Standard Deviation: Is the square root of the summatory of the difference between each value of the variable and the arithmetic mean, elevated to the square and divided by the sample size minus the number of variables (degrees of freedom). Appropriate for data measured in interval or superior scale.
2. t test: It is used for data measures in interval or superior scale, when n is lower than 30 and the standard deviation of the population or sample is unknown.
DESCRIPTIVE STATISTICS EXCERCISE 1 The diameters of 50 screws were measured and are contained in a SPSS archive.
1. 2. Construct lthe frequency table for the Diameter variable. Calculate the standard deviation, the asymmetry and kurtosis coefficients, the median, the mode, the first and the third quartile, the sixth and seventh decile and the 54, 47 and 82 percentiles. Can we say that the data come from a normal distribution? Construct the absolute frequencies and accumulated absolute frequencies Histograms, Construct the relative frequencies and accumulated relative frequencies Polygons. Study the symmetry of the distribution.
3. 4. 5. 6.
DESCRIPTIVE STATISTICS EXCERCISE 2 The values of the blood ph in 80 individuals are contained in a SPSS archive. 1. Perform an exploratory analysis of the ph data, including a box plot, a stem and leaf diagram, and the simmetry and normality analysis. Construct the absolute and relative frequencies histograms. Construct the absolute and relative accumulated frequencies histograms.
2. 3.