Analysis Courses
Please note that I am currently offering any of my Face to Face Courses as an online course, delivered through Microsoft Teams (or similar tool of your choice). The content will remain the same, although the way the material is presented, and how activities are completed, may change.
Principles of Analysis: Descriptive and inferential Statistics
Aims:
To introduce descriptive statistics including levels of measurement, central tendency and dispersion
To introduce concepts of rates (Prevalence and Incidence)
To discuss data presentation methods
To give a basic introduction to inferential statistics - statistical tests for 2 groups, P values and confidence limits
To discuss the importance of data quality
To discuss the principles of statistical process control
Content:
This session gives a high level overview of analytical methods. Unlike other sessions there is no group activity for participants. Instead the session is delivered in a discursive manner, and aims to give a taster of some of the more detailed sessions on offer.
Duration:
3 hours
Pre-requisites
Basic Numeracy
Data Visualization
Aims:
To emphasize the importance of telling stories through data visualization
To highlight the balance between statistics, design and aesthetics
To explore the common data visualization methods
To learn how to create effective presentations
To understand the importance of promoting and updating your data visualizations
Content:
This session will teach participants a number of ways to effectively visualise their data, using a number of different tools and techniques. Using Excel and PowerPoint will be the basis of this training course, but the use of other software will also be discussed in depth through the training. The participants will learn how to incorporate statistics, design and aesthetics of both the data and the final visualisation. This training course starts with the basic graphs and charts and evolves through dashboards, Infographics and multimedia presentations. The true power of visual presentation of the data will be explored through the examples from different industries, from production, through marketing to graphic design and multimedia.
Duration:
5 days
Pre-requisites
Basic Numeracy and familiarity with Microsoft Office
Statistics and Data Insight
Aims:
To assess a research paper for statistical validity
To understand how we currently use data, how it is used by the media and how it can be misrepresented and misinterpreted
Looking at outcomes and how they are measured including Nominal, ordinal and interval measures
Measures of central tendency (mean, median, mode)
Measures of dispersion (Range, IQR, Standard Deviation)
Percentages, rates and ratios (Prevalence and Incidence)
Data distributions, and the importance of normality
Presentation rules including Chart types and their relationship to data types
The two major study designs for two groups, Independent measures and Repeated measures
Identifying whether the correct statistical test has been used based on the stated outcome
Interpreting the test results (the role of chance) including P values and Confidence Intervals
Understanding why the study result isn’t the most important statistic, including te role of power, and the importance of the outcome
Understanding the advantages of stratification of data
Understanding standardisation of data - when is it used, and the difference between direct and indirect standardisation
Understanding the advantages of looking at time series data, including run charts, moving averages and other smoothing methods
Understanding correlations and cause and effect, using scatterplots and the line of best fit (and its relation to simple forecasting– simple forecasting
Content:
This session differs from other statistics courses on offer as it has few exercise, and is instead content rich. Consequently a lot of content is covered in a relatively short time. A research paper will be sent out before the sessions that participant will be asked to read and attempt to interpret the study results. The morning and afternoon sessions will not only introduce participants to data and statistical concepts but will enable them to interpret the results of the paper and draw meaningful conclusions. The morning session will introduce participants to data and how it is commonly used / misused. We will look at how data is measured, and how it can be summarised. We will look at how data is distributed and why this is important to understanding statistical testing. Data presentation methods will also be looked at, describing the correct way of presenting different types of data. Finally the two key study designs will be introduced. The afternoon session will introduce the most common statistical tests related to the study designs and data types we have discussed. We will discuss how the role of chance affects results and how to interpret them. We will then look at other ways to present and interpret data, including stratification and standardisation of data, time series data and correlation of variables. Lastly we will revisit the results tables from our research paper and use what we have learned to interpret the results.
Duration:
1 Day
Pre-requisites
Basic Numeracy
An Introduction to Descriptive Statistics
Aims:
To present the key concepts of data measurement
To introduce summary statistics and methods of calculation
To introduce the concepts of prevalence and incidence
To outline measures of central tendency and dispersion
To introduce the 'bell shaped curve' (the normal distribution)
To investigate methods of data presentation
Content:
Starting from first principles, we will discuss basic methods for summarising data. The session aims to introduce the concepts of central tendency and dispersion in a participative manner, enabling students to calculate such summary statistics themselves. The normal distribution will be introduced, alongside concepts such as skewness and kurtosis. Methods of presenting data will also be introduced, with students gaining insight into the appropriateness(or not) of each technique.
Duration:
3 hours
Pre-requisites
Basic Numeracy
An Introduction to Inferential Statistics
Aims:
To introduce the concept of sampling and standard error
To relate the concept of standard error to that of confidence limits
To outline the principles behind formulating and testing statistical hypotheses.
To explore correlations and the principles of association and causation
Content:
Building on the previous introduction to descriptive statistics sessions, participants will be introduced to more detailed statistical methods, and the concepts behind them. Sampling issues will be addressed, in particular how they relate to the underlying population. Students will be given the opportunity to explore these relationships and develop a more detailed understanding of statistical confidence. Statistical hypotheses will be discussed, including here relationship to one-tailed and two-tailed statistical tests. Finally, correlations will be discussed alongside the principles of association and causation. Real data will be used to demonstrate the role of confounding within such associations.
Duration:
3 hours
Prerequisites:
Basic numeracy
Completion of “Introduction to Descriptive Statistics” session or similar understanding
Analysing Data: Statistical tests for 2 groups
Aims:
To understand the importance of organising and presenting data
To understand how the achieved sample size may affect results
To apply statistical principals to investigation of data
To formulate a statistical hypotheses based on the research aim
To apply relevant statistical tests to the data
Content:
This session builds on the theory introduced in the previous statistics sessions and attempts to set this in a practical context. In particular, through group work and participative learning, we aim to demonstrate how the choice of analysis links both to the underlying question, via statistical hypotheses, and choice of presentation. The session will highlight some of the common pitfalls and errors made in data analysis, and how data can be manipulated to give misleading conclusions.
Duration:
3 hours
Pre-requisites
Basic Numeracy
Completion of "Introduction to Descriptive Statistics" session, or similar understanding
Completion of "Introduction to Inferential Statistics" session, or similar understanding
Key Statistical Concepts 1 (1 Day)
Aims:
To present the key concepts of data measurement
To introduce summary statistics and methods of calculation
To introduce the concepts of prevalence and incidence
To outline measures of central tendency and dispersion
To introduce the concept of 'degrees of freedom'
To understand how data is distributed and introduce the 'bell shaped curve' (the normal distribution)
To investigate methods of data presentation
To understand the principles behind the Central Limit Theorem
To introduce the concept of sampling and standard error
To relate the concept of standard error to that of confidence limits
To understand statistical confidence (p values and confidence limits), and how to interpret
To understand and construct one sided and two sided statistical hypotheses
To outline the principles behind formulating and testing statistical hypotheses.
To explore correlations and the principles of association and causation including Pearson, Kendall’s tau and Spearman
Content:
This session will discuss some of the key statistical concepts associated with descriptive and inferential statistics. It will start from first principles and explore the different types of data that we collect, and how we can describe it usefully for ourselves and others. It will explore how we can use our data to infer things about larger populations and explore the variability in our own data. Concepts of statistical confidence and certainty will be described.
Duration:
1 Day
Prerequisites:
Basic Numeracy
PC Skills – Keyboard and mouse skills
Key Statistical Concepts 2 (1 Day)
Aims:
Formulating analytical questions and corresponding hypotheses
data considerations, including data quality
analytical design (independent vs related samples),
statistical power.
tests for two independent groups (chi-squared, independent samples t-test, Wilcoxon signed ranks),
tests for two related groups (McNemar, Paired samples t-test, Mann Whitney U),
homogeneity of variance,
tests for multiple groups (e.g one way and two way Anova, Kruskal-Wallis etc)
Content:
This session will build on the principles discussed in Key Statistical Concepts 1. Statistical testing will be discussed, and the concept of independent samples and repeated measures will be discussed. The importance of the statistical hypothesis will be emphasised, and how it relates to one tailed and two tailed significance.
Duration:
1 Day
Prerequisites:
Basic Numeracy
PC Skills – Keyboard and mouse skills
Statistical Principles in Data Analysis (2 Days)
Aims:
To introduce summary statistics and methods of calculation
To introduce distributions, including the normal distribution
To introduce the concept of sampling and standard error
To outline the principles behind formulating and testing statistical hypotheses
To introduce confidence limits and p values and their interpretation
To introduce a framework for choosing the appropriate statistical test
To understand how SPSS (or similar packages) can be utilised in analysing quantitative data
Content:
This two day course is designed to give participants an overview of basic statistics and analysis of data that is most commonly used within research practices and other areas of health care; focusing primarily on key elements of quantitative data analysis. This workshop is aimed at those currently doing research, intending to do research or those with research interests and therefore assumes participants will be familiar with some research and statistical terminology.
Duration
2 days (Four 3 hour sessions)
Pre-requisites:
Basic Numeracy
PC Skills – Keyboard and mouse skills
An Introduction to Statistical Process Control
Aims:
To emphasise the importance of understanding your data
To understand time series data
To highlight problems with existing visualisation methods
To understand natural and special cause variation
To understand different types of process control charts
To understand rules for 'out of control' processes
To discuss process capability
Content:
This session will introduce participants to the concepts of statistical process control, what it is and when it can be used effectively. In particular, participants will have the opportunity to bring their own data and produce simple process control charts from first principles.
Duration
3 hours
Prerequisites:
Basic numeracy
PLEASE NOTE: This session will be based around count data that the client has brought with them
Statistical Process Control and Capability
Aims:
To emphasise the importance of understanding your data
To understand time series data, and other related methods
To highlight problems with existing visualisation methods
To understand natural and special cause variation
To understand different types of process control charts
To understand the statistics behind the charts
To understand rules for 'out of control' processes
To discuss process capability and the problem with targets
To discuss applicability of techniques to specific work areas
Content:
This session will introduce participants to the concepts of statistical process control, what it is and when it can be used effectively. It will look at the different types of control chart available, and when each of these might be suitable. It will also look at the capability of in control processes, and the relationship to six sigma approaches. Participants will have the opportunity to bring their own data and produce simple xmR process control charts from first principles.
Duration
1 Day
Prerequisites:
Basic numeracy
Use of calculator (please bring with you)
Microsoft Excel installed on client machine (optional)
PLEASE NOTE: This session will be based around count data that the client has brought with them
Regression Methods and Models (1 Day)
Aims:
To understand the difference between independent variables, dependent variables, confounding variables and other influencing variables
To understand simple correlation methods, including correlation coefficients and R-squared
To understand 'line of best fit' based on least squares methods
To understand simple linear regression, multiple linear regression and resulting equations
To understand how non-linear data can be transformed to fit a linear model
To understand binomial logistic regression, and how statistical outputs are intrepreted
To introduce multinomial logistic regression methods
Content:
This session will enable participants to understand the principles behind linear and logistic regression. The different methods will be discussed as well as data limitations. Practical examples of how regression models can be applied will be used, and the potentil for using regression techniques as predictive models will be explored.
Duration:
1 Day
Prerequisites:
Basic numeracy
Completion of “Introduction to Descriptive Statistics” session or similar understanding
Completion of “Introduction to Inferential Statistics” session or similar understanding
An Introduction to Time Series
Aims:
Knowing the areas to consider and clarify before using any time series method
Setting up and using simple forecasting methods
Confidently using the functions FORECAST and GROWTH within MS Excel
Knowing the limitations of these functions
Identifying whether data has trend and / or seasonality
Using regression functions in MS Excel to analyse data with seasonality
Understanding the advantages of exponential smoothing models over regression models
Identifying the correct type of exponential smoothing for the data, identifying whether it is an additive or multiplicative model
Setting up and / or using exponential smoothing spreadsheets
Understanding the limitations of any forecast produced, including interpreting confidence intervals when produced
Understanding some of the practical issues with data used for time forecasting
Content:
This session will look at how we deal with time related data, and the different types of variation within it. Different methods will be discussed to identify and smooth out the affects of variation, with a view to being able to use data to predict future activity. A particular focus on practical application and the use of MS Excel will be explored, with the main emphasis on ETS models.
Duration:
1 Day
Pre-requisites
Basic Numeracy
Time Series and Forecasting (1 Day)
Aims:
To understand time series data and how it is defined
To understand the relationship with process control and regression models
To understand and identify the different elements of time series data including secular trend, cyclical variation, seasonal variation and residual variation.
To be able to identify the difference between linear and non-linear trends.
To understand when to transform data.
To understand the concept of rolling averages, weighting data and smoothing techniques including exponential smoothing.
To be able to create and use exponential smoothing techniques including those incorporating seasonality
To understand the difference between additive and multiplicative models
To understand the importance of stationarity in data for ARIMA models, and to be able to make data stationary
To know how to interpret ARIMA models and outputs, including auto-correlation and partial auto-correlation functions
To give an overview of BATS and TBATS models within R.
Content:
This session will look at how we deal with time related data, and the different types of variation within it. Different methods will be discussed to identify and smooth out the affects of variation, with a view to being able to use data to predict future activity.
Duration:
1 Day
Prerequisites:
Basic numeracy
Completion of “Introduction to Descriptive Statistics” session or similar understanding
Completion of “Introduction to Inferential Statistics” session or similar understanding
An Introduction to Epidemiological Methods
Aims:
To understand different study methods, in particular cohort, cross-sectional, and case-control
To understand event rates in different groups
To understand prevalence and incidence rates and
To understand relative risk and when it can be used
To understand the concept of odds ratios
To understand how to calculate relative risk reduction and absolute risk reduction
To understand the concept of numbers needed to treat
Content:
In this session participants will be introduced to epidemiological methods and their application. In addition to learning some simple epidemiological techniques, participants will be able to discuss the relative strengths and weaknesses of different data collection mechanisms and their associated analysis.
Duration
3 hours
Prerequisites:
Basic Numeracy
Epidemiological / Population Methods
Aims:
To understand descriptive epidemiological design strategies including Correlational, Case Study, Case Series and Cross-Sectional
To understand analytical epidemiological design strategies including Case-Control and Cohort
To understand interventional epidemiological design strategies, specifically randomised controlled trials
To understand the role of chance (p-values and confidence intervals), bias, confounding and effect modification on study results
To be able to stratify data to help remove the effect of confounding
To understand cause-effect relationships and the contribution of strength of association, biological credibility, time sequence and dose-response
To understand and calculate prevalence and incidence, crude and category specific rates
To understand and be able to perform standardisation of data – direct and indirect
To understand Relative Risk and Odds Ratios and their relation to different study designs
To understand other ratios including Standardise Mortality Ratios, Proportional Mortality Ratios
To understand and calculate a number of measures of association, including relative risk reduction, attributable risk, absolute risk reduction, numbers needed to treat / harm
To understand the Chi-Square test and associated p-values
To be able to calculate confidence intervals for Relative Risk
To understand the concept of sample size and calculate for a case control study
To understand statistical power and calculate for a case control study
To understand the need for screening tests including the nature of disease
To understand and calculate the Sensitivity, Specificity, Predicted positive and negative values of a screening test
To understand how optimal sensitivity and specificity can be determined using ROC curves
Content:
In this session participants will be introduced to epidemiological and population methods and their application. The session not only covers the most common epidemiological analysis, but details the different types of epidemiological studies alongside their strengths and weaknesses. Common issues with epidemiological studies will be discussed as well as techniques for overcoming these including stratification and standardisation methods. In addition to learning epidemiological techniques to understand how exposure and disease are associated, participants will be able to perform sample size and power calculations. Finally the issue of pro-active screening for diseases will be discussed and concepts such as sensitivity and specificity will be introduced as well as methods to calculate and maximise them, including the use of ROC curves
1 Day
Prerequisites:
Basic Numeracy, although knowledge of basic statistics would be an advantage
Indicator Development
Aims:
To understand the difference between numbers, metrics and indicators
To understand the principles of good indicators
To understand the principles of reliability and validity
To understand the principles of sensitivity and specificity (and related Type I and Type II errors)
To understand the role of data quality
To discuss indicator construction, simple and compound measures
To discuss standardisation methods (direct and indirect), and when they should be used
To discuss presentation methods and the problems of ranking
To discuss the concept of gaming and perverse incentives
To discuss assurance of methods and what help is available
Content:
This session will look at the current state of indicator production within the system, and the related problems of having poorly defined, duplicate indicators. The session will enable participants to understand the role that indicators play and the importance of them being constructed properly and in a transparent manner. The key principles of good indicator development will be discussed, and related techniques for adjusting indicators will be explored. By the end of the session participants will not only know what makes a well constructed indicator but they will have the knowledge to access help that is available
Duration:
3 hours
Pre-requisites
Basic Numeracy
Data Insight and Communication Fundamentals
Aims:
To look at how information is requested within organisations
To identify how communication is key to effective analysis
To look at the link between business and information requirements
To determine how we can be clear about what is asked of us
To detail simple analytical methods and presentation techniques
Content:
This session is delivered via the Consortium for Health Analytics Intelligence (CHAIn).. It is delivered in a more informal setting than other sessions, allowing participants to ask questions and flex the content to their own needs. It is a particular useful session who find understanding the requirements of their management teams difficult, or where there seems to be a mismatch in communication. Through examples, exercise and the trainers’ own experiences the issues will be highlighted and practical solutions discussed in how to bridge the communication gap – ensuring that the right questions are being both asked and answered.
Duration
1 Day
Prerequisites:
Basic Numeracy
Making Your Data Accessible & Understandable to Others
Aims:
To emphasise the importance of context in presenting data
To consider the Who and How of presenting
To emphasise the difference between exploratory and explanatory analysis
To introduce the concepts of the 3-minute story and the Big Idea
To look at the concepts of data storytelling
To explore some of the visualisation concepts, including pre-attentive attributes
Content:
This session is delivered via the Consortium for Health Analytics Intelligence (CHAIn). It is delivered in a more informal setting than other sessions, allowing participants to ask questions and flex the content to their own needs. The session goes from first principles and while it is applicable to all types of presentations it mainly focusses on a physical presentation in front of a physical audience. There will be ample time to explore new concepts, and start to think about data presentation as a story with clear narrative arcs. There will also be a lot of opportunity for group work, and putting your own data into a form which will help you to get your message across to those that need to hear it.
Duration
2 Hours
Prerequisites:
Basic Numeracy
Presenting Data to Multi-Professional Groups
Aims:
To emphasise the importance of context in presenting data
To consider the Who, What, Where, When, Why and How of presenting
To emphasise the difference between exploratory and explanatory analysis
To introduce the concepts of the 3-minute story and the Big Idea
To look at the concepts of data storytelling and narrative arcs
To look at the types of data that will be used for storytelling
To explore some of the visualisation concepts, including pre-attentive attributes
To detail how data types and visualisations are interconnected, and how to ensure the right visualisation is used
To explore action titles, and when to use them
To explore how to present well, to conquer nerves and leave a lasting impression
Content:
This session is delivered via the Consortium for Health Analytics Intelligence (CHAIn). It is delivered in a more informal setting than other sessions, allowing participants to ask questions and flex the content to their own needs. The session goes from first principles and while it is applicable to all types of presentations it mainly focusses on a physical presentation in front of a physical audience. There will be ample time to explore new concepts, and start to think about data presentation as a story with clear narrative arcs. There will also be a lot of opportunity for group work, and putting your own data into a form which will help you to get your message across to those that need to hear it.
Duration
2 Days
Prerequisites:
Basic Numeracy
Data Storytelling - Demystified
Aims:
To emphasise the importance of context in presenting data
To consider the Who, What, Where, When, Why and How of presenting
To emphasise the difference between exploratory and explanatory analysis
To introduce the concepts of the 3-minute story and the Big Idea
To look at the concepts of data storytelling and different narrative arcs
To look at the types of data that will be used for storytelling
To explore some of the visualisation concepts, including pre-attentive attributes
To detail how data types and visualisations are interconnected, and how to ensure the right visualisation is used
To explore action titles, and when to use them
To explore how to present well, to conquer nerves and leave a lasting impression
Content:
This session is delivered via the Consortium for Health Analytics Intelligence (CHAIn). It is similar to the session Presenting Data to Multi-Professional Groups but is delivered over one day instead of two. It is also written for a generic audience rather than those solely from Health. The session goes from first principles and while it is applicable to all types of presentations it mainly focusses on a physical presentation in front of a physical audience. There will be ample time to explore new concepts, and start to think about data presentation as a story with clear narrative arcs. There will also be a lot of opportunity for group work, and putting your own data into a form which will help you to get your message across to those that need to hear it.
Duration
1 Day
Prerequisites:
Basic Numeracy