Data Science Pathway

Get ready to join the machine learning revolution and discover a new career in one of tech’s fastest-growing sectors.

Apply now
23rd September 2024

Course summary

Duration

12 weeks part-time

Fees

£4,000

Locations

online

Contact

+44 (0)1249 475 423
hello@io-academy.uk

Overview

The world is data driven

This fully remote, online course is designed to help developers and others working in tech progress their skills into data science and machine learning. It is ideal for software developers of all levels, from bootcamp graduates to senior engineers. We’ll start with a quick intro to Python to get everyone on the same page, before going in depth on statistics, data cleaning and analysis, machine learning and even some neural nets.

In this 12-week course, you will learn the practical skills needed to start a successful career in data science and machine learning, working with real data sets and applying those skills to real-world scenarios.

Curriculum

What you'll learn

You’ll study with us twice a week for 12 weeks, with an additional drop-in session every week. After an initial intro to Python, the first six weeks focus on statistics and data science, and the second six weeks on machine learning and more advanced topics.

The sessions, each lasting 2.5 hours, take place on Monday and Wednesday evenings, with an optional two-hour drop-in session every Friday to help you catch up on missed content, recap difficult topics or get support with take-home exercises.

The curriculum is practically driven, with real exercises throughout, and is led by our industry expert trainer, Richard.

We’ll kick off by getting to grips with the fundamentals of Python. As developers you will already know how to code, but not necessarily in Python or using the stats libraries and functions required for data science. Here you will learn why Python is the go-to data language.

Now we are ready to start looking at data science. What is it? What can you use it for and why? We’ll explore all these questions and more as we begin the journey into data.

Time to travel back to school and get a refresher in foundational maths, histograms, mean, median and mode, standard deviation and more. All the basic statistics you will need.
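As a taste of this session, here is a minimal sketch of those descriptive statistics using only Python's built-in `statistics` module. The `tickets` sample is made up for illustration and is not course material.

```python
import statistics

# Hypothetical sample: support tickets logged per day over nine days (made-up numbers).
tickets = [12, 15, 11, 15, 18, 15, 22, 9, 18]

mean = statistics.mean(tickets)      # arithmetic average
median = statistics.median(tickets)  # middle value once sorted
mode = statistics.mode(tickets)      # most frequent value
spread = statistics.stdev(tickets)   # sample standard deviation

print(f"mean={mean}, median={median}, mode={mode}, stdev={spread}")
```

Here the mean, median and mode all land on 15, while the standard deviation of 4 tells you how far a typical day sits from that centre.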

Moving on from foundational maths, we will now look at basic probability theory, covering combinatorics, Bayesian inference, and distributions.
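To give a flavour of Bayesian inference, the sketch below applies Bayes' theorem to a classic screening-test question. The function name `posterior` and all of the rates are assumptions chosen for illustration.

```python
def posterior(prior, sensitivity, false_positive_rate):
    """Return P(condition | positive test) using Bayes' theorem."""
    true_positive = sensitivity * prior                  # P(positive and condition)
    false_positive = false_positive_rate * (1 - prior)   # P(positive and no condition)
    return true_positive / (true_positive + false_positive)


# Hypothetical figures: 1% of people have the condition, the test catches 95%
# of real cases and wrongly flags 5% of healthy people.
p = posterior(prior=0.01, sensitivity=0.95, false_positive_rate=0.05)
print(f"Chance of having the condition after a positive test: {p:.1%}")
```

Even with a fairly accurate test, the posterior comes out at roughly 16%, because the condition is rare to begin with.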

Next we will look at inferential statistics and hypothesis testing, where we will begin reaching conclusions that extend beyond the data available.

What software is in the toolkit of a data scientist? In this session we will look at a range of tools used throughout the industry, including Jupyter Notebook, NumPy and pandas, as well as how we can better use Visual Studio Code with the right extensions for your project.

Now that we have learnt about different ways of analysing our data, we will start to visualise it. Using tools like Matplotlib, seaborn and pandas, we will start to display our data in different ways.

This session is part one of understanding how to load data into Python, read it, and clean it.

Following on from the previous class, we will now explore combining dataframes and other more complex methods of working with and cleaning data.

Time to step it up a bit. We’ll now look at more advanced data methods, what they do, and how we can use them to create meaningful outcomes.

The really interesting work begins here. With a range of real data sets we are now ready to start cleaning and analysing data to begin making predictions.

Now that you have analysed some real data, you will present your work to the team and open it up for discussion.

The halfway point! With data science under your belt, we will now move on to machine learning. We’ll start with basic definitions and examples of what you can do.

We’ll now start looking at a range of regression techniques (some simple and some complex), used to analyse relationships between variables in your data. Techniques such as multiple linear regression, Support Vector Regression (SVR), decision trees and random forests will all come into play.

Some data needs to be put into categories, or classes, in order to better analyse it. Logistic regression, support vector machines (SVM) and k-nearest neighbours (KNN) are just some of the classification techniques we will cover.

Working with time series data can be tricky – now we will look at how to analyse this type of data and work with Hadoop Distributed File System (HDFS).

There are lots of concepts to learn when working with machine learning models. In this module we will learn about concepts including over/underfitting, bias/variance, confusion matrices, and precision vs accuracy.
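For example, a confusion matrix shows why precision and accuracy are different things. The following is a minimal sketch in plain Python; the labels are invented, with 1 standing for a positive case and 0 for a negative one.

```python
from collections import Counter

# Hypothetical true labels and model predictions for ten samples (made-up data).
actual    = [1, 0, 0, 0, 0, 0, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 0, 0, 0, 0, 0]

# Count each (actual, predicted) pair: the four cells of the confusion matrix.
cells = Counter(zip(actual, predicted))
tp = cells[(1, 1)]  # true positives
fp = cells[(0, 1)]  # false positives
fn = cells[(1, 0)]  # false negatives
tn = cells[(0, 0)]  # true negatives

accuracy = (tp + tn) / len(actual)  # share of all predictions that were right
precision = tp / (tp + fp)          # share of positive predictions that were right
recall = tp / (tp + fn)             # share of real positives that were found

print(f"accuracy={accuracy}, precision={precision}, recall={recall}")
```

This model is right 80% of the time overall, yet only half of its positive predictions are correct, which is the kind of gap that shows up when one class is rare.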

Now that we know how to interpret different models, we will learn about optimising those models using techniques such as hyperparameter tuning, data augmentation, grid search and k-fold cross-validation.
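To illustrate the idea behind k-fold validation, here is a small sketch of how a data set's row indices can be split into folds. The helper `k_fold_indices` is a hypothetical name written for this example; in practice a library such as scikit-learn provides this for you.

```python
def k_fold_indices(n_samples, k):
    """Yield (train, validation) index lists for k roughly equal, unshuffled folds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


folds = list(k_fold_indices(n_samples=10, k=3))
for train, validation in folds:
    print("train:", train, "validation:", validation)
```

Each row is used for validation exactly once and for training in every other round, so the model is always scored on data it has not seen.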

We are now ready to start a group project. Working with real data you will clean the data, select a model, and optimise it.

What is deep learning? In this module we will discover what it is, what we can use it for and how it works. We will also discuss different libraries (scikit-learn, PyTorch, Keras, TensorFlow) and when to use them.

Following on from our initial discussion of deep learning and the libraries involved, we will use Keras, the high-level API that ships with TensorFlow, to create a neural network.

Now that we have looked at creating neural networks, we will explore the maths behind them, the processes involved, and what else we can use Keras for, including transfer learning.

Having learnt all about machine learning, we will now learn how to integrate a machine learning model into a website.

In our final module, you will present your machine learning projects. We will discuss them, and then finish the course by talking over next steps for your new career in data science.

Interest free payment plans

Study now, pay later

In our commitment to supporting your career in tech, we’ve collaborated with StepEx to introduce affordable financing options tailored to your needs.

We appreciate the diverse paths individuals take to pursue education and career goals. With StepEx financing, we aim to make these paths more accessible by allowing you to split your tuition fees into manageable monthly payments:

• Crafted for those seeking financial flexibility, allowing repayment through manageable monthly instalments.
• Interest-free.
• Equal monthly payments over a fixed period of time.
• First repayment starts 3 months from the course end date.
• Available repayment periods are 12, 24 and 36 months.

For more details, visit the StepEx website.

The above financial offers and the credit provision itself are provided by StepEx Lender Ltd, a consumer credit lender authorised by the Financial Conduct Authority under FRN 824928.

The course has helped [our team] to present new ways of working with our data and we are excited to explore where this will take our product offering this year.
Matthew Hornsey Co-founder & Director, Phronesis

Get ready to learn

No coding experience? No problem!

Want to break into the world of data science and machine learning but don’t know how to code? Fear not! Our three-day Intro to Python workshop provides all the foundational knowledge and practical experience you’ll need to get up to speed ready to start this course.

Industry expert

Richard

Data Science Trainer

I love being able to bring my industry experience into lessons, providing students with a real-world perspective of data science




Curriculum

What you'll learn

Python crash course

Intro to data science

Foundational maths & descriptive statistics

Probability

Hypothesis testing & inferential statistics

Data science tools

Visualisation techniques

Loading, reading and cleaning data 1

Loading, reading and cleaning data 2

Advanced data engineering methods

Applying to real datasets

Report presentation

Introduction to machine learning with Sklearn

Regression techniques

Classification techniques

Working with time series data

Interpreting ML models

Optimising machine learning models

End to end data science and machine learning project

Introduction to deep learning

Introduction to keras

Convolutional neural networks

Machine learning integration

End to end project presentation