- This event has passed.
Predictive analytics, machine learning and data science for big data: Auckland, 1–2 November 2018
1 November 2018 @ 9:30 am - 2 November 2018 @ 5:00 pm
Secure your place today
Our leading course has transformed the machine-learning and data-science practice of the many managers, sponsors, key stakeholders, entrepreneurs and beginning data-science practitioners who have attended it.
This course is an intuitive, hands-on introduction to data science and machine learning. The training focuses on central concepts and key skills, leaving the trainee with a deep understanding of the foundations of data science and even some of the more advanced tools used in the field.
The course also covers key issues of data science practice in a work environment, and directs trainees to a range of further learning directions.
The skills taught are transferable to all software platforms, and the course does not involve coding, or require any coding knowledge or experience. A tool with a graphical user interface is used so trainees can focus on learning the central skills and ideas.
Key skills taught include building, assessing, selecting and deploying predictive models, as well as employing some of the most commonly used methods in the field, including general linear models (GLMs), and advanced methods such as random forests.
Earlybird pricing is available until 18 October 2018.
Group discounts also apply during the earlybird period: 5% for 2–4 people, 10% for 5–6 people, 15% for 7–8 people, and 20% for 9 or more people. Please contact us at firstname.lastname@example.org to take advantage of these special rates.
This course will provide a conceptual overview and practical hands-on experience of a wide range of key tools, techniques and processes.
At the heart of the data mining toolkit is the suite of predictive modelling methods. Accordingly, the course will develop attendees’ literacy in the strengths, characteristics and correct application of a range of predictive modelling methods, from relatively simple linear models through to complex and powerful Random Forests, Support Vector Machines, Decision Trees, Tree Boosting Machines and Neural Networks will be covered along the way.
It will also teach the correct framing of predictive modelling problems, suitably preparing data, evaluating model accuracy and stability, interpreting results and interrogating models.
The two key styles of predictive modelling – operational for targeting and explanatory for insights – will be described and distinguished.
As well as predictive modelling, the course will cover a range of other key data mining tools, including:
- Data exploration and visualisation: univariate summaries, correlation matrices, heat maps, hierarchical clustering.
- Cluster analysis – used for customer segmentation and anomaly detection
- Other “unsupervised” outlier detection tools.
This course will primarily be taught using Rattle, a graphical interface for predictive modelling and data science in R. Participants will be exposed to “Big Data” techniques as applied to machine learning and deployed on Cloud Computing platforms.
Day 1 of the course covers the basics of machine learning and predictive modelling: what it is, what it isn’t, where it fits, and what the major buzzphrases like machine learning, AI, data science and “big data” actually mean. It also introduces exploratory data analysis and visualisation through the use of an example data set. Next, attendees will build predictive models using decision trees and generalised linear models, and experience a detailed discussion of the workings of these methods.
What then follows is the most important part of the course—
understanding model accuracy: what it is; why it’s tricky to define, let alone measure; and why it is the holy grail of machine learning and the one KPI that any manager or sponsor of the analytics function must understand. Model accuracy is not just a technical detail.
Trainees then build on this understanding to use out-of-sample model accuracy to build, evaluate and select models.
Day 2 builds on day 1, with an increased practical focus.
Day 2 introduces classification error measurement methods, which are then applied in a model tuning and selection exercise. Business-relevant value measurement is introduced in this section. The previous day’s model building, evaluation and selection, and day 2’s model tuning, are completed with model deployment—predictions on new data.
There follows an explanation of other use cases, and of more advanced error measurement methods, particularly the AUC measure.
Then there is an explanation of model degradation and the importance of evaluating and tracking models over time.
The final parts of the course tend to focus on two key topics:
k-fold cross-validation, which is vital in many business applications, and the random forest method, which is easy to use, powerful, and highly versatile, giving beginners strong capabilities for using data for prediction or storytelling.
The following additional topics may be covered depending on the pace and interests of the class:
- Link and network analysis visualisation – which provide a simple and compelling way to communicate and analyse relationships, and are commonly applied in forensics, human resources and law enforcement.
- Association analysis – used in retail market basket analysis and the assessment of risk groupings.
- Frequent item set analysis.
Who should attend?
This course is suitable for anyone in management, administrative, product, marketing, finance, risk and IT roles who works with data and wants to become acquainted with modern data analysis tools.
No prior knowledge of R is required to take this course.
Attendees should, by the end of the course:
- Learn fundamentals of predictive modelling and experience using a range of methods.
- Have improved their ability to assess the effectiveness and fitness for purpose of any predictive modelling tool or technique.
- Have experience with a range of unsupervised data techniques.
- Be exposed to Big Data and Cloud Computing applications.
Courses are taught by Dr Eugene Dubossarsky and his hand-picked team of highly skilled instructors.
About our training
Eugene Dubossarsky’s courses are unlike those offered in universities, online, or by private providers. His data-science classes, in particular, give clients not just knowledge of a process, but the real power of understanding the underlying concepts, allowing them to confidently practice, manage, promote and risk-assess data science.
Dr Dubossarsky says “the way many courses teach data science is like teaching people to memorise and recite poetry in a language they do not understand”. By contrast, he confers an understanding of that language, taught in an intuitive, accessible way that leaves trainees with an instinct for data science. Keeping formulae and mathematics to a bare minimum and taking an intuitive, visual approach, Eugene’s courses deliver a compressed mentoring experience as much as they do content. This is difficult for an average trainer to replicate. Trainees benefit from his extensive knowledge and over 20 years of commercial data-science experience, as well as his unique teaching style.
The resulting testimonials speak for themselves, and candidates come from all walks of life: CEOs, general managers, salespeople, IT professionals, marketing staff, public servants and of course people from many functions in the finance world. These testimonials are extensive, and many more are available on request. With specific regard to finance, Eugene has mentored and advised senior leaders and their teams in a number of major Australian banks.
Having studied stats at Uni I was surprised how far the field has progressed in the last few years, particularly in the area of big data. The great thing about Eugene’s course is I left with a sense that I was up to date with the latest big data modelling concepts but more importantly could also deploy them with some confidence using R. Eugene also made it clear he was available to answer questions after the course, so you are not left hanging. I would absolutely recommend this!
—Damon Rasheed, CEO, Rate Detective
For someone who does not come from an IT background R is a terrifying program. Before doing the Introduction to R course I had previously done other courses in R but always found myself in over my head because they assumed a high level of program experience (even course that required no prior programming knowledge). This course is not like that at all. It starts at ground zero and teaches you everything you need to know to be able to use R confidently in your everyday workplace. It is a must attend for anyone who wants use R!
Data science can be a challenging topic but Eugene’s “Introduction to Machine Learning” course turns complex statistical models into plain English. The course contents and presentation were accessible and I enjoyed the mixture of hands-on rattle() exercises, the challenge of building multiple models with real life data, and the salient theory whiteboard discussions created many “aha” moments.
It was a great introductory course and it gave me with a better grasp of Machine Learning in general, a great framework for thinking about it and practical hands-on skills that I can put to immediate use. I wish I had done this course sooner.
—Charl Swart, Director of Business Operations, Unisys Credit Services
Questions and further details
Meals and refreshments
Catered morning tea and lunch are provided on both days of the course. Please notify us at least a week ahead if you have any special dietary requirements.
Use email@example.com to email us any questions about the course, including requests for more detail, or for specific content you would like to see covered, or queries regarding prerequisites and suitability.
If you would like to attend but for any reason cannot, please also let us know.
Course material may vary from advertised due to demands and learning pace of attendees. Additional material may be presented, along with or in place of advertised.
Cancellations and refunds
You can get a full refund if you cancel 2 weeks or more before the course starts. No refunds will be issued for cancellations made less than 2 weeks before the course starts.
Frequently asked questions (FAQ)
Do I need to bring my own computer?
There’s no need to bring your own laptop or PC. Our courses take place in modern, professional training facilities that have all the computing equipment you’ll need.
I’m lost! How do I find the venue?
Presciient training, coaching, mentoring and capability development for analytics
Please ask about tailored, in-house training courses, coaching analytics teams, executive mentoring and strategic advice and other services to build your organisation’s strategic and operational analytics capability.
Our courses include:
- Predictive Analytics, Machine Learning, Data Science and AI
- Data Literacy for Everyone
- Introduction to R and Data Visualisation
- Introduction to Python for Data Analysis
- Forecasting and Trend Analytics
- Advanced Machine Learning Masterclass
- Advanced Masterclass 2: Random Forests
- Advanced R
- Quantum Computing
- Text and Language Analytics
- Fraud and Anomaly Detection
- Introduction to Machine Learning
- Introduction to Data Science
- Kaggle Boot Camp
By booking this course, you agree to our terms and conditions.
For any enquiries, please call 0800-424282 (toll-free).
If you prefer, you can pay by invoice rather than credit card. Just select “Pay by invoice” at the checkout.