The 6-step Data Quality Method: Birmingham Training Workshop

Event Information
Description
Consider these three facts: (a) there are more than 100 ways in which data can be of “low” quality, (b) data preparation (which includes data quality checks) often takes more than half of a data science project’s time, and (c) few data scientists or analysts have had much training in data quality.
https://github.com/royruddle/6-step-data-quality-method
Event Location
Attendee Categories
Registration fee (includes catering): £250
Contact
If you have any questions about this event, please email lida@leeds.ac.uk

More Information
In this practical workshop, you will learn an efficient and rigorous method for investigating data quality that can:
1. Save you time, by defining a set of tasks and questions to ask about your data
2. Reduce cost, by avoiding re-work
3. Improve your results, by correcting your assumptions and understanding any limitations of your data

Structure
The workshop is based on the 6-step Data Quality Method that we developed through 10+ years of personal experience and with input from data scientists and analysts working in 15 industry sectors. The workshop covers the what, when, how and why of investigating data quality and is divided into these parts:
· Welcome and introductions.
· What? Turning data quality issues into plain English tasks & questions for you to answer about your data.
· When? Prioritising which tasks to do first and which to leave until later.
· Practical challenges in small groups, involving tabular, spatial and longitudinal data.
· How and why? Discussing how you tackled each challenge and seeing exemplar solutions.

Is this workshop for me?
The workshop is open to everyone, irrespective of your level of knowledge about data quality. The workshop is designed for:
· Data scientists and analysts, who need to know how to properly investigate data quality in their work.
· Technical leaders, who know the adage “garbage in … garbage out”, but not the many ways in which data may be wrong or the effects that can have.
· Application domain clients, who need sufficient knowledge to have informed discussions with the teams who are delivering their projects.
You may use any software for the practical challenges. Please bring a laptop with that software installed (e.g., Python, R or Excel).

Workshop tutor
Roy Ruddle is a Professor of Computing at the University of Leeds, and Director of Research Technology at the Leeds Institute for Data Analytics (LIDA). Roy has worked in both industry and academia. He is an expert in data visualization and data quality, with decades of experience spanning hospital & GP data, radio astronomy, supermarkets, explainable AI and even train doors! His Leeds Virtual Microscope (LVM) was commercialised by the global healthcare company Roche, and his petrophysics visual data analysis software was commercialised by the spin-off Petriva Ltd. Roy regularly gives talks and continuing professional development (CPD) training workshops (https://github.com/royruddle/tutorials-and-talks).

