Fake it 'til you make it: production test data at scale
Explore open source tools & data sources for generating realistic synthetic data, enabling better product design, testing, and secure engineering practices.
About this talk
Data Scotland 2022
Many organisations provide digital products or services that need to handle personally identifiable information. The challenge is providing product and engineering teams with a sufficient volume of realistic-looking synthetic data to design, develop and test their solutions.
Barry presents open source tools and open data sources that can be used to tackle this challenge, then demonstrates them in action by generating thousands of synthetic customers.
He describes how this approach can be used to build better products, to test products using production-quality data at production scale, and to embed data quality and information security best practices in your engineering processes.
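As a rough illustration of generating thousands of synthetic customers, here is a minimal Python sketch. It is not the tooling shown in the talk, which this abstract does not name. The name and city lists, the field names, and the `generate_customers` and `to_csv` helpers are all hypothetical; a real pipeline would likely draw on open data sets and a dedicated generator library rather than the standard library alone.

```python
import csv
import io
import random

# Hypothetical seed lists; a real pipeline would load open data sets instead.
FIRST_NAMES = ["Aileen", "Callum", "Fiona", "Hamish", "Isla", "Rory"]
LAST_NAMES = ["Campbell", "Fraser", "MacDonald", "Murray", "Stewart"]
CITIES = ["Aberdeen", "Dundee", "Edinburgh", "Glasgow", "Inverness"]


def generate_customers(count, seed=42):
    """Return `count` synthetic customer records as dictionaries."""
    rng = random.Random(seed)  # seeded so the test data is reproducible
    customers = []
    for customer_id in range(1, count + 1):
        first = rng.choice(FIRST_NAMES)
        last = rng.choice(LAST_NAMES)
        customers.append({
            "customer_id": customer_id,
            "first_name": first,
            "last_name": last,
            # example.com is reserved for documentation, so no real inbox is used;
            # the id suffix keeps every address unique
            "email": f"{first}.{last}.{customer_id}@example.com".lower(),
            "city": rng.choice(CITIES),
        })
    return customers


def to_csv(customers):
    """Serialise the records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(customers[0].keys()))
    writer.writeheader()
    writer.writerows(customers)
    return buffer.getvalue()


customers = generate_customers(5000)
csv_text = to_csv(customers)
```

Seeding the generator means the same data set can be rebuilt on demand, so test runs stay comparable, and because no record is derived from a real person, the output can be shared across teams without exposing personally identifiable information.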