Building a ready-to-use data system from the ground up, before data exists

Conference Expert Big Data

2023-10-19 | 02:30 PM - 03:00 PM | Conference Room 1 (203 - ground level)


At Sanofi’s Toronto site, a new bulk vaccine facility representing an investment of over CAD 500 million is getting ready to start manufacturing operations. This new facility, named Building 100 (B100), has given the organization an opportunity to build a data system that provides ready-to-use (RTU) data to end-users from day one. In the past, such data systems were developed months after a project reached the execution stage, because by then data management teams knew where to find the data and could build data management systems and test data connections using executed batch data. However, this meant that the business did not have access to RTU data for supporting and troubleshooting the initial execution phase of operations, including engineering batches. Against this backdrop, the on-site Data Science Team set out to build a data contextualization pipeline before any operational activities had started in B100. That meant building a data system before data existed.
B100, with its state-of-the-art automation systems, manufacturing execution system (MES) and PI data historian, made it challenging to access business-ready data. Raw data from MES and PI are messy and not easily readable. Data from MES, PI, the laboratory information management system (LIMS) and the enterprise resource planning (ERP) system needed to be contextualized so that end-users could avoid non-value-added manual effort. This required cross-functional partnerships between stakeholders to understand how data is structured in the different source systems, capture end-user requirements, and build novel data engineering solutions.
During this presentation, we will share our experience in building an RTU solution, the lessons we learned, and how this solution is helping our digital transformation journey.