APR Nov/Dec 2022 - 71
« FORMULATION AND DEVELOPMENT
What's Keeping Your Enterprise from Fully Leveraging
Its Data?
Jim Olson
CEO
Flywheel
Research & development (R&D) processes are increasingly being
redefined within the framework of digital transformation. With these
initiatives, pharma organizations aim to improve speed to market and
reduce the considerable R&D costs of drug discovery and development.
Artificial intelligence (AI) and machine learning (ML) are key parts
of this transformation. However, organizations are discovering that
harnessing these approaches isn't the work of just a few weeks or
months. Some companies have gone through several iterations of
digital transformation in R&D, with costly failures along the way. A
recurring issue among these iterations has been the problem of how
to efficiently de-silo data and enable enterprise-wide access.
Machine learning is data-hungry (especially for diverse data), and
unless research enterprises have outstanding data governance models
in place, they face challenges in realizing a true digital transformation.
Often, the challenge is in breaking data out of institutional silos, uniting
them in a central repository (or a centrally managed and normalized
set of repositories), and standardizing them for machine learning.
Decades of ingrained culture, pieced-together tech stacks, and
homegrown systems make strong headwinds for life science leaders
as they seek to move their research teams forward. But the risks
of the status quo are impossible to ignore: without modern data
management, organizations are wasting money, missing innovative
opportunities, under-leveraging their assets, and potentially even
facing compliance risks.
Why Is Standardizing Data
Such a Challenge?
It doesn't take long to understand at a high level why data management
is so difficult for pharma companies. The data they hold are old and
new, simple and complex, and sourced from multiple places, ranging
from internal or external research results to clinical trials that are still
underway. In addition to what is located in a company's own archives,
data can be held externally by development partners such as contract
research organizations (CROs) and clinical sites.
To explore just one example of the difficulty in standardizing even
newly captured data, consider a global clinical trial with hundreds
of patients that has prescribed an imaging protocol with five types
of examinations for every patient at multiple time points. With
imaging being performed via multiple sites, devices, providers, and
languages, this trial could generate thousands of unique metadata
tags and descriptions.
As this example illustrates, the sum of an organization's data, having
been captured over time by different researchers, different devices,
and using different organizational conventions, results in a collection
of heterogeneous data with untold variation, even if data is provided in
the same format (e.g., DICOM). The older the data, the more challenges
may arise when attempting to curate those data. Over a dataset's
lifecycle, as the data have moved through transfer and analysis
pipelines and changed format, the likelihood that metadata have
been manipulated or even removed increases. Significant portions
of the data will likely predate AI/ML and will not have been acquired,
archived or organized with such applications in mind. However, as
already mentioned, ML is data-hungry, and even legacy data or data
that are part of long-term longitudinal studies with issues like these
are worth curating to create bigger datasets for training.
Traditionally, in order to harness disparate data for ML, research teams
must spend countless hours locating datasets and manually curating
them to a common standard. This curation entails tasks including
selection, classification, transformation, validation, and preservation
of research data and supporting material. The time required by this
www.americanpharmaceuticalreview.com |
| 71
»
http://www.americanpharmaceuticalreview.com
APR Nov/Dec 2022
Table of Contents for the Digital Edition of APR Nov/Dec 2022
APR Nov/Dec 2022 - Cover1
APR Nov/Dec 2022 - Cover2
APR Nov/Dec 2022 - 1
APR Nov/Dec 2022 - 2
APR Nov/Dec 2022 - 3
APR Nov/Dec 2022 - 4
APR Nov/Dec 2022 - 5
APR Nov/Dec 2022 - 6
APR Nov/Dec 2022 - 7
APR Nov/Dec 2022 - 8
APR Nov/Dec 2022 - 9
APR Nov/Dec 2022 - 10
APR Nov/Dec 2022 - 11
APR Nov/Dec 2022 - 12
APR Nov/Dec 2022 - 13
APR Nov/Dec 2022 - 14
APR Nov/Dec 2022 - 15
APR Nov/Dec 2022 - 16
APR Nov/Dec 2022 - 17
APR Nov/Dec 2022 - 18
APR Nov/Dec 2022 - 19
APR Nov/Dec 2022 - 20
APR Nov/Dec 2022 - 21
APR Nov/Dec 2022 - 22
APR Nov/Dec 2022 - 23
APR Nov/Dec 2022 - 24
APR Nov/Dec 2022 - 25
APR Nov/Dec 2022 - 26
APR Nov/Dec 2022 - 27
APR Nov/Dec 2022 - 28
APR Nov/Dec 2022 - 29
APR Nov/Dec 2022 - 30
APR Nov/Dec 2022 - 31
APR Nov/Dec 2022 - 32
APR Nov/Dec 2022 - 33
APR Nov/Dec 2022 - 34
APR Nov/Dec 2022 - 35
APR Nov/Dec 2022 - 36
APR Nov/Dec 2022 - 37
APR Nov/Dec 2022 - 38
APR Nov/Dec 2022 - 39
APR Nov/Dec 2022 - 40
APR Nov/Dec 2022 - 41
APR Nov/Dec 2022 - 42
APR Nov/Dec 2022 - 43
APR Nov/Dec 2022 - 44
APR Nov/Dec 2022 - 45
APR Nov/Dec 2022 - 46
APR Nov/Dec 2022 - 47
APR Nov/Dec 2022 - 48
APR Nov/Dec 2022 - 49
APR Nov/Dec 2022 - 50
APR Nov/Dec 2022 - 51
APR Nov/Dec 2022 - 52
APR Nov/Dec 2022 - 53
APR Nov/Dec 2022 - 54
APR Nov/Dec 2022 - 55
APR Nov/Dec 2022 - 56
APR Nov/Dec 2022 - 57
APR Nov/Dec 2022 - 58
APR Nov/Dec 2022 - 59
APR Nov/Dec 2022 - 60
APR Nov/Dec 2022 - 61
APR Nov/Dec 2022 - 62
APR Nov/Dec 2022 - 63
APR Nov/Dec 2022 - 64
APR Nov/Dec 2022 - 65
APR Nov/Dec 2022 - 66
APR Nov/Dec 2022 - 67
APR Nov/Dec 2022 - 68
APR Nov/Dec 2022 - 69
APR Nov/Dec 2022 - 70
APR Nov/Dec 2022 - 71
APR Nov/Dec 2022 - 72
APR Nov/Dec 2022 - 73
APR Nov/Dec 2022 - 74
APR Nov/Dec 2022 - 75
APR Nov/Dec 2022 - 76
APR Nov/Dec 2022 - 77
APR Nov/Dec 2022 - 78
APR Nov/Dec 2022 - 79
APR Nov/Dec 2022 - 80
APR Nov/Dec 2022 - 81
APR Nov/Dec 2022 - 82
APR Nov/Dec 2022 - 83
APR Nov/Dec 2022 - 84
APR Nov/Dec 2022 - Cover3
APR Nov/Dec 2022 - Cover4
https://www.nxtbookmedia.com