The challenge
Inspection departments often hold large amounts of information, of variable quality and in various
formats. When considering the procurement of a new inspection management solution,
inspection professionals are usually keen to understand how a prospective vendor will manage
the transfer of their data to the vendor's system.
This whitepaper will outline an approach that Credosoft recommend to ensure a successful
data migration project. Please note that this is not an exhaustive or detailed list.
Whilst technology will be utilised to manage the physical extraction, transformation and loading
of your data, too often data migration is viewed as a purely technical exercise. The reality is that
only the owners and consumers of the data understand the meaning and value of the data. This
value must be retained and enhanced during data migration. Emphasising technology over
business knowledge should be avoided.
Credosoft whitepaper V0.1
Data migration is a business issue, not a technical issue
The inspection team are the experts in what the department does. They have been running the
legacy systems and the systems have been running the inspection activities. Finally, the
inspection team have the expertise to make judgements about the quality and usefulness of the
data. That said, the client will rightly expect guidance on how to make the data fit the
new inspection system.
The data migration analyst cannot know more about the business rules than the inspection
team.
2. Data profiling
The inspection team know where all the data sources that run the department are located. Early
identification of all possible data sources is vital to ensure accurate estimating of timescales.
Departments often have various spreadsheets or mini databases which support their daily
inspection activities. Some may be unacknowledged yet critical to the
department's processes, sometimes addressing gaps in the official system. Each will be in
a different format and of varying quality, and there will often be difficulties in linking them together.
Once all the legacy data stores have been identified, this is the point at which you may agree the
scope of work.
At this stage we discover and catalogue all the data stores and their relationship to one another.
We look inside the data stores to understand how they work, what data they hold and what
challenges they may present.
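As an illustration of looking inside a data store, the following is a minimal sketch that counts filled, blank and distinct values per column. It assumes the legacy store can be exported as CSV; the column names and sample rows are hypothetical and are not taken from any real Credosoft system.

```python
import csv
import io
from collections import Counter


def profile_rows(rows):
    """Return per-column counts of filled, blank and distinct values."""
    profile = {}
    for row in rows:
        for column, value in row.items():
            stats = profile.setdefault(
                column, {"filled": 0, "blank": 0, "values": Counter()}
            )
            text = (value or "").strip()
            if text:
                stats["filled"] += 1
                stats["values"][text] += 1
            else:
                stats["blank"] += 1
    return {
        column: {
            "filled": s["filled"],
            "blank": s["blank"],
            "distinct": len(s["values"]),
        }
        for column, s in profile.items()
    }


# Hypothetical export from a legacy inspection spreadsheet.
SAMPLE = """asset_id,inspection_date,condition
V-101,2019-03-01,Good
V-102,,good
V-103,2020-07-15,
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
summary = profile_rows(rows)
for column, stats in summary.items():
    print(column, stats)
```

A summary like this gives the inspection team something concrete to review: here the blank inspection date and the inconsistent casing of the condition values would be raised as questions for the people who own the data.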
Finally, we need to agree on what data you want to migrate. In an ideal world you would migrate
everything; however, the reality is that there will be data which may not add value or may be
covered by a different source.
Where domain experts check the data to ensure it matches
business reality
Internal gaps
Where the data does not conform to internal rules, for example
where referential integrity does not exist
Migration model gaps
This occurs when there are differences in the data from different
legacy data stores
Target model gaps
This occurs when there are differences in data and data
structures between the source and target systems
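An internal gap such as missing referential integrity can be detected with a simple lookup. The sketch below assumes two hypothetical legacy tables, assets and inspections, held as lists of dictionaries; the field names are illustrative only.

```python
def find_orphans(child_rows, parent_rows, foreign_key, parent_key):
    """Return child rows whose foreign key has no matching parent row."""
    parent_ids = {row[parent_key] for row in parent_rows}
    return [row for row in child_rows if row[foreign_key] not in parent_ids]


# Hypothetical legacy data: one inspection points at an asset that does not exist.
assets = [{"asset_id": "V-101"}, {"asset_id": "V-102"}]
inspections = [
    {"inspection_id": 1, "asset_id": "V-101"},
    {"inspection_id": 2, "asset_id": "V-999"},
]

orphans = find_orphans(inspections, assets, "asset_id", "asset_id")
print(orphans)
```

Each orphan found this way is a question for the inspection team rather than something the analyst should resolve alone.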
Mapping
This is the linking of fields in the legacy data stores to fields in the target. Mapping requires close
collaboration between the client and the vendor. The use of a spreadsheet is perfectly
acceptable for this activity.
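To show how a mapping spreadsheet might drive the migration, here is a minimal sketch that assumes the spreadsheet is saved as CSV with two columns, legacy_field and target_field. Both column names and all field names are hypothetical.

```python
import csv
import io

# Hypothetical mapping spreadsheet, saved as CSV.
MAPPING_CSV = """legacy_field,target_field
EquipNo,asset_id
InspDate,inspection_date
Cond,condition
"""


def load_mapping(text):
    """Read the mapping spreadsheet into a legacy-to-target dictionary."""
    reader = csv.DictReader(io.StringIO(text))
    return {row["legacy_field"]: row["target_field"] for row in reader}


def apply_mapping(record, mapping):
    """Rename mapped fields and report legacy fields with no agreed target."""
    mapped = {mapping[f]: v for f, v in record.items() if f in mapping}
    unmapped = sorted(f for f in record if f not in mapping)
    return mapped, unmapped


mapping = load_mapping(MAPPING_CSV)
legacy_record = {"EquipNo": "V-101", "InspDate": "2019-03-01", "Notes": "see file"}
mapped, unmapped = apply_mapping(legacy_record, mapping)
print(mapped)
print(unmapped)
```

Reporting the unmapped fields, rather than silently dropping them, keeps the decision about what to leave behind with the client.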
DESCRIPTION
Ignore
These are often low-level data issues that don't cause a problem.
It's likely that the majority of issues will fall into this category. In
some cases you may be content to carry the issue into the new
system because you have bigger issues to resolve.
Fix in flight
In this case you will perform some kind of transformation on the
data to correct it. For example, convert lookup codes to upper
case. This activity is carried out as the data is loaded into the
target.
Fix in source
In this case you may fix the data in the source data system. This
may be the simplest solution.
Fix in target
In this case you may fix the data after it has been loaded into the
target system. However, if the data issues break validation rules in
the target, you may have to temporarily disable the validation rules
to allow the data to pass into the target. This is perhaps the riskiest
option and in most cases would be unnecessary.
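The upper-case example given for fixing in flight can be sketched as a small transformation applied to each record as it is loaded. The function name and field names below are hypothetical.

```python
def fix_in_flight(record, lookup_fields):
    """Return a copy of the record with the named lookup codes upper-cased."""
    fixed = dict(record)
    for field in lookup_fields:
        value = fixed.get(field)
        if isinstance(value, str):
            fixed[field] = value.upper()
    return fixed


# Hypothetical legacy record with a lower-case lookup code.
legacy_record = {"asset_id": "V-101", "condition": "good"}
loaded_record = fix_in_flight(legacy_record, ["condition"])
print(loaded_record)
```

The source record is left untouched, so the legacy system can stay in use while the corrected copy goes to the target.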
Big Bang
All the data is moved in one go. This is the most common approach.
Phased
The data is moved in separate parts, perhaps by business or geographic location.
Parallel
The data is moved to the target; however, the legacy data continues to be used and
changes are synchronised to the target.
8. Load
A migration controller, often known as the extract, transform and load (ETL) tool, is used to load
the data into the target system. The vendor may use an in-house or a third-party tool. These tools
often do more than just these three tasks, and the following are key features you should expect:
Read the data from the legacy data store
Validate the data
Reformat the data and merge from multiple sources
Start and stop the process
Write the data to the target database
Manage data errors
Provide reports on execution
Provide an audit trail
Synchronise, so that if the source data must remain in use during the migration,
changes in the source can be applied to the target
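Several of the features above (read, validate, reformat, write, manage errors, report and audit) can be illustrated with a minimal sketch of a migration controller loop. This is not how any particular ETL tool works; the class, the functions, the single validation rule and the field names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class MigrationRun:
    """Collects the outcome of one run for reporting and audit."""
    loaded: list = field(default_factory=list)
    errors: list = field(default_factory=list)
    audit: list = field(default_factory=list)


def validate(record):
    """Return a list of problems; an empty list means the record is valid."""
    problems = []
    if not record.get("asset_id"):
        problems.append("missing asset_id")
    return problems


def transform(record):
    """Reformat the record for the target (here: upper-case the condition code)."""
    out = dict(record)
    out["condition"] = (out.get("condition") or "").upper()
    return out


def run_migration(source_rows, write):
    """Read each source row, validate, transform and write it, logging as we go."""
    run = MigrationRun()
    for index, record in enumerate(source_rows, start=1):
        problems = validate(record)
        if problems:
            # Rejected rows are recorded rather than stopping the whole run.
            run.errors.append((index, problems))
            run.audit.append(f"row {index}: rejected ({'; '.join(problems)})")
            continue
        write(transform(record))
        run.loaded.append(index)
        run.audit.append(f"row {index}: loaded")
    return run


# Hypothetical source rows; a plain list stands in for the target database.
target = []
source = [
    {"asset_id": "V-101", "condition": "good"},
    {"asset_id": "", "condition": "poor"},
]
run = run_migration(source, target.append)
print(f"loaded={len(run.loaded)} rejected={len(run.errors)}")
for line in run.audit:
    print(line)
```

Keeping rejected rows and the audit trail in the run result means the execution report can be handed back to the inspection team, who decide how each rejected record should be handled.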