Ab Initioã«ã¯ãç¬èªã®PDLã§æ¡åŒµã§ããå€ãã®å€å žçã§çããå€æããããŸããäžå°äŒæ¥ã®å Žåããã®ãããªåŒ·åãªããŒã«ã¯åé·ã§ããå¯èœæ§ãé«ãããã®æ©èœã®ã»ãšãã©ã¯é«äŸ¡ã§äžèŠãªå ŽåããããŸããããããèŠæš¡ãSberbankã®èŠæš¡ã«è¿ãå ŽåãAb Initioã¯èå³æ·±ããããããŸããã
ããã¯ãããžãã¹ãã°ããŒãã«ã«ç¥èãèç©ãããšã³ã·ã¹ãã ãéçºããã®ã«åœ¹ç«ã¡ãéçºè -ETLã§ã¹ãã«ãåŒãåºããç¥èãã·ã§ã«ã«åŒãäžããPDLèšèªãç¿åŸããæ©äŒãæäŸããããŒãããã»ã¹ã®èŠèŠçãªå³ãæäŸããè±å¯ãªæ©èœã³ã³ããŒãã³ãã«ããéçºãç°¡çŽ åããŸãã
ãã®æçš¿ã§ã¯ãAb Initioã®æ©èœã«ã€ããŠèª¬æããHiveãGreenPlumãšã®æ¯èŒã«ã€ããŠèª¬æããŸãã
- MDW GreenPlum
- Ab Initio Hive GreenPlum
- Ab Initio GreenPlum Near Real Time
ãã®è£œåã®æ©èœã¯éåžžã«å¹ åºããç¿åŸã«ã¯å€ãã®æéãããããŸãããã ããé©åãªäœæ¥ã¹ãã«ãšé©åãªããã©ãŒãã³ã¹èšå®ãããã°ãããŒã¿åŠççµæã¯éåžžã«å°è±¡çã§ããéçºè ã«Ab Initioã䜿çšãããšã圌ã«èå³æ·±ãäœéšãäžããããšãã§ããŸããããã¯ETLéçºã®æ°ããèŠæ¹ã§ãããããžã¥ã¢ã«ç°å¢ãšã¹ã¯ãªããã®ãããªèšèªã§ã®ããŠã³ããŒãéçºã®ãã€ããªããã§ãã
ããžãã¹ã¯ãã®ãšã³ã·ã¹ãã ãéçºãããã®ããŒã«ã¯ãããŸã§ä»¥äžã«éå®ããŠããŸããAb Initioã®å©ããåããŠãçŸåšã®ããžãã¹ã«é¢ããç¥èãèç©ãããã®ç¥èã䜿çšããŠãå€ãããžãã¹ãæ°ããããžãã¹ãæ¡å€§ããããšãã§ããŸããAb Initioã®ä»£æ¿ã¯ãããžã¥ã¢ã«éçºç°å¢Informatica BDMããã³éããžã¥ã¢ã«ç°å¢-Apache SparkããåŒã³åºãããšãã§ããŸãã
Ab Initioã®èª¬æ
Ab Initioã¯ãä»ã®ETLããŒã«ãšåæ§ã«ã補åã®ã¹ã€ãŒãã§ãã
Ab Initio GDEïŒã°ã©ãã£ã«ã«éçºç°å¢ïŒã¯ãããŒã¿å€æãèšå®ããããããç¢å°ã®åœ¢ã§ããŒã¿ã¹ããªãŒã ã«æ¥ç¶ããéçºè åãã®ç°å¢ã§ãããã®å Žåããã®ãããªäžé£ã®å€æã¯ã°ã©ããšåŒã°ããŸãã
æ©èœã³ã³ããŒãã³ãã®å ¥åããã³åºåæ¥ç¶ã¯ããŒãã§ãããå€æå ã§èšç®ããããã£ãŒã«ããå«ã¿ãŸããå®è¡é ã«ç¢å°ã®åœ¢ã§ã¹ããªãŒã ã«ãã£ãŠæ¥ç¶ãããããã€ãã®ã°ã©ãã¯ããã©ã³ãšåŒã°ããŸãã
æ°çŸã®æ©èœã³ã³ããŒãã³ãããããããã¯ãããããããŸãããããã®å€ãã¯é«åºŠã«å°éåãããŠããŸãã Ab Initioã«ã¯ãä»ã®ETLããŒã«ãããå¹ åºãåŸæ¥ã®å€æããããŸããããšãã°ãçµåã«ã¯è€æ°ã®åºåããããŸããããŒã¿ã»ãããæ¥ç¶ããçµæã«å ããŠãããŒãæ¥ç¶ã§ããªãã£ãå ¥åããŒã¿ã»ããã®ã¬ã³ãŒãã®åºåãååŸã§ããŸããæåŠããšã©ãŒãããã³å€ææäœã®ãã°ãååŸããããšãã§ããŸãããããã¯ãããã¹ããã¡ã€ã«ãšåãåã§èªã¿åããä»ã®å€æã§åŠç
ã§ããŸããããšãã°ãããŒã¿ã¬ã·ãŒããŒãããŒãã«ã®åœ¢åŒã§å ·äœåããããããåãåã§ããŒã¿ãèªã¿åãããšãã§ããŸãã
ãªãªãžãã«ã®å€å®¹ããããŸããããšãã°ãã¹ãã£ã³å€æã«ã¯ãåæé¢æ°ãšåãæ©èœããããŸããããŒã¿ã®äœæãExcelã®èªã¿åããæ£èŠåãã°ã«ãŒãå ã§ã®äžŠã¹æ¿ããããã°ã©ã ã®å®è¡ãSQLã®å®è¡ãDBãšã®çµåãªã©ãããããããååã®å€æããããŸããã°ã©ãã¯ããªãã¬ãŒãã£ã³ã°ã·ã¹ãã ãŸãã¯ãªãã¬ãŒãã£ã³ã°ã·ã¹ãã ãžã®ãã©ã¡ãŒã¿ãŒã®è»¢éãå«ãã©ã³ã¿ã€ã ãã©ã¡ãŒã¿ãŒã䜿çšã§ããŸãã ...ã°ã©ãã«æž¡ãããæ¢è£œã®ãã©ã¡ãŒã¿ãŒã»ãããå«ããã¡ã€ã«ã¯ããã©ã¡ãŒã¿ãŒã»ããïŒpsetsïŒãšåŒã°ããŸãã
äºæ³éããAb Initio GDEã«ã¯EMEïŒEnterprise Meta EnvironmentïŒãšåŒã°ããç¬èªã®ãªããžããªããããŸããéçºè ã¯ãããŒã«ã«ããŒãžã§ã³ã®ã³ãŒãã䜿çšããŠãéçºå 容ãäžå€®ãªããžããªã«ãã§ãã¯ã€ã³ããããšãã§ããŸãã
å®è¡äžãŸãã¯ã°ã©ãã®å®è¡åŸã«ãå€æãæ¥ç¶ããã¹ããªãŒã ãã¯ãªãã¯ããŠããããã®å€æéã§æž¡ãããããŒã¿ã確èªãã
ããšãã§ããŸããã¹ããªãŒã ãã¯ãªãã¯ããŠã远跡ã®è©³çŽ°ã確èªããããšãã§ããŸãããã©ã¬ã«ãããŒããããŸãïŒ
ã°ã©ãã®å®è¡ããã§ãŒãºã«åå²ããæåã«ïŒãã§ãŒãº0ã§ïŒæåã®ãã§ãŒãºã«ç¶ããŠã2çªç®ã®ãã§ãŒãºã«ç¶ããŠãããã€ãã®å€æãå®è¡ããå¿ èŠãããããšãããŒã¯ããããšãã§ããŸãã
å€æããšã«ãããããã¬ã€ã¢ãŠãïŒãããå®è¡ãããå ŽæïŒãéžæã§ããŸãããã©ã¬ã«ãªããŸãã¯ãã©ã¬ã«ã¹ã¬ããå ã§ããã®æ°ãèšå®ã§ããŸããåæã«ãå€æäœæ¥äžã«Ab Initioã«ãã£ãŠäœæãããäžæãã¡ã€ã«ã¯ããµãŒããŒãã¡ã€ã«ã·ã¹ãã ãšHDFSã®äž¡æ¹ã«é 眮ã§ããŸãã
åå€æã§ã¯ãããã©ã«ãã®ãã³ãã¬ãŒãã«åºã¥ããŠãã·ã§ã«ã®ãããªPDLèšèªã§ç¬èªã®ã¹ã¯ãªãããäœæã§ããŸãã
PDLèšèªã䜿çšãããšãå€æã®æ©èœãæ¡åŒµã§ããç¹ã«ãã©ã³ã¿ã€ã ãã©ã¡ãŒã¿ã«å¿ããŠåçã«ïŒå®è¡æã«ïŒä»»æã®ã³ãŒããã©ã°ã¡ã³ããçæã§ããŸãã
ãŸããAb Initioã¯ãã·ã§ã«ãä»ããŠOSãšååã«çµ±åãããŠããŸããå ·äœçã«ã¯ãSberbankã¯linux kshã䜿çšããŸããå€æ°ãã·ã§ã«ãšäº€æããŠãã°ã©ããã©ã¡ãŒã¿ãŒãšããŠäœ¿çšã§ããŸããã·ã§ã«ããAb Initioã°ã©ãã®å®è¡ãåŒã³åºããŠãAb Initioã管çã§ããŸãã
Ab Initio GDEã«å ããŠãé ä¿¡ã«ã¯ä»ã®å€ãã®è£œåãå«ãŸããŠããŸãããªãã¬ãŒãã£ã³ã°ã·ã¹ãã ãšåŒã°ããŠããCo>æäœã·ã¹ãã ããããŸããã³ã³ãããŒã«>ã»ã³ã¿ãŒããããããŠã³ããŒãã¹ããªãŒã ãã¹ã±ãžã¥ãŒã«ããã³ç£èŠã§ããŸããAb Initio GDEãèš±å¯ãããããããªããã£ãã¬ãã«ã§éçºãè¡ãããã®è£œåããããŸãã
MDWãã¬ãŒã ã¯ãŒã¯ã®èª¬æãšGreenPlumã®ã«ã¹ã¿ãã€ãºã«é¢ããäœæ¥
ãã³ããŒã¯ã補åãšäžç·ã«ã補åMDWïŒã¡ã¿ããŒã¿ããªãã³ãŠã§ã¢ããŠã¹ïŒãæäŸããŸããMDWã¯ãããŒã¿ãŠã§ã¢ããŠã¹ãŸãã¯ããŒã¿ã³ã³ãããŒã«ããŒã¿ãå ¥åããäžè¬çãªã¿ã¹ã¯ãæ¯æŽããããã«èšèšãããã°ã©ãã³ã³ãã£ã®ã¥ã¬ãŒã¿ãŒã§ãã
ã«ã¹ã¿ã ïŒãããžã§ã¯ãåºæã®ïŒã¡ã¿ããŒã¿ããŒãµãŒãšããã«äœ¿çšã§ããã³ãŒããžã§ãã¬ãŒã¿ãŒãå«ãŸããŠããŸãã
å ¥ãå£ã§ãMDWã¯ããŒã¿ã¢ãã«ãããŒã¿ããŒã¹æ¥ç¶ãã»ããã¢ããããããã®æ§æãã¡ã€ã«ïŒOracleãTeradataããŸãã¯HiveïŒãšãã®ä»ã®èšå®ãåãåããŸããããšãã°ããããžã§ã¯ãåºæã®éšåã¯ã¢ãã«ãããŒã¿ããŒã¹ã«ãããã€ããŸãã補åã®ããã¯ã¹ã§å²ãŸããéšåã¯ãã¢ãã«ããŒãã«ã«ããŒã¿ãããŒããããšãã°ã©ããšæ§æãã¡ã€ã«ãçæããŸããããã«ããããšã³ãã£ãã£ã®æŽæ°ã«é¢ããåæåããã³å¢åäœæ¥ã®ããã€ãã®ã¢ãŒãã®ã°ã©ãïŒããã³psetïŒãäœæãããŸãã
Hiveããã³RDBMSã®å Žåãç°ãªãåæåããã³å¢åããŒã¿æŽæ°ã°ã©ããçæãããŸãã
Hiveã®å Žåãçä¿¡ãã«ã¿ããŒã¿ã¯ãæŽæ°åã«ããŒãã«å ã«ãã£ãããŒã¿ã«Ab Initio Joinã«ãã£ãŠçµåãããŸãã MDWã®ããŒã¿ããŒããŒïŒHiveãšRDBMSã®äž¡æ¹ïŒã¯ããã«ã¿ããæ°ããããŒã¿ãæ¿å ¥ããã ãã§ãªããäž»ããŒããã«ã¿ãåãåã£ãããŒã¿ã®æå¹æéãéããŸããããã«ãããŒã¿ã®å€æŽãããŠããªãéšåãæžãæããå¿ èŠããããŸãããã ããHiveã«ã¯åé€ãŸãã¯æŽæ°æäœããªãããããããè¡ãå¿ èŠããããŸãã
RDBMSã«ã¯å®éã®æŽæ°æ©èœããããããRDBMSã®å Žåãå¢åããŒã¿æŽæ°ã°ã©ãã¯ããæé©ã«èŠããŸãã
åä¿¡ãããã«ã¿ã¯ãããŒã¿ããŒã¹ã®ã¹ããŒãžã³ã°ããŒãã«ã«ããŒããããŸãããã®åŸããã«ã¿ã¯æŽæ°åã®ããŒãã«ã«ãã£ãããŒã¿ã«æ¥ç¶ãããŸãããããŠãããã¯ãçæãããSQLã¯ãšãªãéããŠSQLã«ãã£ãŠè¡ãããŸãã次ã«ãdelete + insert SQLã³ãã³ãã䜿çšããŠããã«ã¿ããã®æ°ããããŒã¿ãã¿ãŒã²ããããŒãã«ã«æ¿å ¥ãããããŒã¿ã®é¢é£æ§ã®æéãããã«ã¿ãåä¿¡ãããäž»ããŒã«åŸã£ãŠéããããŸãã
å€æŽãããŠããªãããŒã¿ãæžãæããå¿ èŠã¯ãããŸããã
ãããã£ãŠãHiveã«ã¯æŽæ°æ©èœããªããããHiveã®å ŽåãMDWã¯ããŒãã«å šäœãæžãæããå¿ èŠããããšããçµè«ã«éããŸããããããŠãæŽæ°ãçºæãããŠããªããšãã®ããŒã¿ã®å®å šãªæžãæãã«åããã®ã¯ãããŸãããéã«ãRDBMSã®å Žåã補åã®äœæè ã¯SQLã䜿çšããããŒãã«ã®æ¥ç¶ãšæŽæ°ãå§èšããå¿ èŠããããšèããŸããã
Sberbankã®ãããžã§ã¯ãã§ã¯ãGreenPlumããŒã¿ããŒã¹ããŒããŒã®åå©çšå¯èœãªæ°ããå®è£ ãäœæããŸãããããã¯ãMDWãTeradataçšã«çæããããŒãžã§ã³ã«åºã¥ããŠè¡ãããŸãããããã«æãããè¿ã¥ããã®ã¯Oracleã§ã¯ãªãTeradataã§ãããMPPã·ã¹ãã ã§ããããŸããTeradataãšGreenPlumã®æ§æã ãã§ãªããäœæ¥æ¹æ³ãåæ§ã§ããããšãå€æããŸããã
ç°ãªãRDBMSéã®MDWã®éèŠãªéãã®äŸã¯æ¬¡ã®ãšããã§ããGreenPlumã§ã¯ãTeradataãšã¯ç°ãªããããŒãã«ãäœæãããšãã«å¥ãèšè¿°ããå¿ èŠããããŸã
distributed by
Teradataæžã蟌ã¿
delete <table> all
ããããŠGreenePlumã§åœŒãã¯æžããŸã
delete from <table>
Oracleã¯æé©åã®ããã«æžã蟌ã¿ãŸã
delete from t where rowid in (< t >)
ãTeradataãšGreenPlumã¯
delete from t where exists (select * from delta where delta.pk=t.pk)
ãŸããAb InitioãGreenPlumãšé£æºãããã«ã¯ãAb Initioã¯ã©ã¹ã¿ãŒã®ãã¹ãŠã®ããŒãã«GreenPlumã¯ã©ã€ã¢ã³ããã€ã³ã¹ããŒã«ããå¿ èŠããã£ãããšã«ã泚æããŠãã ãããããã¯ãã¯ã©ã¹ã¿ãŒå ã®ãã¹ãŠã®ããŒãããåæã«GreenPlumã«æ¥ç¶ããããã§ãããŸããGreenPlumããã®èªã¿åãã䞊åã«ããå䞊åAb Initioã¹ã¬ãããGreenPlumããããŒã¿ã®ç¬èªã®éšåãèªã¿åãããã«ã¯ãAb Initioãç解ããæ§æãSQLã¯ãšãªã®ãwhereãã»ã¯ã·ã§ã³ã«é 眮ããå¿ èŠããããŸããã
where ABLOCAL()
å€æããŒã¿ããŒã¹ããèªã¿åããã©ã¡ãŒã¿ãŒãæå®ããŠããã®æ§é ã®å€ã決å®ããŸã
ablocal_expr=«string_concat("mod(t.", string_filter_out("{$TABLE_KEY}","{}"), ",", (decimal(3))(number_of_partitions()),")=", (decimal(3))(this_partition()))»
ããã¯æ¬¡ã®ããã«ã³ã³ãã€ã«ãããŸã
mod(sk,10)=3
ãã€ãŸã GreenPlumã«åããŒãã£ã·ã§ã³ã®æ瀺çãªãã£ã«ã¿ãŒãäŒããå¿ èŠããããŸããä»ã®ããŒã¿ããŒã¹ïŒTeradataãOracleïŒã®å ŽåãAb Initioã¯ãã®äžŠååãèªåçã«å®è¡ã§ããŸãã
HiveãšGreenPlumã䜿çšããå Žåã®Ab Initioã®ããã©ãŒãã³ã¹ç¹æ§ã®æ¯èŒ
Sberbankã§å®éšãè¡ãããHiveãšGreenPlumãšã®é¢ä¿ã§MDWã«ãã£ãŠçæãããã°ã©ãã®ããã©ãŒãã³ã¹ãæ¯èŒããŸãããå®éšã®äžéšãšããŠãHiveã®å ŽåãAb Initioãšåãã¯ã©ã¹ã¿ãŒã«5ã€ã®ããŒãããããGreenPlumã®å Žåãå¥ã®ã¯ã©ã¹ã¿ãŒã«4ã€ã®ããŒãããããŸãããããããHiveã«ã¯ãGreenPlumãããããŒããŠã§ã¢ã®ç¹ã§åªããŠããç¹ãããã€ããããŸãã
HiveãšGreenPlumã§ããŒã¿ãæŽæ°ããåãã¿ã¹ã¯ãå®è¡ãã2çµã®ã°ã©ããèŠãŸãããMDWã³ã³ãã£ã®ã¥ã¬ãŒã¿ãŒã«ãã£ãŠçæãããã°ã©ããèµ·åãããŸããã
- åæåããŒã+ã©ã³ãã ã«çæãããããŒã¿ã®HiveããŒãã«ãžã®å¢åããŒã
- ããŒãã®åæå+ã©ã³ãã ã«çæãããããŒã¿ã®åãGreenPlumããŒãã«ãžã®å¢åããŒã
ã©ã¡ãã®å ŽåïŒHiveãšGreenPlumïŒã¯ãåãAb Initioã¯ã©ã¹ã¿ãŒäžã§10ã®äžŠåã¹ã¬ããã§ããŠã³ããŒããéå§ããŸãããAb Initioã¯ãèšç®çšã®äžéããŒã¿ãHDFSã§ä¿åããŸããïŒAb Initioã«é¢ããŠã¯ãHDFSã䜿çšããMFSã¬ã€ã¢ãŠãã䜿çšãããŸããïŒãã©ã³ãã ã«çæãããããŒã¿ã®1è¡ã¯ãã©ã¡ãã®å Žåã200ãã€ããå ããŠããŸããã
çµæã¯æ¬¡ã®ããã«ãªããŸãïŒ
HiveïŒ
Hiveã§ã®ããŒãã®åæå | |||
æ¿å ¥ãããè¡ | 6,000,000 | 60,000,000 | 600,000,000 |
è² è·ãåæåããæéïŒç§ïŒ |
41 | 203 | 1 601 |
Hiveã§ã®å¢åèªã¿èŸŒã¿ | |||
å®éšéå§æã®ã¿ãŒã²ããããŒãã«ã®è¡æ° |
6,000,000 | 60,000,000 | 600,000,000 |
å®éšäžã«ã¿ãŒã²ããããŒãã«ã«é©çšããããã«ã¿è¡ã®æ° |
6,000,000 | 6,000,000 | 6,000,000 |
ç§åäœã®å¢åããŠã³ããŒãæé |
88 | 299 | 2541 |
GreenPlum:
GreenPlum | |||
6 000 000 | 60 000 000 | 600 000 000 | |
|
72 | 360 | 3 631 |
GreenPlum | |||
,
|
6 000 000 | 60 000 000 | 600 000 000 |
,
|
6 000 000 | 6 000 000 | 6 000 000 |
|
159 | 199 | 321 |
HiveãšGreenPlumã®äž¡æ¹ã§ããŒãã®åæåã®é床ã¯ããŒã¿éã«çŽç·çã«äŸåããããŒããŠã§ã¢ãåªããŠãããããGreenPlumãããHiveã®æ¹ãããããé«éã§ãã
Hiveã®å¢åèªã¿èŸŒã¿ããã¿ãŒã²ããããŒãã«ã«ä»¥åã«èªã¿èŸŒãŸããããŒã¿ã®éã«çŽç·çã«äŸåãããã®éãå¢ããã«ã€ããŠé ããªããŸããããã¯ãã¿ãŒã²ããããŒãã«ãå®å šã«äžæžãããå¿ èŠãããããã§ããããã¯ã倧ããªããŒãã«ã«å°ããªå€æŽãé©çšããããšã¯ãHiveã®è¯ããŠãŒã¹ã±ãŒã¹ã§ã¯ãªãããšãæå³ããŸãã
GreenPlumã§ã®å¢åèªã¿èŸŒã¿ã¯ãã¿ãŒã²ããããŒãã«ã§å©çšå¯èœãªä»¥åã«èªã¿èŸŒãŸããããŒã¿ã®éã«åŒ±ãäŸåããéåžžã«é«éã§ããããã¯ãSQLçµåãšãåé€æäœãå¯èœã«ããGreenPlumã¢ãŒããã¯ãã£ã®ãããã§çºçããŸããã
ãããã£ãŠãGreenPlumã¯åé€+æ¿å ¥ã¡ãœããã䜿çšããŠãã«ã¿ãæ³šå ¥ããŸãããHiveã«ã¯åé€ãŸãã¯æŽæ°æäœããªããããå¢åæŽæ°äžã«ããŒã¿é åå šäœãå®å šã«æžãæããå¿ èŠããããŸãããæãç®ç«ã€ã®ã¯ã倪åã§åŒ·èª¿è¡šç€ºãããŠããã»ã«ã®æ¯èŒã§ããããã¯ããªãœãŒã¹ã倧éã«æ¶è²»ããããŠã³ããŒãã®æäœã§æãé »ç¹ã«çºçããããªãšãŒã·ã§ã³ã«å¯Ÿå¿ããŠããããã§ãããã®ãã¹ãã§ã¯ãGreenPlumãHiveã«8ååã£ãããšãããããŸãã
ã»ãŒãªã¢ã«ã¿ã€ã ã§ã®GreenPlumã«ããAb Initio
ãã®å®éšã§ã¯ãã©ã³ãã ã«çæãããããŒã¿ã®ãã£ã³ã¯ã§GreenPlumããŒãã«ãã»ãŒãªã¢ã«ã¿ã€ã ã§æŽæ°ããAb Initioã®æ©èœããã¹ãããŸããäœæ¥ããããŒãã«GreenPlum dev42_1_db_usl.TESTING_SUBJ_org_finvalã«ã€ããŠèããŸãã
3ã€ã®Ab Initioã°ã©ãã䜿çšããŠäœæ¥ããŸã
ã1ïŒCreate_test_data.mpã°ã©ã-HDFSã®ããŒã¿ã䜿çšããŠã10ã®äžŠåã¹ããªãŒã ã§6,000,000è¡ã®ãã¡ã€ã«ãäœæããŸããããŒã¿ã¯ã©ã³ãã ã§ããã®æ§é ã¯ããŒãã«ã«æ¿å ¥ããããã«ç·šæãããŠããŸã
2ïŒã°ã©ãmdw_load.day_one.current.dev42_1_db_usl_testing_subj_org_finval.pset-10ã®äžŠåã¹ã¬ããã§ããŒãã«ãžã®ããŒã¿æ¿å ¥ãåæåããããã«çæãããMDWã°ã©ãïŒã°ã©ãïŒ1ïŒã«ãã£ãŠçæããããã¹ãããŒã¿ã䜿çšãããŸãïŒ
3ïŒã°ã©ãmdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset-ã°ã©ãã«ãã£ãŠçæãããæ°ããçä¿¡ããŒã¿ïŒãã«ã¿ïŒã®äžéšã䜿çšããŠã10ã®äžŠåã¹ã¬ããã§ããŒãã«ãå¢åæŽæ°ããããã«çæãããMDWã°ã©ãïŒ1ïŒ
NRTã¢ãŒãã§æ¬¡ã®ã¹ã¯ãªãããå®è¡ããŸãã
- 6,000,000ã®ãã¹ãã©ã€ã³ãçæãã
- ããŒããåæåããŠã6,000,000ãã¹ãè¡ã空ã®ããŒãã«ã«æ¿å ¥ããŸãã
- å¢åããŠã³ããŒãã5åç¹°ãè¿ããŸã
- 6,000,000ã®ãã¹ãã©ã€ã³ãçæãã
- ããŒãã«ã«6,000,000ãã¹ãè¡ã®å¢åæ¿å ¥ãäœæããŸãïŒãã®å Žåãå€ãããŒã¿ã«ã¯æå¹æévalid_to_tsãã¹ã¿ã³ããããåãäž»ããŒãæã€ããæ°ããããŒã¿ãæ¿å ¥ãããŸãïŒã
ãã®ãããªã·ããªãªã¯ãç¹å®ã®ããžãã¹ã·ã¹ãã ã®å®éã®éçšã¢ãŒãããšãã¥ã¬ãŒãããŸããæ°ããããŒã¿ã®ããªãã®éšåããªã¢ã«ã¿ã€ã ã§è¡šç€ºãããããã«GreenPlumã«æµã蟌ã¿ãŸãã
ã¹ã¯ãªããã®ãã°ãèŠãŠã¿ãŸãããïŒ2020-06-04 11:49:11ã«Create_test_data.input.psetãéå§ããŸãã2020-06-0411 : 49 :
37ã«Create_test_data.input.psetã
çµäºã
ãŸãã 2020幎6æ4æ¥11æ49åäžåäžç§ã§
2020幎6æ4æ¥11æ50å42ç§ã§ãã£ããã·ã¥mdw_load.day_one.current.dev42_1_db_usl_testing_subj_org_finval.pset
2020幎6æ4æ¥11æ50å42ç§ã§ã¹ã¿ãŒãCreate_test_data.input.pset
ãã£ããã·ã¥Create_test_data.input.pset at 2020-06-04 11:51:06
Start mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 11:51:06
Finish mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 11:53:41
Start Create_test_data.input.pset at 2020-06-04 11:53:41
Finish Create_test_data.input.pset at 2020-06-04 11:54:04
Start mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 11:54:04
Finish mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 11:56:51
Start Create_test_data.input.pset at 2020-06-04 11:56:51
Finish Create_test_data.input.pset at 2020-06-04 11:57:14
Start mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 11:57:14
Finish mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 11:59:55
Create_test_data.input.psetã2020-06-04ã«éå§11:59:55 Create_test_data.input.psetã2020-06-04ã«
çµäº12:00:23 mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.psetã2020-06-04ã«
éå§12:00:23
çµäºmdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.pset at 2020-06-04 12:03:23
Start_test_data.input.pset at 2020-06-04 12:03:23
Finish Create_test_data.input.pset at 2020-06-04 12:03:49 2020-06-04 12:03:49ã«mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.psetã
éå§ãã2020-06-04 12:03:49ã«mdw_load.regular.current.dev42_1_db_usl_testing_subj_org_finval.psetã
çµäºããïŒ46
ç»åã¯æ¬¡ã®ããã«ãªããŸãã
ã°ã©ã | å§ãŸãæé | çµäºæå» | é·ã |
---|---|---|---|
Create_test_data.input.pset | 2020幎6æ4æ¥11:49:11 | 2020幎6æ4æ¥11:49:37 | 00:00:26 |
mdw_load.day_one.currentã
dev42_1_db_usl_testing_subj_org_finval.pset |
2020幎6æ4æ¥11:49:37 | 2020幎6æ4æ¥11:50:42 | 00:01:05 |
Create_test_data.input.pset | 2020幎6æ4æ¥11:50:42 | 2020幎6æ4æ¥11:51:06 | 00:00:24 |
mdw_load.regular.currentã
dev42_1_db_usl_testing_subj_org_finval.pset |
2020幎6æ4æ¥11:51:06 | 2020幎6æ4æ¥11:53:41 | 00:02:35 |
Create_test_data.input.pset | 2020幎6æ4æ¥11:53:41 | 2020幎6æ4æ¥11:54:04 | 00:00:23 |
mdw_load.regular.currentã
dev42_1_db_usl_testing_subj_org_finval.pset |
2020幎6æ4æ¥11:54:04 | 2020幎6æ4æ¥11:56:51 | 00:02:47 |
Create_test_data.input.pset | 2020幎6æ4æ¥11:56:51 | 2020幎6æ4æ¥11:57:14 | 00:00:23 |
mdw_load.regular.currentã
dev42_1_db_usl_testing_subj_org_finval.pset |
2020幎6æ4æ¥11:57:14 | 2020幎6æ4æ¥11:59:55 | 00:02:41 |
Create_test_data.input.pset | 2020幎6æ4æ¥11:59:55 | 2020幎6æ4æ¥12:00:23 | 00:00:28 |
mdw_load.regular.currentã
dev42_1_db_usl_testing_subj_org_finval.pset |
2020幎6æ4æ¥12:00:23 | 2020幎6æ4æ¥12:03:23 PM | 00:03:00 |
Create_test_data.input.pset | 2020幎6æ4æ¥12:03:23 PM | 2020幎6æ4æ¥ååŸ12:03:49 | 00:00:26 |
mdw_load.regular.currentã
dev42_1_db_usl_testing_subj_org_finval.pset |
2020幎6æ4æ¥ååŸ12:03:49 | 2020幎6æ4æ¥12:06:46 PM | 00:02:57 |
6,000,000ã®å¢åã©ã€ã³ã3åã§åŠçãããããšãããããŸããããã¯éåžžã«é«éã§ãã
ã¿ãŒã²ããããŒãã«ã®ããŒã¿ã¯ã次ã®ããã«åæ£ãããããšãããããŸããã
select valid_from_ts, valid_to_ts, count(1), min(sk), max(sk) from dev42_1_db_usl.TESTING_SUBJ_org_finval group by valid_from_ts, valid_to_ts order by 1,2;
æ¿å ¥ãããããŒã¿ãšã°ã©ãèµ·åã®ç¬éãšã®å¯Ÿå¿ã確èªã§ããŸãã
ããã¯ãéåžžã«é«ãé »åºŠã§Ab Initioã®GreenPlumãžã®ããŒã¿ã®å¢åèªã¿èŸŒã¿ãéå§ãããã®ããŒã¿ãGreenPlumã«æ¿å ¥ããé«éã芳å¯ã§ããããšãæå³ããŸãããã¡ãããAb Initioã¯ãä»ã®ETLããŒã«ãšåæ§ã«ãèµ·åæã«ãã¹ã€ã³ã°ãããã®ã«æéããããããã1ç§ã«1åã¯èµ·åã§ããŸããã
çµè«
çŸåšãAb Initioã¯Sberbankã§çµ±åã»ãã³ãã£ãã¯ããŒã¿ã¬ã€ã€ãŒïŒESSïŒãæ§ç¯ããããã«äœ¿çšãããŠããŸãããã®ãããžã§ã¯ãã«ã¯ãããŸããŸãªéè¡æ¥åãšã³ãã£ãã£ã®ç¶æ ã®åäžããŒãžã§ã³ã®æ§ç¯ãå«ãŸããŸããæ å ±ã¯ããŸããŸãªãœãŒã¹ããååŸããããã®ã¬ããªã«ã¯Hadoopã§äœæãããŸããããžãã¹ã®ããŒãºã«åºã¥ããŠãããŒã¿ã¢ãã«ãäœæãããããŒã¿å€æãèšè¿°ãããŸãã Ab Initioã¯ECCã«æ å ±ãã¢ããããŒãããŸããèªã¿èŸŒãŸããããŒã¿ã¯ããžãã¹èªäœã«é¢å¿ãããã ãã§ãªããããŒã¿ããŒããæ§ç¯ããããã®ãœãŒã¹ãšããŠãæ©èœããŸããåæã«ããã®è£œåã®æ©èœã«ãããããŸããŸãªã·ã¹ãã ïŒHiveãGreenplumãTeradataãOracleïŒãã¬ã·ãŒããŒãšããŠäœ¿çšã§ãããããããžãã¹ã«å¿ èŠãªããŸããŸãªåœ¢åŒã®ããŒã¿ãç°¡åã«æºåã§ããŸãã
Ab Initioã®æ©èœã¯å¹ åºããããšãã°ãå«ãŸããŠããMDWãã¬ãŒã ã¯ãŒã¯ã«ãããæè¡çããã³ããžãã¹ã®å±¥æŽããŒã¿ãããã«æ§ç¯ã§ããŸããéçºè ã«ãšã£ãŠãAb Initioã¯ãè»èŒªãåçºæããªããæ©äŒãæäŸããŸãããå®éã«ã¯ããŒã¿ãæäœãããšãã«å¿ èŠãªã©ã€ãã©ãªãŒã§ãããå©çšå¯èœãªæ©èœã³ã³ããŒãã³ãã®å€ãã䜿çšããŸãã
èè ã¯ãSberbankãããã§ãã·ã§ãã«ã³ãã¥ããã£SberProfi DWH / BigDataã®ãšãã¹ããŒãã§ãããããã§ãã·ã§ãã«ã³ãã¥ããã£SberProfi DWH / BigDataã¯ãHadoopãšã³ã·ã¹ãã ãTeradataãOracle DBãGreenPlumãBIããŒã«QlikãSAP BOãTableauãªã©ã®åéã§ã®èœåéçºãæ åœããŠããŸãã