Data Warehouse Optimization with Apache Hadoop
For the past few years, we have heard a lot about the benefits of augmenting the Enterprise Data Warehouse with Hadoop. The Data Warehouse vendors as well as the Hadoop vendors are showcasing how Hadoop can handle unstructured data while the EDW will continue to remain as the central source in an enterprise.
The Enterprise Data Warehouse (EDW) is a standard component of a corporate data architecture because it provides valuable business insights and powerful decision analytics for front-line workers, executives, business analysts, data scientists, and software developers. The Enterprise Data Warehouse built using Teradata, Oracle, DB2 or other DBMS is undergoing a revolutionary change. As the sources of data become rich and diverse, storing them in a traditional EDW is not the optimal solution. Big data technologies such as Apache Hadoop excel at managing large volumes of unstructured data and are coming into mainstream use, by integrating with existing legacy Data Warehouse platforms to get the best of both worlds.