As the data lake market matures, the differences between data lakes and data warehouses become clearer. Data lakes are architected for scalability, and thus store and process data quite differently from earlier-generation data warehouses.
Arcadia Data, in contrast, built its technology to run natively on data lakes, in particular those that run on Hadoop. The Hadoop Distributed File System (HDFS) is distributed and horizontally scalable, and Arcadia Data runs on each HDFS node.
This distributed architecture gives Arcadia Data markedly better performance than traditional analytics and BI tools. It also brings the processing to the data, which addresses the data gravity problem: large data sets are slow and costly to move, so analysis is more efficient when it runs where the data already resides.
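To make the idea of bringing the processing to the data concrete, here is a minimal Python sketch. It is a toy illustration only, not Arcadia Data's implementation or any Hadoop API: the node names, the event rows, and the functions `local_aggregate`, `merge`, `query_in_place`, and `query_after_extract` are all hypothetical. It compares how many records would have to cross the network when each node summarizes its own data versus when every raw row is first extracted to a central tool.

```python
from collections import Counter

# Hypothetical data: each list stands in for the rows stored on one node.
NODE_BLOCKS = {
    "node-1": ["click"] * 400 + ["view"] * 100,
    "node-2": ["view"] * 300 + ["purchase"] * 50,
    "node-3": ["click"] * 200 + ["purchase"] * 25,
}


def local_aggregate(rows):
    """Runs where the rows live and returns a small summary, not the rows."""
    return Counter(rows)


def merge(partials):
    """Combines the per-node summaries into one final result."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def query_in_place(blocks):
    """Processing goes to the data: only per-node summaries are moved."""
    partials = [local_aggregate(rows) for rows in blocks.values()]
    records_moved = sum(len(partial) for partial in partials)
    return merge(partials), records_moved


def query_after_extract(blocks):
    """Data goes to the processing: every raw row is moved first."""
    moved_rows = [row for rows in blocks.values() for row in rows]
    return Counter(moved_rows), len(moved_rows)


in_place_result, in_place_moved = query_in_place(NODE_BLOCKS)
extract_result, extract_moved = query_after_extract(NODE_BLOCKS)

print("same answer:", in_place_result == extract_result)
print("records moved in place:", in_place_moved)
print("records moved after extract:", extract_moved)
```

Both approaches return the same counts, but in this toy data set the in-place query moves 6 summary entries while the extract-first query moves all 1,075 rows. The gap widens as the data grows, which is the intuition behind data gravity.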
Copyright © Intellyx LLC. Intellyx publishes the Agile Digital Transformation Roadmap poster, advises companies on their digital transformation initiatives, and helps vendors communicate their agility stories. As of the time of writing, none of the organizations mentioned in this article are Intellyx customers.