Time To Dive Into the Data Lake: Dell EMC


Posted on Wednesday, October 30, 2019

Ever since the analytics boom, data lakes have been a hot topic for CTOs. Consolidating disparate data stores into a single giant resource offers several benefits, as Sai Devulapalli, global head of data analytics solutions and go-to-market in the Unstructured Data Solutions Unit at Dell EMC, outlined recently in CIO Magazine.

According to Devulapalli, there are four advantages to breaking out of traditional data silos.

1. A reduction in data movement

Normally, data is placed into various intermediate staging areas for consolidation and cleansing before analysis begins. By carrying out these functions as soon as information enters the lake, you can eliminate the intermediate stages and reduce demand for temporary storage capacity.

2. Simplified data governance

Auditing and governance are major headaches when data is spread across multiple stores, particularly where duplicate information exists. A single lake repository means your governance team has only one information store to work with.

3. Improved analytics testing

AI and analytics testing typically relies on synthesized data in sandboxes, so the outcome of these tests is only as good as the synthesized data sets. Providing read-only access to the data lake allows developers to test using ‘real’ data, shortening the development cycle and improving the quality of output.

4. Increased AI analytics flexibility

Loading data into AI platforms duplicates information and increases demand for capacity. Using the data lake model, you can decouple your data from the AI/analytics tools that consume it, leaving you free to change applications as newer, more powerful alternatives become available.

Building for the future, today

As you would expect, Dell EMC’s data lake advice is intended to sell additional hardware. And for demanding AI applications, the additional investment may make sense.

But thanks to the vendor-agnostic concepts behind data lakes, there is nothing to stop your business redeploying existing disk arrays, even those that are EoSL (end of service life) or out of warranty. All that matters is having the necessary capacity to pool your data sets centrally.

You can expand your lake as required, and there’s nothing to stop you realizing these four benefits almost immediately.

To learn more about redeploying older Dell EMC hardware, and about obtaining maintenance and support for post-warranty systems, please give us a call.
