Time To Dive Into the Data Lake: Dell EMC
Posted on Wednesday, October 30, 2019
Ever since the boom in analytics, data lakes have become a hot topic for CTOs. Consolidating disparate data stores into a single giant resource offers several benefits as Sai Devulapalli, global head of data analytics solutions and go-to-market in the Unstructured Data Solutions Unit at Dell EMC, outlined in CIO Magazine recently.
According to Devulapalli, there are four advantages to breaking out of traditional data silos.
1. A reduction in data movement
Normally data is placed into various intermediate staging areas for consolidation and cleansing before analysis begins. By carrying out these functions as soon as information enters the lake, you can eliminate any intermediate stages and reduce demand for temporary storage capacity.
2. Simplified data governance
Auditing and governance are major headaches when data is spread across multiple stores, particularly where duplicate information exists. A single lake repository means your governance team has only one information store to work with.
3. Improved analytics testing
AI and analytics testing typically relies on synthesized data in sandboxes; the outcome of these tests is only as good as the synthesized data sets. Providing read-only access to the data lake allows developers to test using ‘real’ data, shortening the development cycle and improving the quality of output.
4. Increased AI analytics flexibility
Loading data into AI platforms duplicates information and increases demand for capacity. Using the data lake model, you can abstract data from AI/analytics tools, leaving you free to change applications as newer, more powerful alternatives become available.
Building for the future, today
As you would expect, Dell EMC’s data lake advice is intended to sell additional hardware. And for demanding AI applications, the additional investment may make sense.
But thanks to the vendor-agnostic concepts behind data lakes there is nothing to stop your business redeploying existing disk arrays– even those that are EoSL or out of warranty. All that matters is having the necessary capacity to pool your data sets centrally.
You can expand your lake as required, and there’s nothing to stop you realizing these four benefits almost immediately.
To learn more about redeploying older Dell EMC hardware– and obtaining maintenance and support for post-warranty systems– please give us a call.
DNA Storage Has a New Molecule-Sized Competitor
Smaller than a DNA strand, molecular storage could be about to change the future of data retention once again.