China Cloud-Native Data Lake Application Insight White Paper

Concept definition: A data lake is an innovative solution for big data scenarios whose architecture differs from that of the traditional data warehouse. Its defining characteristics are multi-source heterogeneous data, unified storage management, multi-paradigm computing, schema-on-read ("schema postposition", meaning the schema is applied after the data is stored), and a wide range of applications. Cloud native is the inevitable future form of data lake deployment. Its core value lies in establishing unified data assets, using infrastructure resources at low cost, delivering a higher-performance computing experience, and enabling agile innovation.
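The "schema postposition" characteristic above is commonly called schema-on-read: raw data is stored as it arrives, and each consumer applies its own schema when reading. The following is a minimal sketch of that idea, not a description of any vendor's product. The names RAW_STORE, ORDER_SCHEMA and read_with_schema are hypothetical, and an in-memory list stands in for the lake's storage layer.

```python
import json

# Raw records from heterogeneous sources, stored as-is.
# No schema is enforced at write time, so types and fields differ per record.
RAW_STORE = [
    '{"user_id": "42", "amount": "19.90", "channel": "app"}',
    '{"user_id": 7, "amount": 5, "extra": {"coupon": true}}',
]

# Hypothetical schema chosen by one consumer at read time:
# field name -> function that casts the raw value to the wanted type.
ORDER_SCHEMA = {"user_id": int, "amount": float}


def read_with_schema(raw_lines, schema):
    """Parse raw JSON lines and project them onto the given schema."""
    rows = []
    for line in raw_lines:
        record = json.loads(line)
        # Keep only the fields the schema names, casting each one.
        rows.append({field: cast(record[field]) for field, cast in schema.items()})
    return rows


orders = read_with_schema(RAW_STORE, ORDER_SCHEMA)
print(orders)
```

A different consumer could read the same raw records with a different schema, which is the flexibility that distinguishes this approach from a warehouse that fixes the schema before loading.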

Market status: data transformation, enterprise digital transformation, investment and financing, and policy support will continue to accelerate demand for cloud-native data lakes. In 2020, the cloud-native data lake market (including the ecosystem) reached 12.4 billion, and it is expected to expand rapidly at a compound annual growth rate of 39.7% over the next three years.
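To make the growth figure concrete, the sketch below compounds the 2020 base of 12.4 billion at 39.7% for three years. It assumes the stated rate is an annual rate applied to the 2020 figure. The yearly values it produces are illustrative arithmetic, not figures quoted from the white paper, and the currency unit is left unspecified as in the text. The function name project_market_size is hypothetical.

```python
def project_market_size(base: float, cagr: float, years: int) -> list[float]:
    """Return projected sizes for years 1..years, compounding once per year."""
    return [base * (1 + cagr) ** n for n in range(1, years + 1)]


BASE_2020 = 12.4  # billion, from the text (currency unit not stated there)
CAGR = 0.397      # 39.7%, assumed to be an annual compound rate

projection = project_market_size(BASE_2020, CAGR, 3)
for offset, size in enumerate(projection, start=1):
    print(f"Year 2020+{offset}: {size:.1f} billion")
```

Under these assumptions the market would be roughly 2.7 times its 2020 size after three years, since 1.397 cubed is about 2.73.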

Competitive landscape: China's cloud-native data lake market is still at an early stage of development. Few independent vendors can provide end-to-end solutions, so the market is relatively concentrated and competition is mainly among the leading cloud vendors. By revenue, cloud vendors held 82.4% of China's cloud-native data lake market (excluding ecosystem support) in 2020.

Application status: at this stage, cloud-native data lakes are mainly used in the pan-Internet industry (40.7%) and in the Internet scenarios of traditional industries (pan-government affairs, finance, the industrial sector, healthcare, automotive, etc.). In the future, adoption will expand to more industries characterized by large data volumes and high-value data.

Selection recommendations: faced with diversified and rapidly iterating business needs, enterprises need to build a unified data foundation on the one hand and, on the other, pay attention to the openness, agility, and innovation of their DT capabilities. When selecting a cloud-native data lake, enterprises should consider not only an evaluation of internal capabilities but also the service provider's service radius and development path.

Trend outlook: against the backdrop of cloud native and big data, the cloud-native data lake has become a new-generation productivity tool that helps enterprises win in the future, and the market is about to enter a period of explosive growth. Although the data lake is a natural fit for cloud and big data (massive, flexible, simple, and agile), many practical problems remain to be solved when implementing it in specific business scenarios. Going forward, cloud-native data lake vendors need to work with developers, ISVs, and SIs to keep exploring in enterprise-grade production environments, so that a win-win ecosystem drives the steady improvement of cloud-native data lake solutions.
