Data virtualisation enables businesses to access, manage, incorporate, and aggregate data from different sources independent from its physical location or format in real-time.

As per The Data Management Association International (DAMA) and Data Management Body of Knowledge (DMBOK), “Data virtualisation enables distributed databases, as well as several heterogeneous data stores, to be accessed and viewed as a single database. Rather than physically performing ETL (Extract, Transform, and Load) on data with transformation engines, data virtualisation offers performing data extraction, transformation, and integration virtually.”

Why Data Virtualisation Evolved?

With the fast-changing business world, information has become an essential production factor. Data-driven decision-making is a tool to withstand the growing competition around global industries and markets. Exploiting the power of business intelligence (BI) or analytics and automating workflows is one way for businesses to generate new revenue while reducing costs by enhancing the efficiency of their daily processes’.

Today, enterprise data is preserved in different locations and comes in various, fast evolved forms like:

  • Social media or website data such as Facebook, Twitter or Google Analytics
  • Relational and non-relational databases such as MySQL, Amazon Redshift or MongoDB
  • CRM or ERP data such as SAP, Oracle or Microsoft Dynamics
  • Flat files such as XML, CSV or JSON
  • Cloud or software-as-a-service applications such as Netsuite, Salesforce or Mailchimp
  • Data lakes and enterprise data warehouses
  • Big data

Businesses have been facing increasing volumes of data accompanied by growing data variety and velocity. This often leads to challenges like obtaining trustworthy data quality, time efficiency in data management, and self-service capacities for data users. Conquering these challenges efficiently and effectively became critical for modern enterprises’ success.

Data virtualisation helps businesses to deal with these challenges using the full potential of their data. Data virtualisation’s primary concept is to break free from the requirement of knowing every technical detail of the data like its exact physical location or its root format. It enables integration and aggregation of data from disparate physical sources and diverse formats within one view without moving the data into central storage. As all data remains in the source systems, data virtualisation builds a virtual/logical layer to enable real-time accessibility with the possible manipulation and transformation of data in virtual views. This virtual layer permits data management that is simpler and more efficient. Data virtualisation tools usually make data accessible with SQL, REST, or other standard data query methods, regardless of the source’s file format. It further simplifies data management efforts; however, this depends on its solution and still isn’t standardised.

#big data #latest news #data-science

Data Virtualisation and why has it Evolved?
1.95 GEEK