What is data warehousing?

2 Answers

Anonymous 0 Comments

A data warehouse holds structured data: it’s a large-scale collection of data kept in a consistent, usable format. Compare that to a data lake, which is a large-scale collection of data kept in whatever format it arrived in.

So, imagine you have a company with 10 different branches throughout the world. You may have them all set up so that they work exactly the same way and send the main office the same data in the same format. In this case, you would put the data straight into a structured data warehouse. From there, your main office can pull that data into all of its various systems, and all of your branches can do the same.
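Here is a minimal sketch of that idea, using Python's built-in `sqlite3` as a stand-in for a real warehouse. The `sales` table, its columns, and the sample rows are all made up for illustration; the point is only that every branch writes rows in one agreed shape, so the main office can query across all of them directly.

```python
import sqlite3

# In-memory database standing in for the warehouse (illustrative only).
conn = sqlite3.connect(":memory:")

# Hypothetical schema: every branch reports sales in exactly this shape.
conn.execute(
    "CREATE TABLE sales ("
    "branch TEXT NOT NULL, sale_date TEXT NOT NULL, amount_usd REAL NOT NULL)"
)

# Made-up rows, as if sent by two branches in the same format.
rows = [
    ("london", "2024-01-05", 1200.0),
    ("tokyo", "2024-01-05", 950.0),
    ("london", "2024-01-06", 800.0),
]
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)


def total_by_branch(connection):
    """Return a {branch: total sales in USD} mapping."""
    cursor = connection.execute(
        "SELECT branch, SUM(amount_usd) FROM sales "
        "GROUP BY branch ORDER BY branch"
    )
    return dict(cursor.fetchall())


totals = total_by_branch(conn)
print(totals)
```

Because the format is fixed up front, the query needs no per-branch special cases.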

However, you may not have things set up so efficiently, and some decisions may be left to the branches themselves. Then the data they send to the main office will come in varying formats, maybe even in different currencies. That raw data would go into a data lake first, and you would need to process and reformat it so that it all ends up in one consistent format. That way, your main office can compare each branch’s individual performance in a standardized way.
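As a rough sketch of that clean-up step, the snippet below takes raw records that differ in field names, date formats, and currency, and converts them to one standard shape. Everything here is an assumption for illustration: the field names, the two date formats, and especially the exchange rates in `RATES_TO_USD`, which are invented numbers rather than real rates.

```python
from datetime import datetime

# Made-up exchange rates, for illustration only.
RATES_TO_USD = {"USD": 1.0, "EUR": 1.10, "JPY": 0.007}

# Date formats we assume the branches might use.
KNOWN_DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y")


def normalize(record):
    """Convert one raw branch record to the standard shape."""
    # Branches may name the amount field differently.
    raw_amount = record.get("amount", record.get("total"))
    currency = record.get("currency", "USD")

    # Try each known date format until one parses.
    raw_date = record["date"]
    for fmt in KNOWN_DATE_FORMATS:
        try:
            sale_date = datetime.strptime(raw_date, fmt).date().isoformat()
            break
        except ValueError:
            continue
    else:
        raise ValueError(f"unrecognized date format: {raw_date!r}")

    return {
        "branch": record["branch"].strip().lower(),
        "sale_date": sale_date,
        "amount_usd": round(float(raw_amount) * RATES_TO_USD[currency], 2),
    }


# Two hypothetical branches reporting the same kind of sale differently.
raw_records = [
    {"branch": "London ", "date": "05/01/2024", "total": "1000", "currency": "EUR"},
    {"branch": "tokyo", "date": "2024-01-05", "amount": 100000, "currency": "JPY"},
]
clean_records = [normalize(r) for r in raw_records]
print(clean_records)
```

After this step every record has the same keys, an ISO date, and an amount in a single currency, which is the form a warehouse table would expect.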

You may also collect some data that you just don’t know what to do with yet. In that case you would also use a data lake, since the data is unstructured. Once you do find a use for it, you would likely process and structure it, then move it over into a data warehouse.