Friday, January 23, 2015

Data Systems

Generally all Data Systems have below components:

1. Data Collection:
Enables feeding data into the system. Copy log files, sync with external database etc

2. Data Storage
Stores data. Can be disk based, or Database too.

3. Processing
Processes the large volume of data. Data can be stored on disk or database.

4. Querying
Fetch the data from Data Storage with appropriate filters.

5. Reporting
Display/Visualize/Generate reports Information derived from the Data.

Data Systems components

What differentiates BigData systems with Traditional Data Systems is Scalability. Traditional systems satisfied the requirement of the past era where a single installation of database could meet the requirement of the whole organization. In the age of internet, Such systems do not cater to process webscale of data.

Difference b/w Traditional data systems and BigData systems

BigData systems:

There are many BigData systems available, below is the list of few well known systems.

1. Hadoop: Developed and opensourced by Yahoo!
2. Cosmos: Microsoft BigData system
3. Oracle Big Data Appliance: Hardware and Software combo from Oracle.


Among these, Hadoop is well known opensource BigData ecosystem.


No comments:

Post a Comment