Today, it is possible to see information in all areas of life.. Now everyone has a smart phone in their pocket, a computer in everyone’s home, and information technology management units in the back offices of all companies.. Today, not only the amount of information has increased, but also the speed of access to information has increased.. The concept of big data was first used in the field of astronomy and genetics, then this concept was used for every field.
Big Data
All data collected from different sources such as social media shares, blogs, blogs, photos, videos, log files is converted into a meaningful and workable format.
Big data, logs of web servers, internet statistics , social media publications, blogs, microblogs, information from climate sensors and similar sensors, call records obtained from GSM operators, etc. It enables them to take risks, manage their risks better, and innovate.
Most companies continue their strategies based on the data they obtain through data collection methods.. Big data emerges as a term that includes many subjects such as the creation, storage, flow and analysis of this big data, which is difficult to process with traditional database tools.. As the data is too large for classical databases to handle, the growth rate of data exceeds a computer or a data storage unit.. With 2012 figures, 2.5 quintillion bytes of data are produced daily in the world.. All of the works such as processing and transferring big data in this scale are called Big Data).
Today’s databases are not enough to keep the data growing in this scale.. While relational databases can hold data at gigabyte level, with big data we can store data at petabyte level.. It should be used in cases where big data is written once and read many times.. Because data is processed in parallel in more than one place.
One of the most important contributions of the Internet is undoubtedly the point of access to information.. Although information is presented in various forms, the internet has also been instrumental in very important developments in accessing information through books, which is the classical method.. Searching for books on the internet and delivering them to the address is a very important contribution compared to the pre-internet.. In addition, the digitization of books by companies such as Google removes the limits on access to books.. It is seen that 130 million different books with different titles have been published so far, and about 20 million of these books have been transferred to digital media by Google.
Information gathering via the Internet goes beyond being just a passive collector.. The huge information Google collects for street images has started to be used not only for Google Earth application, but also for GPS services.. The company also has an inventory of Wi-Fi connections on the streets.. Searches on search engines are a good example of this.. Facebook, where data is stored up to personal data, and Wikipedia sites, which serve as encyclopedias where all kinds of information are collected, can be cited as another example.
Components in Big Data
There are five components in the formation of a big data platform.. These; variety, velocity, volume, verification and value. Since it is generally explained as 5v, English equivalents can be included.
- Variety: 80 percent of the data produced is unstructured and every newly produced technology provides data in different formats. can produce. All kinds of “Data Types” have to be dealt with from phones, tablets, integrated circuits. Also, if you think that this data can be in different languages and Non-Unicode, they need to be integrated and transformed into each other.
- Velocity: The speed of production of Big Data is very high and increasing.. Data that reproduces faster results in an increase in the number and diversity of transactions that need that data at the same rate.
- Volume: According to IDC statistics, the amount of data to be reached in 2020 will be 44 times that of 2009.. It is necessary to think about the “large” capacities and “large systems” currently in use, and imagine how they will deal with data that is 44 times larger. Another component is. During the flow, it needs to be monitored at the required security level, without the right layer, and visible or hidden by the right people.
- Value: The most important component is creating value. Big Data, which is described with all the above efforts, should create an added value for the institution after your data production and processing layers.. It needs to have an instant effect on your decision-making processes, and it should be at your fingertips to make the right decision.. For example, a government agency that takes strategic decisions on health should be able to instantly see the distribution of diseases, drugs, doctors in details such as region, province, district, etc.