Posts

Showing posts from May, 2023

Primary research questionnaire

I created the questionnaire using Google Forms. Its aim is to survey opinions on Ethical Considerations in Big Data. There are 10 questions altogether. This is the link to the survey: https://forms.gle/V46mjPwc2qR8mVR77

Diagram 2 - Data processing cycle

Image
The data processing cycle, also known as the information processing cycle, is the sequence of steps that transforms raw data into meaningful information.

References: A, Geetha & Iyenger, N Ch Sriman Narayana. (2014). Privacy Preservation Using Intelligent Techniques. DOI: 10.13140/RG.2.2.11993.98402.
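As a rough illustration of the cycle, the sketch below chains four stages (collect, prepare, process, output) over a handful of made-up readings. The stage names, the function names, and the sample values are assumptions chosen for the example; they are not taken from the diagram or the cited paper.

```python
from statistics import mean


def collect():
    # Hypothetical raw input; in practice this might come from a form, sensor, or log.
    return ["21.5", " 22.0", "n/a", "23.5 ", ""]


def prepare(raw):
    # Clean the raw data: trim whitespace and drop entries that are not numbers.
    cleaned = []
    for item in raw:
        try:
            cleaned.append(float(item.strip()))
        except ValueError:
            continue
    return cleaned


def process(values):
    # Turn the cleaned values into summary figures.
    return {"count": len(values), "average": mean(values), "maximum": max(values)}


def output(summary):
    # Present the result in a form a person can read.
    return (
        f"{summary['count']} valid readings, "
        f"average {summary['average']:.1f}, maximum {summary['maximum']:.1f}"
    )


report = output(process(prepare(collect())))
print(report)
```

Each function stands for one step, so the raw strings only become meaningful information once they have passed through the whole chain.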

Diagram 1 - Big Data

Image
The three V's of big data are Volume, Velocity, and Variety. These characteristics distinguish big data from traditional data sources. Volume refers to the massive amount of data generated every second, that is, the size and scale of a dataset. Big data is characterized by its sheer volume, often involving terabytes, petabytes, or even larger amounts of data. 'Velocity comprises the speed (represented in terms of batch, near-real-time, real-time, and streaming) of data processing, emphasizing that the speed with which the data is processed must meet the speed with which the data is produced.' Variety refers to the different forms of data in a dataset, including structured, semi-structured, and unstructured data. Structured data (e.g., stored in a relational database) is mostly well-organized and easily sorted, but unstructured data (e.g., text and multimedia content) is random and difficult to analyse...
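To make Variety more concrete, here is a minimal sketch that handles one tiny example of each form of data using only the Python standard library. The sample records and names are invented for illustration.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields (like a database table).
structured = "id,name,age\n1,Ada,36\n2,Alan,41\n"
rows = list(csv.DictReader(io.StringIO(structured)))
ages = [int(row["age"]) for row in rows]

# Semi-structured: self-describing, but fields may be missing or nested.
semi_structured = '[{"id": 1, "name": "Ada", "tags": ["maths"]}, {"id": 2, "name": "Alan"}]'
records = json.loads(semi_structured)
tags = [record.get("tags", []) for record in records]

# Unstructured: free text with no schema; even a simple count needs text processing.
unstructured = "Ada wrote to Alan about the new machine. Alan replied the next day."
mentions = unstructured.count("Alan")

print(ages, tags, mentions)
```

The structured rows can be read by column name straight away, the semi-structured records need a fallback for the missing field, and the free text offers nothing to query until it has been analysed.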

Plan for Major Project

Image
Plan for Major Project (updated version)

Initial Project Plan (Gantt Chart)

Image
 

Major Project Theme

Image
Big Data

In recent years the amount of data being generated has increased dramatically, giving rise to big data. Big data refers to data sets so large that traditional processing tools cannot handle them effectively. According to IBM, big data can be defined as 'data sets whose size or type is beyond the ability of traditional relational databases to capture, manage and process the data with low latency.' The attributes that define big data are volume, variety, velocity, and variability. Big data has become an integral part of many industries. With the growing use of the internet and the development of technology, more data is being stored. This growth is driven by the Internet of Things, artificial intelligence, social media, and mobile devices, all of which collect data constantly. Machine learning is a part of artificial intelligence that involves the use of algorithms to enable machines to learn from data and improve their performance over time without being explicitly programmed....