Explain what is K-mean Algorithm?

K mean is a famous partitioning method. Objects are classified as belonging to one of K groups, k chosen a priori. In K-mean algorithm, The clusters are spherical: the data points in a cluster are centered around that cluster The variance/spread of the clusters is similar: Each data point belongs to the closest cluster The … Read more

Explain what is Hierarchical Clustering Algorithm?

Hierarchical clustering algorithm combines and divides existing groups, creating a hierarchical structure that showcase the order in which groups are divided or merged. Hierarchical clustering is a popular method used in data analytics and machine learning for grouping similar data points into clusters based on their characteristics. It’s an unsupervised learning algorithm, meaning it doesn’t … Read more

Mention how to deal the multi-source problems?

To deal the multi-source problems, Restructuring of schemas to accomplish a schema integration Identify similar records and merge them into single record containing all relevant attributes without redundancy When dealing with multi-source data in data analytics, it’s important to have a systematic approach to integrate, clean, and analyze the data effectively. Here are some key … Read more

Explain what should be done with suspected or missing data?

Prepare a validation report that gives information of all suspected data. It should give information like validation criteria that it failed and the date and time of occurrence Experience personnel should examine the suspicious data to determine their acceptability Invalid data should be assigned and replaced with a validation code To work on missing data … Read more

Mention what are the data validation methods used by data analyst?

Usually, methods used by data analyst for data validation are Data screening Data verification Data validation is a crucial aspect of data analysis to ensure the accuracy, reliability, and consistency of the data being analyzed. Here are some common data validation methods used by data analysts: Manual Inspection: This involves visually inspecting the data to … Read more