What is the default method of splitting in decision trees?

The default method of splitting in decision trees is the Gini Index. Gini Index is the measure of impurity of a particular node. This can be changed by making changes to classifier parameters.   The default method of splitting in decision trees depends on the specific implementation or library being used. However, one common default … Read more

What are the performance metrics that can be used to estimate the efficiency of a linear regression model?

The performance metric that is used in this case is: Mean Squared Error R2 score Adjusted R2 score Mean Absolute score   In the context of a machine learning interview, when asked about performance metrics for evaluating the efficiency of a linear regression model, you can mention several commonly used metrics. Here are some key … Read more

How would you define the number of clusters in a clustering algorithm?

The number of clusters can be determined by finding the silhouette score. Often we aim to get some inferences from data using clustering techniques so that we can have a broader picture of a number of classes being represented by the data. In this case, the silhouette score helps us determine the number of cluster … Read more

Explain Eigenvectors and Eigenvalues.

Linear transformations are helpful to understand using eigenvectors. They find their prime usage in the creation of covariance and correlation matrices in data science. Simply put, eigenvectors are directional entities along which linear transformation features like compression, flip etc. can be applied. Eigenvalues are the magnitude of the linear transformation features along each direction of … Read more

What are the advantages and disadvantages of using an Array?

Advantages: Random access is enabled Saves memory Cache friendly Predictable compile timing Helps in re-usability of code Disadvantages: Addition and deletion of records is time consuming even though we get the element of interest immediately through random access. This is due to the fact that the elements need to be reordered after insertion or deletion. … Read more