An important concept regarding machine learning.
Distribution: Distribution shows us how the probabilities of measurements are distributed.
1. Normal Distribution:- Normal distribution is always centered on the average value.
Standard Deviation: The width of the curve defines the 'standard deviation'. It is called a central limit theorem. Width of the mean value.
Mean (x̅ ): Center value or average value of the distribution.
Both represent the population called as Population Parameter. The parameters that determine how a distribution fits the population data are called Population Parameters.
2. Exponential Distribution:- It is also used for probabilities and statistics.
3. Gamma Distribution:- It is also used for probabilities and statistics. In this type of distribution, shape and rate are parameters.
Variance:
Standard Deviation:
Model: A model is a way to explore a relationship. We use statistics to determine how useful and how reliable our model is.
Hypothesis: Hypothesis means the assumption of data. Trying to find out things are the same or not. There is hypothesis testing.
- Null hypothesis testing: In this testing, there is no difference between the two things. Or no difference between the mean of groups. it is denoted as HA. 
 - Alternative hypothesis testing: It is opposite to the Null hypothesis. In this testing at least one group mean is different. It is denoted as Ho.
 
- True Positive: We predicted yes and the actual result is also yes.
 - True Negative: We predicted no and the actual result is also no.
 - False Positive: We predicted yes but the actual result is no
 - False Negative: We predicted no but the actual result is yes.
 
- Horizontal Data Format: 
 - Vertical Data Format:
 
- 100% of sets with apples contain bananas.
 - 50% of sets with apple, banana have watermelon.
 - 50% of sets with apple, banana have any other fruits.
 
- Mean: Avg value. Used when data is not normally distributed. In this case, outliers occur. Outliers: It can occur by chance in distribution but they indicate either measurement error or heavy-tailed distribution.
 - Median: Middle value. Used when dealing with ordinal data. Ex. Strongly dislike, dislike, like, strongly like.
 - Mode: The value that occurs frequently in the dataset. Used when dealing with unordered categories data.
 - Skewness: If it looks the same to the left and right of the center.
 - Kurtosis: Measure of whether the data are peaked or flat relative to the rest of the data.
 












Comments
Post a Comment
If you have any doubt, please let me know. To check my other blog kindly check the following links:
https://pythoholic.blogspot.com/
If you are interested in reading Marathi stories and other stuff, kindly check the following link.
https://pratilipi.page.link/q8dZ4ffZwKPHUx6R9
ꜰᴏʀ ᴇxᴘʟᴏʀɪɴɢ ᴛʜᴇ ᴡᴏʀʟᴅ ᴘʟᴇᴀꜱᴇ ʜᴀᴠᴇ ʟᴏᴏᴋ ᴀɴᴅ ꜰᴏʟʟᴏᴡ.
https://maps.app.goo.gl/jnKyzdDpKMFutUqR7