python - Boxplots in matplotlib: Markers and outliers
I have some questions about boxplots in matplotlib:
Question A. What do the markers that I highlighted below with Q1, Q2, and Q3 represent? I believe Q1 is the maximum and Q3 are the outliers, but what is Q2?

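Question B. How does matplotlib identify outliers? (i.e. how does it know that they are not the true max and min values?)
A picture is worth a thousand words. Note that the outliers (the + markers in your plot) are simply points outside of the wide [(Q1 - 1.5 IQR), (Q3 + 1.5 IQR)] margin below, where Q1 and Q3 are the first and third quartiles and IQR = Q3 - Q1 is the interquartile range.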
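The 1.5 IQR rule above can be sketched in a few lines. This is only an illustration, not matplotlib's own code: the function name iqr_outliers and the sample data are made up, the factor 1.5 is matplotlib's default for the whis parameter, and the standard-library quartile method used here (linear interpolation between data points) is assumed to match what matplotlib computes by default.

```python
import statistics

def iqr_outliers(values, whis=1.5):
    """Return (lower_fence, upper_fence, outliers) under the whis * IQR rule."""
    data = sorted(values)
    # method="inclusive" interpolates linearly between neighbouring data points
    q1, _median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower = q1 - whis * iqr
    upper = q3 + whis * iqr
    # Anything strictly outside the fences counts as an outlier
    outliers = [x for x in data if x < lower or x > upper]
    return lower, upper, outliers

sample = [1, 2, 3, 4, 5, 6, 7, 8, 9, 50]  # made-up data with one extreme value
lower, upper, outliers = iqr_outliers(sample)
print(lower, upper, outliers)  # -3.5 14.5 [50]
```

Note that matplotlib draws each whisker out to the most extreme data point that still lies inside these limits, so the whisker ends are real observations rather than the limits themselves. Points beyond the whiskers are drawn individually as the + markers.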

However, the picture is only an example for a normally distributed data set. It is important to understand that matplotlib does not estimate a normal distribution first and then calculate the quartiles from the estimated distribution parameters as shown above.
Instead, the median and the quartiles are calculated directly from the data. Thus, your boxplot may look different depending on the distribution of your data and the size of the sample, e.g., asymmetric and with more or fewer outliers.
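To see how much the result depends on the data, here is a rough sketch that computes the same statistics for a symmetric and a skewed sample. The helper box_summary, the seed, and the two synthetic samples are assumptions made for illustration only. The quartiles come from the standard library and are assumed to match matplotlib's default calculation.

```python
import random
import statistics

def box_summary(values, whis=1.5):
    """Quartiles and outlier counts computed directly from the data."""
    data = sorted(values)
    q1, med, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - whis * iqr, q3 + whis * iqr
    return {
        "q1": q1,
        "median": med,
        "q3": q3,
        "low_outliers": sum(x < lower for x in data),
        "high_outliers": sum(x > upper for x in data),
    }

rng = random.Random(0)  # fixed seed so the run is repeatable
symmetric = [rng.gauss(0, 1) for _ in range(1000)]     # roughly normal sample
skewed = [rng.expovariate(1.0) for _ in range(1000)]   # right-skewed sample

for name, sample in [("symmetric", symmetric), ("skewed", skewed)]:
    print(name, box_summary(sample))
```

For the skewed sample the median sits closer to Q1 than to Q3, and outliers appear only on the upper side, so the box looks nothing like the symmetric textbook picture.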