Regression models
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Conor Dewey
Data Scientist, Squarespace
Regression models P R AC TIC IN G STATISTIC S IN TE R VIE W QU E - - PowerPoint PPT Presentation
Regression models P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON Conor De w e y Data Scientist , Sq u arespace Getting started 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON Ass u mptions Linear
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Conor Dewey
Data Scientist, Squarespace
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikimedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Linear relationship Errors are normally distributed Homoscedasticity Independent observations
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikipedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
from sklearn.linear_model import LinearRegression lm = LinearRegression() lm.fit(X_train, y_train) LinearRegression(copy_X=True, fit_intercept=True, n_jobs=None, normalize=False)
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
coef = lm.coef_ print(coef) [0.79086669]
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikimedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
from sklearn.linear_model import LogisticRegression clf = LogisticRegression(solver='lbfgs') clf.fit(X_train, y_train) LogisticRegression(C=1.0, class_weight=None, dual=False, fit_intercept=True, intercept_scaling=1, max_iter=100, multi_class='warn', n_jobs=None, penalty='l2', random_state=None, solver='lbfgs', tol=0.0001, verbose=0, warm_start=False)
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
coefs = clf.coef_ print(coefs) [[0.4015177 3.85056451]] accuracy = clf.score(X_test, y_test) print(accuracy) 0.8583333333333333
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Review Assumptions Linear regression Logistic regression
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Conor Dewey
Data Scientist, Squarespace
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
R-squared Mean absolute error (MAE) Mean squared error (MSE)
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikimedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikimedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
120 Data Science Interview Questions
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Precision Recall Confusion matrices
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
AB Tasty
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
AB Tasty
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
AB Tasty
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
R-squared Mean absolute error (MAE) vs. mean squared error (MSE) Precision and recall
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Conor Dewey
Data Scientist, Squarespace
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Drop the whole row Impute missing values
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
df.dropna(inplace=True)
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Constant value Randomly selected record Mean, median, or mode Value estimated by another model
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
isnull() dropna() fillna()
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Standard deviations Interquartile range (IQR)
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikimedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Wikimedia
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Drop the whole row Impute missing values Standard deviations Interquartile range
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Conor Dewey
Data Scientist, Squarespace
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Bias error Variance error Irreducible error
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
How to Use Machine Learning to Predict the Quality of Wines
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
How to Use Machine Learning to Predict the Quality of Wines
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Sco Fortmann
1
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Types of error Bias error Variance error Bias-variance tradeo
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Conor Dewey
Data Scientist, Squarespace
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Conditional probabilities Central limit theorem Probability distributions
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Descriptive statistics Categorical data Encoding techniques Multivariate relationships
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Condence intervals Hypothesis testing Power analysis Multiple comparisons
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Linear regression Logistic regression Missing data and outliers Bias-variance tradeo
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Simulate the interview environment Practice explaining big concepts Know the business or product well Come prepared with ideas
PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Data Science Career Resources Repo Practical Statistics for Data Scientists 120 Data Science Interview Questions Advice Applying to Data Science Jobs
P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON