- It is a classification problem
- It is called "regression" because it computes a continuous value (the sigmoid output, interpreted as a likelihood/probability)
- It compares the predicted probability against a threshold to reach a conclusion, i.e. a class label (see the sketch after this list)
- It works well for data with 2 classes that are linearly separable by a line (the decision boundary)
- For multiple classes, use softmax (multinomial logistic regression)
- Logistic regression calculates the probability of the positive (target) class only; subtracting it from 1 gives the probability of the negative class
- Training process:
    1. Initialize theta (coefficients) with random values
    2. Calculate the sigmoid (probability) of the output for a case
    3. Compare this probability with the actual output and record the difference as the error
    4. Calculate this error over all training cases; the total error is the cost of the model (the cost function, which for logistic regression is typically log loss rather than MSE)
    5. Change theta in a way that reduces the total cost (using an optimization algorithm such as gradient descent)
    6. Iterate from step 2 until the model is satisfactory, i.e. the cost is low enough
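A minimal sketch of the scoring step described above, using plain NumPy; the coefficients, bias, feature vector, and the 0.5 threshold are made-up values for illustration, not outputs of any fitted model.

```python
import numpy as np

def sigmoid(z):
    # Squashes any real-valued score into the (0, 1) range
    return 1 / (1 + np.exp(-z))

def softmax(z):
    # Multi-class generalization: one probability per class, summing to 1
    exp_z = np.exp(z - np.max(z))
    return exp_z / exp_z.sum()

# Hypothetical coefficients (theta), bias, and one feature vector
theta = np.array([0.8, -0.4])
bias = -0.1
x = np.array([1.5, 2.0])

p_positive = sigmoid(np.dot(theta, x) + bias)  # probability of the positive class
p_negative = 1 - p_positive                    # probability of the negative class
label = int(p_positive >= 0.5)                 # compare with the threshold

print(p_positive, p_negative, label)

# Multi-class case: one raw score per class goes through softmax instead
scores = np.array([1.2, 0.3, -0.8])
print(softmax(scores))
```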
```python
# Import the necessary modules
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, classification_report
from sklearn.model_selection import train_test_split
# Create training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.4, random_state=42)
# Create the classifier: logreg
logreg = LogisticRegression()
# Fit the classifier to the training data
logreg.fit(X_train, y_train)
# Predict the labels of the test set: y_pred
y_pred = logreg.predict(X_test)
# Compute and print the confusion matrix and classification report
print(confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred))
```

```python
import numpy as np
from sklearn.metrics import confusion_matrix
# Specify independent and dependent features
# (df is assumed to be an already-loaded DataFrame; 'A'..'G' and 'target' are its columns)
X = np.asarray(df[['A', 'B', 'C', 'D', 'E', 'F', 'G']])
y = np.asarray(df['target'])
# Preprocess dataset
from sklearn import preprocessing
X = preprocessing.StandardScaler().fit(X).transform(X)
# Split into train and test set
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=4)
# Train the model
from sklearn.linear_model import LogisticRegression
LR = LogisticRegression(C=0.01, solver='liblinear')
LR.fit(X_train,y_train)
# Predict the test set
y_pred = LR.predict(X_test)
# See classification report and confusion matrix
from sklearn.metrics import classification_report, confusion_matrix
print(classification_report(y_test, y_pred))
print(confusion_matrix(y_test, y_pred, labels=[1, 0]))
# Predicted probability on test set for positive/target class
y_pred_prob = LR.predict_proba(X_test)[:, 1]
# Evaluate the model
from sklearn.metrics import jaccard_score
print(jaccard_score(y_test, y_pred, pos_label=0))  # Jaccard index for the class labelled 0
from sklearn.metrics import log_loss
print(log_loss(y_test, y_pred_prob))
from sklearn.metrics import roc_auc_score
print(roc_auc_score(y_test, y_pred_prob))
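# As an optional extra step (a minimal sketch, not part of the original snippet),
# the same predicted probabilities can be used to draw a ROC curve.
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve
fpr, tpr, thresholds = roc_curve(y_test, y_pred_prob)
plt.plot(fpr, tpr, label='Logistic Regression')
plt.plot([0, 1], [0, 1], linestyle='--', label='Chance level')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.legend()
plt.show()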
```

```python
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
from statsmodels.formula.api import logit
model = logit("target ~ x_var", data=df).fit()
print(model.params)
# Visualize logistic model
sns.regplot(x="x_var", y="target", data=df, ci=None, logistic=True)
X_test = pd.DataFrame({"x_var": np.arange(-1, 6.25, 0.25)})
y_pred_prob = model.predict(X_test)
y_pred = np.round(y_pred_prob)
# Odds (often loosely called the odds ratio): p / (1 - p),
# the probability of something happening over it not happening
X_test["odds_ratio"] = y_pred_prob / (1 - y_pred_prob)
# Visualize the odds / log odds
sns.lineplot(x="x_var", y="odds_ratio", data=X_test)
plt.axhline(y=1, linestyle="dotted")
plt.yscale("log")  # a log scale makes the curve linear; equivalently plot np.log(odds_ratio)
plt.show()
# Confusion matrix
conf_matrix = model.pred_table()  # rows = actual class, columns = predicted class
TN = conf_matrix[0,0]
TP = conf_matrix[1,1]
FN = conf_matrix[1,0]
FP = conf_matrix[0,1]
# Visualize confusion matrix
from statsmodels.graphics.mosaicplot import mosaic
mosaic(conf_matrix)
```
Why don't we use linear regression for classification as well?
Linear regression:
- Finds a line that fits and aligns tightly with the data
- Goal: the line is the trend; any new value is expected to appear *ON* the line
- Predicts the value itself
- The predicted value is continuous and can go below 0 or above 1 (as shown in the sketch after this list)
Logistic regression:
- Finds a line / plane that separates the two classes (the decision boundary)
- Goal: the line is a no-man's land; a new value will appear on *EITHER SIDE* of the line
- Predicts which class the value falls in (via the sigmoid of the linear score)
- The predicted probability lies between 0 and 1, and the final label is discrete (0 or 1)
- Construction: https://vitalflux.com/wp-content/uploads/2022/03/logistic-regression-model-3.png
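To make the contrast concrete, here is a minimal sketch on a made-up one-dimensional dataset: the linear fit can return values below 0 or above 1 for new inputs, while the logistic fit always returns probabilities in (0, 1) plus a discrete label.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

# Made-up 1-D data: small x values belong to class 0, large ones to class 1
X = np.array([[0.5], [1.0], [1.5], [2.0], [3.0], [3.5], [4.0], [4.5]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

lin = LinearRegression().fit(X, y)
log_reg = LogisticRegression().fit(X, y)

X_new = np.array([[-2.0], [2.5], [8.0]])
print(lin.predict(X_new))                  # continuous values, can fall outside [0, 1]
print(log_reg.predict_proba(X_new)[:, 1])  # probabilities, always inside (0, 1)
print(log_reg.predict(X_new))              # discrete class labels (0 or 1)
```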
Things become clearer when we look at the loss functions (see the sketch after this list):
- Regression loss:
    - Loss is higher the further the prediction is from the true target value
    - Loss happens in both directions (since the target is a continuous value)
    - The squared-loss curve captures this behaviour perfectly
    - Goal: capture how close predictions are to the true continuous value, on both sides (positive or negative error)
- Logistic loss:
    - Loss is high only for incorrect classifications
    - Loss effectively happens in one direction (since it is a binary classification)
    - Squared loss captures only one direction correctly; in the other direction it mistakenly implies that "the perfect model also has squared loss, and so the perfect model is the worst model"
    - Goal: penalize the probability assigned to the incorrect side (the sign does not matter as long as the discrete prediction is wrong; a correct prediction has close to 0 loss)
    - We eliminate the mistaken side by introducing the logistic (sigmoid) function, whose output is restricted to the range 0 to 1
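A small numeric sketch of this point, assuming a single example whose true class is 1: squared loss on the raw score grows again for confidently *correct* predictions, while the logistic (log) loss only punishes being on the wrong side.

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

# Raw model scores for one example whose true class is 1.
# Negative scores are wrong; large positive scores are confidently correct.
raw_scores = np.array([-4.0, -1.0, 0.0, 1.0, 4.0, 10.0])

squared_loss = (1 - raw_scores) ** 2          # squared loss on the raw score
logistic_loss = -np.log(sigmoid(raw_scores))  # log loss on the sigmoid probability

for s, sq, ll in zip(raw_scores, squared_loss, logistic_loss):
    print(f"score={s:6.1f}  squared={sq:7.2f}  logistic={ll:7.4f}")
# Squared loss is huge at score=10 even though that prediction is confidently correct;
# logistic loss keeps shrinking toward 0 there and is large only for wrong predictions.
```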
```python
import numpy as np
class LogisticRegression:
    def __init__(self, learning_rate=0.01, num_iterations=10000):
        self.learning_rate = learning_rate
        self.num_iterations = num_iterations
        self.weights = None
        self.bias = None

    def fit(self, X, y):
        # initialize weights and bias to zero
        self.weights = np.zeros(X.shape[1])
        self.bias = 0

        # gradient descent
        for i in range(self.num_iterations):
            z = np.dot(X, self.weights) + self.bias
            y_pred = self.sigmoid(z)

            # calculate gradients
            dw = (1 / X.shape[0]) * np.dot(X.T, (y_pred - y))
            db = (1 / X.shape[0]) * np.sum(y_pred - y)

            # update weights and bias
            self.weights -= self.learning_rate * dw
            self.bias -= self.learning_rate * db

    def predict(self, X):
        z = np.dot(X, self.weights) + self.bias
        y_pred = self.sigmoid(z)
        return np.round(y_pred)

    def sigmoid(self, z):
        return 1 / (1 + np.exp(-z))
```
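A quick usage sketch for this from-scratch class on a tiny made-up dataset (the data and hyperparameter values are illustrative only):

```python
import numpy as np

# Tiny separable dataset: two features, two classes
X = np.array([[0.2, 1.0], [0.4, 0.8], [0.5, 1.2],
              [2.0, 0.1], [2.2, 0.3], [2.5, 0.2]])
y = np.array([0, 0, 0, 1, 1, 1])

model = LogisticRegression(learning_rate=0.1, num_iterations=5000)
model.fit(X, y)
print(model.predict(X))           # should recover [0. 0. 0. 1. 1. 1.] on this toy data
print(model.weights, model.bias)  # learned coefficients (theta) and intercept
```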