SlideShare a Scribd company logo
Multinomial Logistic Regression Models




January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Logistic regression CAN handle dependant variables
   with more than two categories
  It is important to note whether the response variable
   is ordinal (consisting of ordered categories like young,
   middle-aged, old) or nominal (dependant is unordered
   like red, blue, black)
  Some multinomial logistic models are appropriate only
   for ordered response
  It is not mathematically necessary to consider the
   natural ordering when modeling ordinal response but,
  Considering the natural ordering
       Leads to a more parsimonious model
       Increase power to detect relationships with other variables

January 1, 2013    ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Applying logistic regression considering the natural
   order is done using a modeling technique called the
   “Proportional Odds Model”
  Say the dependant variable Y has 4 states measuring
   the impact of radiation on the human body; fine,
   sick, serious,dead
  Let p1=prob of fine, p2=prob of sick, p3=prob of
   serious, p4=prob of dead
  Let us define a baseline category: fine, since this is
   the normal stage (we shall see why we need this
   later)
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    What if we break up the modeling of the 4 level
      ordered dependant into 3 binary logistic
      situations: 1 – (fine,sick), 2 – (fine,serious),3 –
      (fine,dead)?
     Then we would have 3 logit equations:
       Log(p2/p1)=B11+B12X1+B13X2
       Log(p3/p1)=B21+B22X1+B23X2
       Log(p4/p1)=B31+B32X1+B33X2
      X is the degree of radiation dummy with 3 levels so
        broken into 2 binary dummies
     So, 9 parameters to be estmated

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Now consider an alternative model for the same
      situation
     Cumulative logit model:
       L1=log(p1/p2+p3+p4)
       L2=log(p1+p2/p3+p4)
       L3=log(p1+p2+p3/p4)
     The obvious way to introduce covariates is
       L1=B11+B12X1+B13X2
       L2=B21+B22X1+B23X2
       L3=B31+B32X1+B33X2

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Let us simplyfy the model by specifying that
      the slope parameters are identical over the
      logit equations. Then,
       L1=A1+B1X1+B2X2
       L2=A2+B1X1+B2X2
       L3=A3+B1X1+B2X2
     This is the proportional odds cumulative logit
      model

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Suppose that the categorical outcome is actually a
   categorized version of an unobservable (latent)
   continuous variable which has a logistic distribution
  The continuous scale is divided into five regions by
   four cut-points c1, c2, c3, c4 which are determined by
   nature
  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y =
   2; and so on
  Suppose that the Z is related to the X’s through a linear
   regression
  Then, the coarsened categorical variable would be
   related Y will be related to the X’s by a proportional-
   odds cumulative logit model
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Let us go back to the model
       L1=A1+B1X1+B2X2
       L2=A2+B1X1+B2X2
       L3=A3+B1X1+B2X2
     Note that Lj is the log-odds of falling into or below category j
      versus falling above it
     Aj is the log-odds of falling into or below category j when X1 =
      X2 = 0
     B1 is the increase in log-odds of falling into or below any
      category associated with a one-unit increase in Xk, holding all
      the other X-variables constant.
     Therefore, a positive slope indicates a tendency for the
      response level to decrease as the variable decreases
January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Our example of 4 levels of impact of radiation
      corresponding to 3 levels of radiation

 proc logistic data=radiation_impact;
 freq count;
 class radiation / order=data param=ref ref=first;
 model sickness (order=data descending) = radiation /
   link=logit
 aggregate=(radiation) scale=none;
 run;
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Freq=count
       This is important for specifying grouped data
       Count is the variable that contains the frequency of
        occurrance of each observation
       In its absence, each row would be considered a
        separate row of data
     Class=radiation
       Specifies that radiation is a classification variable to
        be used in the analysis
       SAS would automatically generate n-1 binary
        dummies for n categories of radiation with param=ref
        option
January 1, 2013    ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Order=data
       Simply tells SAS to arrange the response categories in
          the order they occur in the input data 1,2,3,4
     Param=ref
       This implies that there is going to be dummy coding
          for the classification variable ‘radiation’listed in class
     Ref=first
       Designates the first ordered level, in this case ‘fine’ as
          the reference level

January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Order=data descending
         This tells SAS to reverse the order of the logits
         So, instead of the cumulative logit model being
         L1=log(p1/p2+p3+p4)
         L2=log(p1+p2/p3+p4)
         L3=log(p1+p2+p3/p4), it becomes
         L1=log(p4/p1+p2+p3)
         L2=log(p4+p3/p1+p2)
         L3=log(p4+p3+p2/p1)
     Now, a positive B1 indicates that a higher value of X1
      leads to greater chance of radiation sickness
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Link=logit
       fits the cumulative logit model when there are more
          than two response categories
     Aggregate=radiation
       Indicates that the goodness of fit statistics are to be
          calculated on the subpopulations of the variable:
          radiation
     Scale=none
       No correction is need for the dispersion parameter
       To understand this, read up. This happens when the
          goodness of fit statistic exceeds its degrees of freedom
          and need to be corrected for
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    When we fit this model, the first output we
      see:
 Score Test for the Proportional Odds Assumption
 Chi-Square DF         Pr > ChiSq
 17.2866        21     0.6936

  Null hypothesis is that the current proportional-odds
   cumulative logit model is true
  Seems like we fail to reject the null and so can proceed to the
   rest of the output under the current assumption


January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Ultimately we are interested in the predicted
      probabilities
          OUTPUT <OUT=SAS-data-set><options>
     Predicted=
       For a cumulative model, it is the predicted cumulative
          probability (that is, the probability that the response
          variable is less than or equal to the value of _LEVEL_);
     PREDPROBS=I or C
       Individual|I requests the predicted probability of each
        response level.
       CUMULATIVE | C requests the cumulative predicted
        probability of each response level
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India

More Related Content

What's hot (18)

PDF
Mm2521542158
IJERA Editor
 
PDF
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
inventionjournals
 
PDF
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
inventionjournals
 
PDF
A review of automatic differentiationand its efficient implementation
ssuserfa7e73
 
PDF
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
orajjournal
 
PDF
Linear programming
VARUN KUMAR
 
PDF
Cg32519523
IJERA Editor
 
PPTX
Chapter two
Mohamed Daahir
 
PDF
Fractional calculus and applications
PlusOrMinusZero
 
PDF
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
ijtsrd
 
PPT
Linear Programming 1
irsa javed
 
PDF
Heptagonal Fuzzy Numbers by Max Min Method
YogeshIJTSRD
 
PDF
Linear programming class 12 investigatory project
Divyans890
 
PDF
A new approach for ranking of intuitionistic fuzzy numbers
Journal of Fuzzy Extension and Applications
 
PDF
linear programming
DagnaygebawGoshme
 
PDF
Certified global minima
ssuserfa7e73
 
PDF
Dr azimifar pattern recognition lect2
Zahra Amini
 
PDF
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
Navodaya Institute of Technology
 
Mm2521542158
IJERA Editor
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
inventionjournals
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
inventionjournals
 
A review of automatic differentiationand its efficient implementation
ssuserfa7e73
 
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
orajjournal
 
Linear programming
VARUN KUMAR
 
Cg32519523
IJERA Editor
 
Chapter two
Mohamed Daahir
 
Fractional calculus and applications
PlusOrMinusZero
 
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
ijtsrd
 
Linear Programming 1
irsa javed
 
Heptagonal Fuzzy Numbers by Max Min Method
YogeshIJTSRD
 
Linear programming class 12 investigatory project
Divyans890
 
A new approach for ranking of intuitionistic fuzzy numbers
Journal of Fuzzy Extension and Applications
 
linear programming
DagnaygebawGoshme
 
Certified global minima
ssuserfa7e73
 
Dr azimifar pattern recognition lect2
Zahra Amini
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
Navodaya Institute of Technology
 

Viewers also liked (20)

PDF
Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Jay (Jianqiang) Wang
 
PPTX
The power of RapidMiner, showing the direct marketing demo
Wessel Luijben
 
PPTX
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Josh Patterson
 
PDF
Statisticsfor businessproject solution
huynguyenbac
 
DOCX
Financialmodeling
Talal Tahir
 
PDF
Chapter 16
Matthew L Levy
 
PPTX
Midterm
Wilkes U
 
PDF
Ordinal Logistic Regression
Al-Ahmadgaid Asaad
 
PDF
Logistic Regression/Markov Chain presentation
Michael Hankin
 
PPT
Qam formulas
Ashu Jain
 
PDF
Transparency7
A M
 
PPTX
1.5.1 measures basic concepts
A M
 
PPTX
Regression
Ali Raza
 
PDF
(마더세이프 라운드) Logistic regression
mothersafe
 
PDF
Regression: A skin-deep dive
abulyomon
 
PPTX
ForecastIT 2. Linear Regression & Model Statistics
DeepThought, Inc.
 
PPTX
Generalized Logistic Regression - by example (Anthony Kilili)
Anthony Kilili
 
PDF
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
mohamedchaouche
 
PDF
C2.1 intro
Daniel LIAO
 
PPTX
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Aakash Bagchi
 
Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Jay (Jianqiang) Wang
 
The power of RapidMiner, showing the direct marketing demo
Wessel Luijben
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Josh Patterson
 
Statisticsfor businessproject solution
huynguyenbac
 
Financialmodeling
Talal Tahir
 
Chapter 16
Matthew L Levy
 
Midterm
Wilkes U
 
Ordinal Logistic Regression
Al-Ahmadgaid Asaad
 
Logistic Regression/Markov Chain presentation
Michael Hankin
 
Qam formulas
Ashu Jain
 
Transparency7
A M
 
1.5.1 measures basic concepts
A M
 
Regression
Ali Raza
 
(마더세이프 라운드) Logistic regression
mothersafe
 
Regression: A skin-deep dive
abulyomon
 
ForecastIT 2. Linear Regression & Model Statistics
DeepThought, Inc.
 
Generalized Logistic Regression - by example (Anthony Kilili)
Anthony Kilili
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
mohamedchaouche
 
C2.1 intro
Daniel LIAO
 
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Aakash Bagchi
 
Ad

Similar to Logistic regression for ordered dependant variable with more than 2 levels (20)

PPTX
Multinomial Logistic Regression Analysis
HARISH Kumar H R
 
PDF
4_logit_printable_.pdf
Elio Laureano
 
PDF
goodness of fit in ordinal regression.pdf
HenokBuno
 
PPT
M8.logreg.ppt
TanyaWadhwani4
 
PPT
M8.logreg.ppt
SuaibDanish
 
PPTX
Logistic regression - one of the key regression tools in experimental research
Adrian Olszewski
 
PDF
Logistic regression
Ayurdata
 
PPTX
ERF Training Workshop Panel Data 5
Economic Research Forum
 
PPTX
7. The sixCategorical data analysis.pptx
AbasAhmed7
 
PDF
Ordinal logistic regression
Dr Athar Khan
 
PPTX
Topic3-Qualitative and Limited Dependent Variables.pptx
Warrenchiwoko
 
PPT
whitehead-logistic-regression.ppt
19DSMA012HarshSingh
 
PPTX
ECONOMETRICS-LBS-2025-#11-LOGIT (1).pptx
dishu0796
 
DOC
Ch 56669 Slides.doc.2234322344443222222344
ohenebabismark508
 
PPT
fundamentals of -logistic-regression ppt
DharmishthaChaudhari
 
PPTX
Predictive analytics and Type of Predictive Analytics
Abhishek Job
 
PPT
logistic-regression with examples000.ppt
Shradha Gupta
 
PDF
Application Of Ordinal Logistic Regression In The Study Of Students Performance
Martha Brown
 
PDF
Logistics regression
SALWAidrissiakhannou
 
PDF
Binary OR Binomial logistic regression
Dr Athar Khan
 
Multinomial Logistic Regression Analysis
HARISH Kumar H R
 
4_logit_printable_.pdf
Elio Laureano
 
goodness of fit in ordinal regression.pdf
HenokBuno
 
M8.logreg.ppt
TanyaWadhwani4
 
M8.logreg.ppt
SuaibDanish
 
Logistic regression - one of the key regression tools in experimental research
Adrian Olszewski
 
Logistic regression
Ayurdata
 
ERF Training Workshop Panel Data 5
Economic Research Forum
 
7. The sixCategorical data analysis.pptx
AbasAhmed7
 
Ordinal logistic regression
Dr Athar Khan
 
Topic3-Qualitative and Limited Dependent Variables.pptx
Warrenchiwoko
 
whitehead-logistic-regression.ppt
19DSMA012HarshSingh
 
ECONOMETRICS-LBS-2025-#11-LOGIT (1).pptx
dishu0796
 
Ch 56669 Slides.doc.2234322344443222222344
ohenebabismark508
 
fundamentals of -logistic-regression ppt
DharmishthaChaudhari
 
Predictive analytics and Type of Predictive Analytics
Abhishek Job
 
logistic-regression with examples000.ppt
Shradha Gupta
 
Application Of Ordinal Logistic Regression In The Study Of Students Performance
Martha Brown
 
Logistics regression
SALWAidrissiakhannou
 
Binary OR Binomial logistic regression
Dr Athar Khan
 
Ad

More from Arup Guha (7)

PPT
Structural breaks, unit root tests and long time series
Arup Guha
 
PDF
Partnership with a Premier Business School
Arup Guha
 
PPTX
Beer Market Analytics Solutions
Arup Guha
 
PPTX
Database marketing
Arup Guha
 
PPTX
How principal components analysis is different from factor
Arup Guha
 
PPTX
Using survival analysis results
Arup Guha
 
PPT
Measuring Actual Effect Of Tv Ads On Sales Lk
Arup Guha
 
Structural breaks, unit root tests and long time series
Arup Guha
 
Partnership with a Premier Business School
Arup Guha
 
Beer Market Analytics Solutions
Arup Guha
 
Database marketing
Arup Guha
 
How principal components analysis is different from factor
Arup Guha
 
Using survival analysis results
Arup Guha
 
Measuring Actual Effect Of Tv Ads On Sales Lk
Arup Guha
 

Recently uploaded (20)

PDF
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
PDF
Java 25 and Beyond - A Roadmap of Innovations
Ana-Maria Mihalceanu
 
PPTX
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
PPTX
Enabling the Digital Artisan – keynote at ICOCI 2025
Alan Dix
 
PPTX
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
pcprocore
 
PPTX
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
 
PDF
Open Source Milvus Vector Database v 2.6
Zilliz
 
PDF
The Growing Value and Application of FME & GenAI
Safe Software
 
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
 
PDF
LLM Search Readiness Audit - Dentsu x SEO Square - June 2025.pdf
Nick Samuel
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PPTX
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
PDF
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
PDF
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
 
PPTX
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Poster...
Michele Kryston
 
PDF
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
PPTX
01_Approach Cyber- DORA Incident Management.pptx
FinTech Belgium
 
DOCX
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
 
PDF
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
Edge AI and Vision Alliance
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
Java 25 and Beyond - A Roadmap of Innovations
Ana-Maria Mihalceanu
 
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
Enabling the Digital Artisan – keynote at ICOCI 2025
Alan Dix
 
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
pcprocore
 
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
 
Open Source Milvus Vector Database v 2.6
Zilliz
 
The Growing Value and Application of FME & GenAI
Safe Software
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
Why aren't you using FME Flow's CPU Time?
Safe Software
 
LLM Search Readiness Audit - Dentsu x SEO Square - June 2025.pdf
Nick Samuel
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
 
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Poster...
Michele Kryston
 
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
01_Approach Cyber- DORA Incident Management.pptx
FinTech Belgium
 
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
 
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
Edge AI and Vision Alliance
 

Logistic regression for ordered dependant variable with more than 2 levels

  • 1. Multinomial Logistic Regression Models January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 2.  Logistic regression CAN handle dependant variables with more than two categories  It is important to note whether the response variable is ordinal (consisting of ordered categories like young, middle-aged, old) or nominal (dependant is unordered like red, blue, black)  Some multinomial logistic models are appropriate only for ordered response  It is not mathematically necessary to consider the natural ordering when modeling ordinal response but,  Considering the natural ordering  Leads to a more parsimonious model  Increase power to detect relationships with other variables January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 3.  Applying logistic regression considering the natural order is done using a modeling technique called the “Proportional Odds Model”  Say the dependant variable Y has 4 states measuring the impact of radiation on the human body; fine, sick, serious,dead  Let p1=prob of fine, p2=prob of sick, p3=prob of serious, p4=prob of dead  Let us define a baseline category: fine, since this is the normal stage (we shall see why we need this later) January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 4. What if we break up the modeling of the 4 level ordered dependant into 3 binary logistic situations: 1 – (fine,sick), 2 – (fine,serious),3 – (fine,dead)?  Then we would have 3 logit equations:  Log(p2/p1)=B11+B12X1+B13X2  Log(p3/p1)=B21+B22X1+B23X2  Log(p4/p1)=B31+B32X1+B33X2 X is the degree of radiation dummy with 3 levels so broken into 2 binary dummies  So, 9 parameters to be estmated January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 5. Now consider an alternative model for the same situation  Cumulative logit model:  L1=log(p1/p2+p3+p4)  L2=log(p1+p2/p3+p4)  L3=log(p1+p2+p3/p4)  The obvious way to introduce covariates is  L1=B11+B12X1+B13X2  L2=B21+B22X1+B23X2  L3=B31+B32X1+B33X2 January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 6. Let us simplyfy the model by specifying that the slope parameters are identical over the logit equations. Then,  L1=A1+B1X1+B2X2  L2=A2+B1X1+B2X2  L3=A3+B1X1+B2X2  This is the proportional odds cumulative logit model January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 7.  Suppose that the categorical outcome is actually a categorized version of an unobservable (latent) continuous variable which has a logistic distribution  The continuous scale is divided into five regions by four cut-points c1, c2, c3, c4 which are determined by nature  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y = 2; and so on  Suppose that the Z is related to the X’s through a linear regression  Then, the coarsened categorical variable would be related Y will be related to the X’s by a proportional- odds cumulative logit model January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 8. Let us go back to the model  L1=A1+B1X1+B2X2  L2=A2+B1X1+B2X2  L3=A3+B1X1+B2X2  Note that Lj is the log-odds of falling into or below category j versus falling above it  Aj is the log-odds of falling into or below category j when X1 = X2 = 0  B1 is the increase in log-odds of falling into or below any category associated with a one-unit increase in Xk, holding all the other X-variables constant.  Therefore, a positive slope indicates a tendency for the response level to decrease as the variable decreases January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 9. Our example of 4 levels of impact of radiation corresponding to 3 levels of radiation proc logistic data=radiation_impact; freq count; class radiation / order=data param=ref ref=first; model sickness (order=data descending) = radiation / link=logit aggregate=(radiation) scale=none; run; January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 10. Freq=count  This is important for specifying grouped data  Count is the variable that contains the frequency of occurrance of each observation  In its absence, each row would be considered a separate row of data  Class=radiation  Specifies that radiation is a classification variable to be used in the analysis  SAS would automatically generate n-1 binary dummies for n categories of radiation with param=ref option January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 11. Order=data  Simply tells SAS to arrange the response categories in the order they occur in the input data 1,2,3,4  Param=ref  This implies that there is going to be dummy coding for the classification variable ‘radiation’listed in class  Ref=first  Designates the first ordered level, in this case ‘fine’ as the reference level January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 12. Order=data descending  This tells SAS to reverse the order of the logits  So, instead of the cumulative logit model being  L1=log(p1/p2+p3+p4)  L2=log(p1+p2/p3+p4)  L3=log(p1+p2+p3/p4), it becomes  L1=log(p4/p1+p2+p3)  L2=log(p4+p3/p1+p2)  L3=log(p4+p3+p2/p1)  Now, a positive B1 indicates that a higher value of X1 leads to greater chance of radiation sickness January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 13. Link=logit  fits the cumulative logit model when there are more than two response categories  Aggregate=radiation  Indicates that the goodness of fit statistics are to be calculated on the subpopulations of the variable: radiation  Scale=none  No correction is need for the dispersion parameter  To understand this, read up. This happens when the goodness of fit statistic exceeds its degrees of freedom and need to be corrected for January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 14. When we fit this model, the first output we see: Score Test for the Proportional Odds Assumption Chi-Square DF Pr > ChiSq 17.2866 21 0.6936  Null hypothesis is that the current proportional-odds cumulative logit model is true  Seems like we fail to reject the null and so can proceed to the rest of the output under the current assumption January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 15. Ultimately we are interested in the predicted probabilities OUTPUT <OUT=SAS-data-set><options>  Predicted=  For a cumulative model, it is the predicted cumulative probability (that is, the probability that the response variable is less than or equal to the value of _LEVEL_);  PREDPROBS=I or C  Individual|I requests the predicted probability of each response level.  CUMULATIVE | C requests the cumulative predicted probability of each response level January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India