I am working with crash data that has 14 inputs and 1 output. The output has two categories: 1. injury, 2. fatality.
The second category has few observations and my model is not able to reproduce them, so I oversampled the data. However, I cannot find a good model formula to pass to fitnlm, so the accuracy of my results is very low. Is there a specific toolbox or method for choosing the best model function or the initial betas for fitnlm?
This is how I set up the model and the data:
fun1 = 'C15 ~ ((b2*C2) + (b3*C3) + b44*exp(b4*C4) + (b5*C5^2) + exp(b8*C8) + (b13*C13^2) + (b12*C12) + (b913*C9*C13) + (b7*C7) + (b108*C10*C8)) / (1 + b90*C9)';
mdl1 = fitnlm([INC.C2 INC.C3 INC.C4 INC.C5 INC.C8 INC.C13 INC.C12 INC.C7 INC.C9 INC.C10], INC.C15, fun1, beta01)
fit1 = predict(mdl1, [INC.C2 INC.C3 INC.C4 INC.C5 INC.C8 INC.C13 INC.C12 INC.C7 INC.C9 INC.C10]);