Try using regression models to predict transposition insertion depending on the genomic length and position
Created by: leilaicruz
It will be nice to apply statistical inference to a our large dataset to try to make meaningful predictions from the data. One example is to predict for example the continous variable of the number of transposon insertions per ORF given the length and the position of it. For this what is recommended is to use Regression models. Whether we need linear or non linear models , we have to "discover it" by looking at the relationships between the variables . For inspiration , and examples you can look in this folder LINK where I have some examples notebooks to try things out