*[CEGM1000 MUDE](http://mude.citg.tudelft.nl/): Week 1.8, Friday, Oct 25, 2024.*
GA 1.8 investigates the effect of dependence by applying multivariate distributions to the data sets from the previous assignment, GA 1.7.
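To make "the effect of dependence" concrete, here is a minimal sketch that is not part of the assignment. It assumes two standard normal variables, a hypothetical correlation of 0.8 and a hypothetical threshold of 1.0; none of these values come from the data sets. It estimates by Monte Carlo simulation how the probability that both variables exceed the threshold changes when they are correlated instead of independent.

```python
import math
import random


def joint_exceedance(rho, threshold=1.0, n=100_000, seed=42):
    """Monte Carlo estimate of P(X > threshold, Y > threshold).

    X and Y are standard normal with correlation rho.
    All default values are illustrative assumptions.
    """
    rng = random.Random(seed)
    scale = math.sqrt(1.0 - rho**2)
    hits = 0
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rng.gauss(0.0, 1.0)
        x = z1
        y = rho * z1 + scale * z2  # correlated with x when rho != 0
        if x > threshold and y > threshold:
            hits += 1
    return hits / n


p_independent = joint_exceedance(rho=0.0)
p_dependent = joint_exceedance(rho=0.8)
print(f"independent: {p_independent:.4f}")
print(f"dependent:   {p_dependent:.4f}")
```

With positive correlation the joint exceedance probability should come out noticeably larger than in the independent case, which is the kind of difference the multivariate analysis is meant to reveal.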
There are several files to be aware of:
1. `README.md`
2. Three subdirectories, one for each of the three data sets, each containing the "usual" files (`Analysis.ipynb`, `Report.md` and `*.csv`)
First, you should **choose a data set** to use for GA 1.8. We recommend using the same one you used last week, but you are welcome to switch if you like. See the README for GA 1.7 for a refresher on the context.
It might be useful to start out by adding your notebook from last week to your repository (copy/paste > commit > push!).
Note that the Report lists the specific distributions that you should choose from, which are unique to each data set; **read the Report to see which distributions you should choose from!**
As in previous weeks:
- complete the analysis in `Analysis.ipynb` then answer the questions in `Report.md` (but some group members can start working on the Report in parallel!).
- Markdown cells in the notebook are primarily for your own benefit; answers in the Report are more important.
When **submitting your assignment**, make sure your `Analysis.ipynb` and `Report.md` files are in the root directory of the repository (i.e., in the same folder as this README file).
## Task Overview
We recommend leaving around 45-60 minutes at the end of the session to work on the Report.
Many of the tasks in the notebook, as well as some of the Report, can be completed independently. Here are a few tips:
- Inspect the data together and write down a few observations that you all agree on. This will become your first answer in the Report.
- You can split up and implement the code needed for many of the notebook tasks (some of the code from the WS will be useful to recycle).
- Once the analysis is complete, meet with **all** group members to review the results, then decide on the "best" distribution (write down a few reasons to add to the justification in the Report).
If you are short on time, remember that it is more important to have answers in the Report than a completed notebook. Explain any issues you faced with the programming in your Report.