Setup
Environment configuration and installation
The tutorial/practical will be done using Python.
See the dedicated official Fidle page for instructions about installing the required environment. More details about instructions for your OS:
Cloud computing
To simplify the configuration of your environment (and possibly to have access to a GPU1 for computations during tutorial/practical sessions), we have access to various cloud computing platforms:
- IFB2 cloud:
- connection with institutional account (you should ask to join the “LBMC” group);
- you can run the
Jupyterappliance to get a Jupyter server running Python.
- INSEE3 “datalab SSP cloud”:
- connection with institutional account through “AgentConnect” combined with “Edugained federation”;
- you can run the
Jupyter-pytorch-gpuservice to get a Jupyter server running Python pre-configured withPyTorchand with access to a GPU.
Configuration example on Linux/MacOS
Install Python
Create a dedicated folder on your computer (change the path to your convenience):
mkdir ~/fidle-tp
cd ~/fidle-tp- Create a Python environment for the training:
python3 -m venv fidle-env
source ./fidle-env/bin/activateWhen you open a new terminal/session, you will need to run source ./fidle-env/bin/activate once before using any commands/notebooks to activate the related Python environments.
- Install Pytorch (see the official install helper, especially if you are have a GPU4 or not5)
# source ./fidle-env/bin/activate # only if not done yet
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpufor Mac arm64 you can use the following command:
pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cpu- Install other requirements:
# source ./fidle-env/bin/activate # only if not done yet
pip install torch-geometric torchtext torchdata lightning \
tensorboard keras transformers numpy Scikit-image Scikit-learn \
Matplotlib plotly seaborn barviz pyarrow Pandas Pandoc \
pyyaml Jupyterlab fidle- Install notebooks and datasets (using the
fidcommand from thefidlepackage that we just installed):
# source ./fidle-env/bin/activate # only if not done yet
fid install --quiet- Check your setup:
# source ./fidle-env/bin/activate # only if not done yet
fid check- Start Jupyter lab:
# source ./fidle-env/bin/activate # only if not done yet
jupyter labAnd you are ready to play!
- Reinstall and updates
- You can reinstall notebooks or datasets with commands
fid install_notebooksandfid install_datasets. - You can also modify the datasets fidle folder, you need to specify the path using the environment variable
FIDLE_DATASETS_DIR.
Footnotes
which is not mandatory for the training but can be useful to speed up computations.↩︎
French National Institute of Statistics and Economic Studies↩︎
You will need to install Nvidia drivers and CUDA library, talk to us if you need any help.↩︎
Select “CPU” in that case.↩︎