Clusterize FRB profiles using hierarchical clustering, plot corresponding parameters distributions
frbmclust
Getting Started
Clusterize FRB profiles using hierarchical clustering, plot corresponding parameters distributions.
Input files (waterfall data, CHIME catalog) are stored in the resources/
directory, output files are stored in the plots/
directory.
Data used in the clustering belongs to the CHIME/FRB collaboration, CHIME/FRB Catalog 1 Paper.
Prerequisites
-
Python 3.7+
-
The vos Python module for downloading waterfall data: CANFAR storage system
pip install vos --upgrade --user
-
Python code related to DM calculations, estimations, and analysis: FRB
-
For FRB dependencies see: Installing FRB
git clone https://github.com/FRBs/FRB.git cd FRB ; python setup.py install --user ; cd ..
git clone https://github.com/FRBs/ne2001.git cd ne2001 ; python setup.py install --user ; cd ..
-
-
astropy v4.3+
pip install --upgrade pip pip install astropy
-
Installation/upgrade of side packages, which might be required for the version compatibility
pip install --user --upgrade lxml pandas ipython scipy matplotlib seaborn h5py
-
We download
cosmocalc
v0.1.2 and refactor it to be able to run under Python 3 (2to3
software is used and should be installed via package manager):curl https://files.pythonhosted.org/packages/ca/61/df95b65cd3cf3abdffcbeea6c77a6d8964a8fb625a88eccd9c73d2bfdbf0/cosmocalc-0.1.2.tar.gz | tar -x cd cosmocalc-0.1.2 ; 2to3 -w cosmocalc.py ; python setup.py install --user ; cd ..
Configuration
For the fast reproduction of the main results one can run python frb_profiles_clustering.py --use-preprocessed --use-precalculated-z
.
Instead of preprocessed signal data, waterfall data can also be used, which is located in CANFAR storage and which can be downloaded with the -d
flag.
For the data to be used, it shoud be in the resources/
folder and have a following name: {frb_name}_waterfall.h5
To obtain preprocessed signals, run python repack_waterfall.py
. It would download waterfall data from CANFAR storage one at a time and save only calibrated profiles – mean of calibrated waterfall data over each frequency channel.
When calling python frb_profiles_clustering.py
several flags are available:
-d, --download
: download waterfall data from CANFAR storage-ef, --emission_freq
: set emission frequency (MHz) used in brightness temperature calculations. Default value is 600 MHz--use-preprocessed
: use preprocessed signal profiles--use-precalculated-z
: use precalculated red-shift values--use-latex
: enable LaTeX support for plots