{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Atomic Mass Evaluation (2016)\n", "\n", "In this brief notebook, we use the processed AME data to explore and visualize some attributes. Let us start by importing the necessary packages.\n", "\n", "**NOTE: This notebook is not meant to be a complete exploration resource. You are responsible for exploring and validating the data.**" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "ExecuteTime": { "end_time": "2021-05-05T21:52:48.689331Z", "start_time": "2021-05-05T21:52:48.685830Z" } }, "outputs": [], "source": [ "# # PROTOTYPE\n", "# import sys\n", "# sys.path.append(\"../..\")" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "ExecuteTime": { "end_time": "2021-05-05T21:52:51.355907Z", "start_time": "2021-05-05T21:52:49.813316Z" } }, "outputs": [], "source": [ "import pandas as pd\n", "import seaborn as sns\n", "import matplotlib.pyplot as plt\n", "import os\n", "\n", "pd.set_option('display.max_columns', 500)\n", "\n", "import nucml.datasets as nuc_data" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "ExecuteTime": { "end_time": "2021-05-05T21:52:52.177112Z", "start_time": "2021-05-05T21:52:52.173610Z" } }, "outputs": [], "source": [ "# This is were our figures will be stored\n", "fig_dir = \"Figures/\"" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Loading Merged AME Files with Natural Data AND with and without NaNs" ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "ExecuteTime": { "end_time": "2021-05-05T21:52:55.004018Z", "start_time": "2021-05-05T21:52:54.908518Z" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "INFO:root:AME: Reading and loading Atomic Mass Evaluation files from: \n", " C:/Users/Pedro/Desktop/ML_Nuclear_Data/AME/CSV_Files\\AME_all_merged.csv\n", "INFO:root:AME: Reading and loading Atomic Mass Evaluation files from: \n", " C:/Users/Pedro/Desktop/ML_Nuclear_Data/AME/CSV_Files\\AME_Natural_Properties_no_NaN.csv\n" ] } ], "source": [ "ame = nuc_data.load_ame()\n", "ame_filled = nuc_data.load_ame(natural=True, imputed_nan=True)" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "ExecuteTime": { "end_time": "2021-05-05T21:52:55.923825Z", "start_time": "2021-05-05T21:52:55.917325Z" } }, "outputs": [ { "data": { "text/plain": [ "(3436, 65)" ] }, "execution_count": 5, "metadata": {}, "output_type": "execute_result" } ], "source": [ "ame.shape" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "ExecuteTime": { "end_time": "2021-05-05T21:52:57.460824Z", "start_time": "2021-05-05T21:52:57.453322Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "46.16% of the rows have missing values.\n" ] } ], "source": [ "# How many rows with missing values exists?\n", "rows_w_missing = ame[ame.isnull().any(axis=1)].shape[0]\n", "print(\"{:.2f}% of the rows have missing values.\".format(100 * (rows_w_missing/ame.shape[0])))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We can get some quick statistics for each numerical feature of the AME dataset. The `count` value shows approximatedly how many missing values are per feature." ] }, { "cell_type": "code", "execution_count": 16, "metadata": { "ExecuteTime": { "end_time": "2021-02-23T20:05:48.389845Z", "start_time": "2021-02-23T20:05:48.236847Z" } }, "outputs": [ { "data": { "text/html": [ "
\n", " | N | \n", "Z | \n", "A | \n", "Mass_Excess | \n", "dMass_Excess | \n", "Binding_Energy | \n", "dBinding_Energy | \n", "B_Decay_Energy | \n", "dB_Decay_Energy | \n", "Atomic_Mass_Micro | \n", "dAtomic_Mass_Micro | \n", "S(2n) | \n", "dS(2n) | \n", "S(2p) | \n", "dS(2p) | \n", "Q(a) | \n", "dQ(a) | \n", "Q(2B-) | \n", "dQ(2B-) | \n", "Q(ep) | \n", "dQ(ep) | \n", "Q(B-n) | \n", "dQ(B-n) | \n", "S(n) | \n", "dS(n) | \n", "S(p) | \n", "dS(p) | \n", "Q(4B-) | \n", "dQ(4B-) | \n", "Q(d,a) | \n", "dQ(d,a) | \n", "Q(p,a) | \n", "dQ(p,a) | \n", "Q(n,a) | \n", "dQ(n,a) | \n", "Q(g,p) | \n", "Q(g,n) | \n", "Q(g,pn) | \n", "Q(g,d) | \n", "Q(g,t) | \n", "Q(g,He3) | \n", "Q(g,2p) | \n", "Q(g,2n) | \n", "Q(g,a) | \n", "Q(p,n) | \n", "Q(p,2p) | \n", "Q(p,pn) | \n", "Q(p,d) | \n", "Q(p,2n) | \n", "Q(p,t) | \n", "Q(p,3He) | \n", "Q(n,2p) | \n", "Q(n,np) | \n", "Q(n,d) | \n", "Q(n,2n) | \n", "Q(n,t) | \n", "Q(n,3He) | \n", "Q(d,t) | \n", "Q(d,3He) | \n", "Q(3He,t) | \n", "Q(3He,a) | \n", "Q(t,a) | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
count | \n", "3436.000000 | \n", "3436.000000 | \n", "3436.000000 | \n", "3436.000000 | \n", "3436.000000 | \n", "3436.000000 | \n", "3436.000000 | \n", "3141.000000 | \n", "3141.000000 | \n", "3.436000e+03 | \n", "3436.000000 | \n", "3199.000000 | \n", "3199.000000 | \n", "3081.000000 | \n", "3081.000000 | \n", "3298.000000 | \n", "3298.000000 | \n", "2848.000000 | \n", "2848.000000 | \n", "2964.000000 | \n", "2964.000000 | \n", "3023.000000 | \n", "3023.000000 | \n", "3318.000000 | \n", "3318.000000 | \n", "3258.000000 | \n", "3258.000000 | \n", "2284.000000 | \n", "2284.000000 | \n", "3367.000000 | \n", "3367.000000 | \n", "3331.000000 | \n", "3331.000000 | \n", "3195.000000 | \n", "3195.000000 | \n", "3258.000000 | \n", "3318.000000 | \n", "3367.000000 | \n", "3367.000000 | \n", "3331.000000 | \n", "3195.000000 | \n", "3081.000000 | \n", "3199.000000 | \n", "3298.000000 | \n", "3141.000000 | \n", "3258.000000 | \n", "3318.000000 | \n", "3318.000000 | \n", "3023.000000 | \n", "3199.000000 | \n", "3367.000000 | \n", "2964.000000 | \n", "3258.000000 | \n", "3258.000000 | \n", "3318.000000 | \n", "3367.000000 | \n", "3081.000000 | \n", "3318.000000 | \n", "3258.000000 | \n", "3141.000000 | \n", "3318.000000 | \n", "3258.000000 | \n", "
mean | \n", "82.034051 | \n", "57.857392 | \n", "139.891444 | \n", "-24144.120957 | \n", "123.588536 | \n", "7959.806728 | \n", "1.838921 | \n", "-100.991337 | \n", "155.851106 | \n", "1.398655e+08 | \n", "132.654412 | \n", "15464.510253 | \n", "156.349187 | \n", "13711.908945 | \n", "153.876813 | \n", "-1028.352902 | \n", "141.749175 | \n", "-158.406791 | \n", "148.478968 | \n", "-6755.747156 | \n", "150.761589 | \n", "-7807.666279 | \n", "151.842233 | \n", "7755.557459 | \n", "164.216263 | \n", "6869.773398 | \n", "160.008898 | \n", "-324.475306 | \n", "162.191690 | \n", "11414.314298 | \n", "175.347868 | \n", "5894.960498 | \n", "169.043993 | \n", "6792.835383 | \n", "160.967815 | \n", "-6869.773398 | \n", "-7755.557459 | \n", "-14656.779602 | \n", "-12432.213602 | \n", "-13918.904402 | \n", "-13784.784017 | \n", "-13711.908945 | \n", "-15464.510253 | \n", "-1028.352902 | \n", "-883.337837 | \n", "-6869.773398 | \n", "-7755.557459 | \n", "-5530.991459 | \n", "-8590.012779 | \n", "-6982.715353 | \n", "-6938.739202 | \n", "-5973.400656 | \n", "-6869.773398 | \n", "-4645.207398 | \n", "-7755.557459 | \n", "-6174.984602 | \n", "-5993.868545 | \n", "-1498.328459 | \n", "-1376.298998 | \n", "-119.583337 | \n", "12822.061941 | \n", "12944.091502 | \n", "
std | \n", "43.293558 | \n", "27.809406 | \n", "70.599410 | \n", "56200.705700 | \n", "197.547987 | \n", "738.982115 | \n", "15.031735 | \n", "8063.858254 | \n", "239.079983 | \n", "7.063095e+07 | \n", "212.043923 | \n", "6550.042919 | \n", "242.338186 | \n", "10078.915472 | \n", "235.628998 | \n", "6989.405614 | \n", "233.060316 | \n", "14319.652534 | \n", "217.688173 | \n", "12278.480250 | \n", "225.980065 | \n", "10536.785128 | \n", "229.821244 | \n", "3631.746683 | \n", "254.377651 | \n", "5444.802214 | \n", "250.538488 | \n", "23373.457604 | \n", "203.785478 | \n", "3977.016875 | \n", "264.596607 | \n", "4615.611122 | \n", "257.035502 | \n", "8215.124941 | \n", "246.518545 | \n", "5444.802214 | \n", "3631.746683 | \n", "3977.016875 | \n", "3977.016875 | \n", "4615.611122 | \n", "8215.124941 | \n", "10078.915472 | \n", "6550.042919 | \n", "6989.405614 | \n", "8063.858254 | \n", "5444.802214 | \n", "3631.746683 | \n", "3631.746683 | \n", "10536.785128 | \n", "6550.042919 | \n", "3977.016875 | \n", "12278.480250 | \n", "5444.802214 | \n", "5444.802214 | \n", "3631.746683 | \n", "3977.016875 | \n", "10078.915472 | \n", "3631.746683 | \n", "5444.802214 | \n", "8063.858254 | \n", "3631.746683 | \n", "5444.802214 | \n", "
min | \n", "0.000000 | \n", "0.000000 | \n", "1.000000 | \n", "-91652.853000 | \n", "0.000000 | \n", "-2267.000000 | \n", "0.000000 | \n", "-28945.000000 | \n", "0.000000 | \n", "1.007825e+06 | \n", "0.000000 | \n", "-3120.000000 | \n", "0.000000 | \n", "-7630.000000 | \n", "0.000000 | \n", "-25474.730000 | \n", "0.000000 | \n", "-37359.770000 | \n", "0.000000 | \n", "-52959.000000 | \n", "0.000000 | \n", "-39622.000000 | \n", "0.000000 | \n", "-2488.000000 | \n", "0.000000 | \n", "-4527.000000 | \n", "0.000000 | \n", "-59615.000000 | \n", "0.140000 | \n", "-4128.000000 | \n", "0.000000 | \n", "-13545.000000 | \n", "0.000000 | \n", "-26083.000000 | \n", "0.000000 | \n", "-31008.000000 | \n", "-27715.000000 | \n", "-30199.093900 | \n", "-27974.527900 | \n", "-33358.864900 | \n", "-46660.619400 | \n", "-55187.000000 | \n", "-40541.000000 | \n", "-25474.730000 | \n", "-29727.346500 | \n", "-31008.000000 | \n", "-27715.000000 | \n", "-25490.434000 | \n", "-40404.346500 | \n", "-32059.205100 | \n", "-22481.053500 | \n", "-52176.653500 | \n", "-31008.000000 | \n", "-28783.434000 | \n", "-27715.000000 | \n", "-21717.298900 | \n", "-47468.959600 | \n", "-21457.771000 | \n", "-25514.525600 | \n", "-28963.592000 | \n", "-7137.380600 | \n", "-11194.135100 | \n", "
25% | \n", "47.000000 | \n", "36.000000 | \n", "84.000000 | \n", "-65400.443000 | \n", "3.005750 | \n", "7730.734250 | \n", "0.024000 | \n", "-5356.454000 | \n", "6.406000 | \n", "8.392781e+07 | \n", "3.226500 | \n", "11068.100000 | \n", "5.400000 | \n", "5819.440000 | \n", "6.550000 | \n", "-6262.357500 | \n", "3.782500 | \n", "-9937.682500 | \n", "8.100000 | \n", "-14125.250000 | \n", "7.277500 | \n", "-14561.930000 | \n", "7.100000 | \n", "5414.787500 | \n", "5.140000 | \n", "2772.250000 | \n", "5.842500 | \n", "-17485.152500 | \n", "11.670000 | \n", "8767.635000 | \n", "6.160000 | \n", "3353.955000 | \n", "5.900000 | \n", "1986.590000 | \n", "6.190000 | \n", "-9949.867500 | \n", "-9654.657500 | \n", "-17303.458900 | \n", "-15078.892900 | \n", "-16459.909900 | \n", "-18591.029400 | \n", "-19423.390000 | \n", "-19082.195000 | \n", "-6262.357500 | \n", "-6138.800500 | \n", "-9949.867500 | \n", "-9654.657500 | \n", "-7430.091500 | \n", "-15344.276500 | \n", "-10600.400100 | \n", "-9585.418500 | \n", "-13342.903500 | \n", "-9949.867500 | \n", "-7725.301500 | \n", "-9654.657500 | \n", "-8821.663900 | \n", "-11705.349600 | \n", "-3397.428500 | \n", "-4456.393100 | \n", "-5375.046000 | \n", "10922.961900 | \n", "9863.997400 | \n", "
50% | \n", "81.000000 | \n", "58.000000 | \n", "139.000000 | \n", "-39335.024000 | \n", "14.474000 | \n", "8073.000000 | \n", "0.094000 | \n", "-858.530000 | \n", "27.916000 | \n", "1.389127e+08 | \n", "15.537500 | \n", "14614.250000 | \n", "23.320000 | \n", "11755.380000 | \n", "26.170000 | \n", "-430.320000 | \n", "18.690000 | \n", "-1685.250000 | \n", "28.090000 | \n", "-4924.250000 | \n", "28.120000 | \n", "-8243.100000 | \n", "27.950000 | \n", "7249.960000 | \n", "25.005000 | \n", "5799.335000 | \n", "24.815000 | \n", "-2903.405000 | \n", "42.070000 | \n", "11813.880000 | \n", "28.990000 | \n", "6308.000000 | \n", "26.760000 | \n", "8407.020000 | \n", "27.600000 | \n", "-5799.335000 | \n", "-7249.960000 | \n", "-14257.213900 | \n", "-12032.647900 | \n", "-13505.864900 | \n", "-12170.599400 | \n", "-11755.380000 | \n", "-14614.250000 | \n", "-430.320000 | \n", "-1640.876500 | \n", "-5799.335000 | \n", "-7249.960000 | \n", "-5025.394000 | \n", "-9025.446500 | \n", "-6132.455100 | \n", "-6539.173500 | \n", "-4141.903500 | \n", "-5799.335000 | \n", "-3574.769000 | \n", "-7249.960000 | \n", "-5775.418900 | \n", "-4037.339600 | \n", "-992.731000 | \n", "-305.860600 | \n", "-877.122000 | \n", "13327.659400 | \n", "14014.529900 | \n", "
75% | \n", "114.000000 | \n", "80.000000 | \n", "194.000000 | \n", "1172.301000 | \n", "196.000000 | \n", "8367.375000 | \n", "1.000000 | \n", "4700.000000 | \n", "236.000000 | \n", "1.939668e+08 | \n", "211.000000 | \n", "19082.195000 | \n", "228.000000 | \n", "19423.390000 | \n", "229.590000 | \n", "4561.017500 | \n", "200.000000 | \n", "8501.250000 | \n", "263.052500 | \n", "1947.500000 | \n", "251.000000 | \n", "-1284.525000 | \n", "235.500000 | \n", "9654.657500 | \n", "269.057500 | \n", "9949.867500 | \n", "232.250000 | \n", "14141.572500 | \n", "298.000000 | \n", "14414.505000 | \n", "284.000000 | \n", "9053.995000 | \n", "277.000000 | \n", "13247.000000 | \n", "259.275000 | \n", "-2772.250000 | \n", "-5414.787500 | \n", "-11656.588900 | \n", "-9432.022900 | \n", "-10759.869900 | \n", "-7330.619400 | \n", "-5819.440000 | \n", "-11068.100000 | \n", "4561.017500 | \n", "3917.653500 | \n", "-2772.250000 | \n", "-5414.787500 | \n", "-3190.221500 | \n", "-2066.871500 | \n", "-2586.305100 | \n", "-3938.548500 | \n", "2729.846500 | \n", "-2772.250000 | \n", "-547.684000 | \n", "-5414.787500 | \n", "-3174.793900 | \n", "1898.600400 | \n", "842.441500 | \n", "2721.224400 | \n", "4681.408000 | \n", "15162.831900 | \n", "17041.614900 | \n", "
max | \n", "177.000000 | \n", "118.000000 | \n", "295.000000 | \n", "201512.000000 | \n", "2003.000000 | \n", "8794.553000 | \n", "667.000000 | \n", "31687.000000 | \n", "2003.000000 | \n", "2.952163e+08 | \n", "2150.000000 | \n", "40541.000000 | \n", "2003.000000 | \n", "55187.000000 | \n", "2014.000000 | \n", "11920.000000 | \n", "2042.000000 | \n", "52098.000000 | \n", "2003.000000 | \n", "28352.000000 | \n", "2003.000000 | \n", "31755.000000 | \n", "2003.000000 | \n", "27715.000000 | \n", "2011.000000 | \n", "31008.000000 | \n", "2832.000000 | \n", "77377.000000 | \n", "2019.000000 | \n", "23846.530000 | \n", "2830.000000 | \n", "21413.860000 | \n", "2003.000000 | \n", "24299.000000 | \n", "2830.000000 | \n", "4527.000000 | \n", "2488.000000 | \n", "-2224.563900 | \n", "0.002100 | \n", "1599.995100 | \n", "3721.380600 | \n", "7630.000000 | \n", "3120.000000 | \n", "11920.000000 | \n", "30904.653500 | \n", "4527.000000 | \n", "2488.000000 | \n", "4712.566000 | \n", "30972.653500 | \n", "11601.794900 | \n", "5493.476500 | \n", "29134.346500 | \n", "4527.000000 | \n", "6751.566000 | \n", "2488.000000 | \n", "6257.231100 | \n", "15348.040400 | \n", "8745.229000 | \n", "10020.474400 | \n", "31668.408000 | \n", "23065.619400 | \n", "24340.864900 | \n", "