<h1>Methods<a class="headerlink" href="#methods" title="Permalink to this headline">¶</a></h1>
<p>Two branches of statistical learning tools are widely used today:</p>
<dl class="simple">
<dt>Unsupervised Learning:</dt><dd><p>An unsupervised learning method takes inbound unlabeled data and extracts or discovers classificiations and labels in the data.</p>
</dd>
<dt>Supervised Learning</dt><dd><p>In a supervised learning context, inbound data are labeled and an analyst builds features that they likely believe will hold predictive power. They might use an understanding of the physics or mechanisms of a system to decide on those features.</p>
</dd>
</dl>
<p>This study explored the use of two supervised learning methods:</p>
<p>Logistic Regression:</p>
<blockquote>
<div><p>A binary classification algorithm which assigns a probability that some set of features could be labeled in a certain way.</p>
<h2>Statistical Learning Process<a class="headerlink" href="#statistical-learning-process" title="Permalink to this headline">¶</a></h2>
<ol class="arabic simple">
<li><p>A class of variables that we want to predict and use to predict are labeled in an existing dataset. The predictive variables are called “estimators”</p></li>
<li><p>The data are partitioned into two parts - a “training” set which is used to develop the model, and a “test” set which is used to validate the model.</p></li>
<li><p>A statistical model is “fit” to the training data, providing a statistical function which could take in new observations and predict the originals labels.</p></li>
<li><p>The model is used to predict the “test” dataset, and the true label values are compared against the predicted label values.</p></li>
<li><p>Performance metrics of the model are calculated against the test data.</p></li>
<li><p>If the model is robust, it could in principle be deployed (either in a programmatic or manual environment) to ingest a future data stream where events are not known a-priori.</p></li>
</ol>
<p>The choice of a statistical model</p>
</div>
<div class="section" id="tooling">
<h2>Tooling<a class="headerlink" href="#tooling" title="Permalink to this headline">¶</a></h2>
<p>The process above was implemented in scikit-learn <a class="bibtex reference internal" href="zreferences.html#scikit-learn" id="id1">[PVG+11]</a>, a popular machine learning library in the python ecosystem. Other tools of note include:</p>
<dl class="simple">
<dt><a class="reference external" href="https://pycaret.org/">pycaret</a></dt><dd><p>PyCaret is a machine learning framework for quickly producing un-optimized a</p>
Press p or to see the previous file or,
n or to see the next file
Comments
Integrate Google Cloud Storage
Use Google Storage
Select bucket
Upload key
Finish
Use Google Cloud Storage!
Browsing data directories saved to Google Cloud Storage is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your Google Storage bucket
Congratulations!
CaReCur is now integrated with Google Cloud Storage!
Delete Storage Key
Are you sure you want to delete this access key?
No
Yes
Integrate AWS S3
Use S3 remote
Select bucket
Access key
Finish
Use AWS S3 as storage!
Browsing data directories saved to S3 is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your S3 bucket
Select Region
af-south-1 - Africa (Cape Town)
ap-northeast-1 - Asia Pacific (Tokyo)
ap-northeast-2 - Asia Pacific (Seoul)
ap-south-1 - Asia Pacific (Mumbai)
ap-southeast-1 - Asia Pacific (Singapore)
ap-southeast-2 - Asia Pacific (Sydney)
ca-central-1 - Canada (Central)
eu-central-1 - EU (Frankfurt)
eu-north-1 - EU (Stockholm)
eu-west-1 - EU (Ireland)
eu-west-2 - EU (London)
eu-west-3 - EU (Paris)
sa-east-1 - South America (São Paulo)
us-east-1 - US East (N. Virginia)
us-east-2 - US East (Ohio)
us-gov-east-1 - US Gov East 1
us-gov-west-1 - US Gov West 1
us-west-1 - US West (N. California)
us-west-2 - US West (Oregon)
Congratulations!
CaReCur is now integrated with AWS S3!
Delete Storage Key
Are you sure you want to delete this access key?
No
Yes
Integrate S3 compatible storage
Use S3 like remote
Select bucket
Access key
Finish
Use any S3 compatible storage!
Browsing data directories saved to S3 compatible storage is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your S3 bucket
Congratulations!
CaReCur is now integrated with your S3 compatible storage!
Delete Storage Key
Are you sure you want to delete this access key?
No
Yes
Integrate Azure Cloud Storage
Use Azure Storage
Select bucket
Set key
Finish
Use Azure Cloud Storage!
Browsing data directories saved to Azure Cloud Storage is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your Azure Storage bucket
Congratulations!
CaReCur is now integrated with Azure Cloud Storage!