Merge branch 'develop'

updating poetry installation instructions
2020-03-09 12:26:00 +00:00 · 2020-03-09 12:25:45 +00:00 · 2020-03-09 12:22:43 +00:00 · 2020-03-09 12:22:35 +00:00 · 2020-03-09 12:12:28 +00:00 · 2020-03-09 12:12:11 +00:00
2 changed files with 30 additions and 0 deletions
--- a/README.rst
+++ b/README.rst
@@ -70,6 +70,27 @@ In the root of the repo in a virtual environment run:

    python ./setup.py install

+poetry
+------
+
+Clone the repo:
+
+.. code-block:: bash
+
+    git clone https://github.com/dtomlinson91/musicbrainzapi-cv-airelogic.git
+
+In a virtual environment install poetry:
+
+.. code-block:: bash
+
+    pip install poetry
+
+In the root of the repo in a virtual environment run:
+
+.. code-block:: bash
+
+    poetry install --no-dev
+
 Docker
 ------

--- a/docs/source/comments.rst
+++ b/docs/source/comments.rst
@@ -115,3 +115,12 @@ Although inelegant, and not guaranteed to capture the specific behaviour we want

 Musicbrainz provides a schema for their api. If this were to be placed in a production environment then readdressing this should be a priority - we should be checking the values returned, using the schema as a guide, and replacing missing values accordingly. We should not rely on ``try except`` blocks to do this as it can be unreliable and is prone to raise other errors.

+Further statistical analysis
+----------------------------
+
+Standard descriptive statistics are provided. I did consider including a more deeper analysis but opted not to for several reasons:
+
+- Without a specific problem or question to answer - explorative work can take a lot of time and may not yield satisfactory results. Questions I did consider are:
+
+    + `For active artists, based on their previous lyrics count what is the predicition of their next album?` Although a sensible question I'm not sure how useful the predicition would be - I am sure for some artists they would follow a pattern over time, but I'm not convinced all artists would and I imagine the results would be mixed. 
+    + `Anomaly detection - for artists with large releases, what albums stood out as larger than usual and what feature (or track) caused this anomaly?` - This would be a good question to answer and we have many tools available. As we have numeric data - clustering could be a candidate (DBSCAN or even K-MEANS). I opted not to because of time and the fact it would bloat the requirements up. Feature flags are an option when handling extra packages, ``pip install musicbrainzapi[analysis]`` for example, but nonetheless this would be an interesting question to answer and I beleive one of the easier ones to implement if it was desired.
Author	SHA1	Message	Date
dtomlinson	cd8117343b	Merge branch 'develop'	2020-03-09 12:26:00 +00:00
dtomlinson	8dc88f6361	updating poetry installation instructions	2020-03-09 12:25:45 +00:00
dtomlinson	306eb82237	Merge branch 'develop'	2020-03-09 12:22:43 +00:00
dtomlinson	0a77fa34fd	adding poetry to installation instructions	2020-03-09 12:22:35 +00:00
dtomlinson	5aefcc2a2d	Merge branch 'develop'	2020-03-09 12:12:28 +00:00
dtomlinson	0034340d63	updating comments document	2020-03-09 12:12:11 +00:00
dtomlinson	02cb79c4b2	Merge branch 'master' into develop	2020-03-09 11:57:22 +00:00
dtomlinson	26b346d359	Merge branch 'documentation'	2020-03-09 11:56:19 +00:00
dtomlinson	78544673b4	Merge branch 'develop'	2020-03-09 11:38:49 +00:00
dtomlinson	e8ce4b59f8	Merge branch 'documentation' into develop	2020-03-09 11:38:36 +00:00