Merge branch 'develop'

updating poetry installation instructions
2020-03-09 12:26:00 +00:00 · 2020-03-09 12:25:45 +00:00 · 2020-03-09 12:22:43 +00:00 · 2020-03-09 12:22:35 +00:00 · 2020-03-09 12:12:28 +00:00 · 2020-03-09 12:12:11 +00:00
2 changed files with 30 additions and 0 deletions
--- a/README.rst
+++ b/README.rst
@@ -70,6 +70,27 @@ In the root of the repo in a virtual environment run:
    python ./setup.py install
 poetry
 ------
 Clone the repo:
 .. code-block:: bash
    git clone https://github.com/dtomlinson91/musicbrainzapi-cv-airelogic.git
 In a virtual environment install poetry:
 .. code-block:: bash
    pip install poetry
 In the root of the repo in a virtual environment run:
 .. code-block:: bash
    poetry install --no-dev
 Docker
 ------
--- a/docs/source/comments.rst
+++ b/docs/source/comments.rst
@@ -115,3 +115,12 @@ Although inelegant, and not guaranteed to capture the specific behaviour we want
 Musicbrainz provides a schema for their api. If this were to be placed in a production environment then readdressing this should be a priority - we should be checking the values returned, using the schema as a guide, and replacing missing values accordingly. We should not rely on ``try except`` blocks to do this as it can be unreliable and is prone to raise other errors.
 Further statistical analysis
 ----------------------------
 Standard descriptive statistics are provided. I did consider including a more deeper analysis but opted not to for several reasons:
 - Without a specific problem or question to answer - explorative work can take a lot of time and may not yield satisfactory results. Questions I did consider are:
    + `For active artists, based on their previous lyrics count what is the predicition of their next album?` Although a sensible question I'm not sure how useful the predicition would be - I am sure for some artists they would follow a pattern over time, but I'm not convinced all artists would and I imagine the results would be mixed. 
    + `Anomaly detection - for artists with large releases, what albums stood out as larger than usual and what feature (or track) caused this anomaly?` - This would be a good question to answer and we have many tools available. As we have numeric data - clustering could be a candidate (DBSCAN or even K-MEANS). I opted not to because of time and the fact it would bloat the requirements up. Feature flags are an option when handling extra packages, ``pip install musicbrainzapi[analysis]`` for example, but nonetheless this would be an interesting question to answer and I beleive one of the easier ones to implement if it was desired.
Author	SHA1	Message	Date
dtomlinson	cd8117343b	Merge branch 'develop'	2020-03-09 12:26:00 +00:00
dtomlinson	8dc88f6361	updating poetry installation instructions	2020-03-09 12:25:45 +00:00
dtomlinson	306eb82237	Merge branch 'develop'	2020-03-09 12:22:43 +00:00
dtomlinson	0a77fa34fd	adding poetry to installation instructions	2020-03-09 12:22:35 +00:00
dtomlinson	5aefcc2a2d	Merge branch 'develop'	2020-03-09 12:12:28 +00:00
dtomlinson	0034340d63	updating comments document	2020-03-09 12:12:11 +00:00
dtomlinson	02cb79c4b2	Merge branch 'master' into develop	2020-03-09 11:57:22 +00:00
dtomlinson	26b346d359	Merge branch 'documentation'	2020-03-09 11:56:19 +00:00
dtomlinson	78544673b4	Merge branch 'develop'	2020-03-09 11:38:49 +00:00
dtomlinson	e8ce4b59f8	Merge branch 'documentation' into develop	2020-03-09 11:38:36 +00:00