11 Commits

Author SHA1 Message Date
43cd3c41b9 Fix gitignore and add LFS 2026-04-20 20:44:57 +01:00
dc91b522f0 Added models, outputs, data and tensorflow runs 2026-04-13 01:47:24 +01:00
82e6277cc1 Inference working with cli text, csv text, just need to add timings and I am done 2026-04-01 01:28:14 +01:00
0af8bff4a8 Added comments and made a start on infer.py 2026-03-28 22:31:10 +00:00
5206e62d95 Analysis started, almost complete - compiled some excel sheets from the csv output with notes. Started infer.py, nothing major implemented yet 2026-03-07 18:51:15 +00:00
19c0d4bce3 Started dataset.py, added the ReviewDataset class and implemented the __init__, __len__ and __getitem__ methods. The __getitem__ method currently just returns the review text, but will be updated to return the tokenized review as a tensor 2026-02-19 18:45:55 +00:00
c5e91b79b2 Decided on max_length by finding out how many and which reviews would be truncated (it will be 256 tokens) 2026-02-19 01:28:10 +00:00
0be7da2dde Finally processed the data fully and tested. Moving on to dataset.py and model.py 2026-02-19 00:44:36 +00:00
608588f023 Preprocessed tagged datasets, fixed CSV formatting issues, and added integrity checks. Also saved mappings for later inference use. 2026-02-18 22:36:58 +00:00
charlie-rasberry b88504725d cleaned notebooks, finished datalabelling 2026-02-16 12:36:29 +00:00
charlie-rasberry 8d3dee6d30 House Cleaning 2026-01-28 16:41:27 +00:00