Commit Graph

  • 43cd3c41b9 Fix gitignore and add LFS main archive charlie-rasberry 2026-04-20 20:44:57 +01:00
  • a83cc9804a Remove pycache and add to gitignore charlie-rasberry 2026-04-13 01:48:36 +01:00
  • dc91b522f0 Added models, outputs, data and tensorflow runs charlie-rasberry 2026-04-13 01:47:24 +01:00
  • 1cca27e0b8 added further documentation across all files charlie-rasberry 2026-04-05 14:19:57 +01:00
  • 7fa67af6c0 added timestamps to infer.py and tested it charlie-rasberry 2026-04-01 02:33:47 +01:00
  • 82e6277cc1 Inference working with cli text, csv text, just need to add timings and I am done charlie-rasberry 2026-04-01 01:28:14 +01:00
  • 1e8ea39287 inference shouldn't need much to complete charlie-rasberry 2026-03-29 00:17:05 +00:00
  • 72c27aca13 Merge branch 'main' of github.com:charlie-rasberry/6013 Pushed diagram via web charlie-rasberry 2026-03-28 22:34:44 +00:00
  • 0af8bff4a8 Added comments and made a start on infer.py charlie-rasberry 2026-03-28 22:31:10 +00:00
  • 6145ca0333 Rename architecture_gh.png to architecture.png Charlie Rasberry 2026-03-26 18:32:09 +00:00
  • ef557e663a Add files via upload Charlie Rasberry 2026-03-26 18:31:43 +00:00
  • 753723694b Added some comments and readability charlie-rasberry 2026-03-24 18:11:31 +00:00
  • afe61eaaa2 Merge branch 'main' of github.com:charlie-rasberry/6013 charlie-rasberry 2026-03-07 18:52:38 +00:00
  • 5206e62d95 Analysis started, almost complete - compiled some excel sheets from the csv output with notes. Started infer.py, nothing major implemented yet charlie-rasberry 2026-03-07 18:51:15 +00:00
  • a8aaa077c7 Delete src/__pycache__ directory Charlie Rasberry 2026-02-26 20:39:59 +00:00
  • cabf8aa9b5 Fixed evaluation indentation and other bugs charlie-rasberry 2026-02-26 20:39:19 +00:00
  • 99896c0873 Added evaluation pipeline charlie-rasberry 2026-02-26 20:15:19 +00:00
  • 96a0c45e84 Added implementation for single task roberta, using args for everything made it simple charlie-rasberry 2026-02-26 18:21:13 +00:00
  • 01e2142276 Fixed a few issues with performance data collection and debugging output, mtl training is ready, moving on to single-task training to compare in write-up charlie-rasberry 2026-02-26 17:40:37 +00:00
  • df6aec7165 ignore runs charlie-rasberry 2026-02-23 16:29:44 +00:00
  • 9467ea2519 ignore tensorboard logs and model checkpoints charlie-rasberry 2026-02-23 16:28:43 +00:00
  • 4f0c54fe28 Added training loop for the MTL architecture on the original distribution charlie-rasberry 2026-02-23 16:26:48 +00:00
  • 7bd68108d0 Implemented initial training structure, adding further logic soon including loss, stopping, optimisation and loop charlie-rasberry 2026-02-23 12:54:23 +00:00
  • 76d9b8509b Model almost complete, need to work on loss functions soon charlie-rasberry 2026-02-20 19:17:22 +00:00
  • cccd91a680 Small bit of progress towards model.py, now building forward() charlie-rasberry 2026-02-20 18:18:17 +00:00
  • 61df4e3e26 Implemented dataset.py which tokenises and returns tensors, ready to load the model now charlie-rasberry 2026-02-19 22:10:25 +00:00
  • 19c0d4bce3 Started dataset.py, added the ReviewDataset class and implemented the __init__, __len__ and __getitem__ methods. The __getitem__ method currently just returns the review text, but will be updated to return the tokenized review as a tensor charlie-rasberry 2026-02-19 18:45:55 +00:00
  • 19bcf2aa18 Started dataset.py, added the ReviewDataset class and implemented the __init__, __len__ and __getitem__ methods. The __getitem__ method currently just returns the review text, but will be updated to return the tokenized review as a tensor charlie-rasberry 2026-02-19 18:41:37 +00:00
  • c5e91b79b2 Decided on max_length by finding out how many and which reviews would be truncated (it will be 256 tokens) charlie-rasberry 2026-02-19 01:28:10 +00:00
  • 0be7da2dde Finally processed the data fully and tested. Moving on to dataset.py and model.py charlie-rasberry 2026-02-19 00:44:36 +00:00
  • 608588f023 Preprocessed tagged datasets, fixed CSV formatting issues, and added integrity checks. Also saved mappings for later inference use. charlie-rasberry 2026-02-18 22:36:58 +00:00
  • 94a9fa1f17 gitignore change charlie-rasberry 2026-02-16 16:51:32 +00:00
  • 8dbc5e7fc1 Remove duplicate repository structure heading Charlie Rasberry 2026-02-16 12:42:16 +00:00
  • c006b2fcff Fix formatting in README for repository structure Charlie Rasberry 2026-02-16 12:41:53 +00:00
  • b88504725d cleaned notebooks, finished datalabelling charlie-rasberry 2026-02-16 12:36:29 +00:00
  • 8d3dee6d30 House Cleaning charlie-rasberry 2026-01-28 16:41:27 +00:00
  • 6cf36faf64 Line ending issue with my setup charlie-rasberry 2025-12-19 07:19:02 +00:00
  • 487be5cd27 Everything is good to go for annotations. charlie-rasberry 2025-12-19 07:14:13 +00:00
  • 5b9fbfc75e data processing pipeline now finished just need to annotate reviews charlie-rasberry 2025-11-22 09:41:12 +00:00
  • 45ec02fa46 Moving on to multitag.py, sampling complete I think charlie-rasberry 2025-11-12 06:21:16 +00:00
  • 2cbdd55243 Fixed get_stratified_sample() and replace broken x() with actual working logic, added sample_with_keywords(). charlie-rasberry 2025-11-12 02:05:20 +00:00
  • a178284ffc Added multitag.py (65% complete), preprocess.py (complete), sampler.py (80% complete) charlie-rasberry 2025-11-09 01:45:09 +00:00
  • 4d6e2511e6 Added multitag, includes preprocess.py, sampler.py and multitag.py(the main gui for labelling/annotation) charlie-rasberry 2025-11-06 17:40:29 +00:00
  • c0d4c13824 Ignore large CSV data files charlie-rasberry 2025-11-06 17:39:26 +00:00
  • cf6f2d8371 uber review charlie-rasberry 2025-10-08 00:19:49 +01:00
  • 3c51a51331 initial commit charlie-rasberry 2025-10-07 23:47:48 +01:00