|
|
0af8bff4a8
|
Added comments and made a start on infer.py
|
2026-03-28 22:31:10 +00:00 |
|
|
|
19c0d4bce3
|
Started dataset.py, added the ReviewDataset class and implemented the __init__, __len__ and __getitem__ methods. The __getitem__ method currently just returns the review text, but will be updated to return the tokenized review as a tensor
|
2026-02-19 18:45:55 +00:00 |
|
|
|
c5e91b79b2
|
Decided on max_length by finding out how many and which reviews would be truncated (it will be 256 tokens)
|
2026-02-19 01:28:10 +00:00 |
|
|
|
0be7da2dde
|
Finally processed the data fully and tested. Moving on to dataset.py and model.py
|
2026-02-19 00:44:36 +00:00 |
|
|
|
608588f023
|
Preprocessed tagged datasets, fixed CSV formatting issues, and added integrity checks. Also saved mappings for later inference use.
|
2026-02-18 22:36:58 +00:00 |
|