Two-Stream Transformer Architecture for Long Form Video Understanding E Fish, J Weinbren, A Gilbert 33rd British Machine Vision Conference (BMVC) 2022, 2022 | 17* | 2022 |
Rethinking Genre Classification with Fine-grained Semantic Clustering E Fish, J Weinbren, A Gilbert 2021 IEEE International Conference on Image Processing (ICIP), 1274-1278, 2021 | 17* | 2021 |
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization E Fish, U Michieli, M Ozay Interspeech 2023, 2023 | 4 | 2023 |
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization E Fish, J Weinbren, A Gilbert NeurIPS 2023 (ML for Audio Workshop), 2310.03456, 2023 | 1 | 2023 |
Method for Personalisation of ASR Models E Fish, U Michieli, M Ozay US Patent App. 18/405,666, 2024 | | 2024 |
Advancing Efficiency and Accessibility in Multimodal Video Understanding with Deep Learning E Fish University of Surrey, 2024 | | 2024 |
Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization E Fish, J Weinbren, A Gilbert arXiv preprint arXiv:2403.18915, 2024 | | 2024 |
PLOT-TAL-Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization-Supplementary Material E Fish, J Weinbren, A Gilbert | | |