More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation | Quanfu Fan · Chun-Fu (Richard) Chen · Hilde Kuehne · Marco Pistoia · David Cox |
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition | Jinwoo Choi · Chen Gao · Joseph C. E. Messou · Jia-Bin Huang |
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos | Yitian Yuan · Lin Ma · Jingwen Wang · Wei Liu · Wenwu Zhu |
U-Time: A Fully Convolutional Network for Time Series Segmentation Applied to Sleep Staging | Mathias Perslev · Michael Jensen · Sune Darkner · Poul Jørgen Jennum · Christian Igel |