Repositories list
63 repositories
EfficientViM (Public): Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"
LLaMo (Public): Official Implementation (Pytorch) of the "LLaMo: Large Language Model-based Molecular Graph Assistant", NeurIPS 2024
CAF (Public)
VidChain (Public): Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning", AAAI 2025
SugaFormer (Public)
DialogGSR (Public)
SPoTr (Public)
SCDM (Public): Official PyTorch implementation of "Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis" (ICML 2024).
InvBO (Public): Official Implementation (Pytorch) of "Inversion-based Latent Bayesian Optimization", NeurIPS 2024
COSE474_2024 (Public)
CoBO (Public): Official PyTorch Implementation for Advancing Bayesian Optimization via Learning Correlated Latent Space (CoBO)
RALF (Public): Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
ProMetaR (Public): Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".
DAVI-project (Public)
DAVI (Public): Official Implementation (Pytorch) of "DAVI: Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems", ECCV 2024 Oral paper
Flipped-VQA (Public): Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
DDMI (Public): Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations", ICLR 2024
data303 (Public)
vid-TLDR (Public): Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
MCTF (Public): Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".
OVQA (Public): Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 2023)
MELTR (Public): MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
VT-TWINS (Public): Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022)
PDC (Public)
SpeaQ (Public)
UP-NeRF (Public): Official Implementation (PyTorch) of "UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields", NeurIPS 2023
DAPT (Public): Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)
NuTrea (Public): Official implementation of NeurIPS 2023 paper, "NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA".
RPO (Public)