Scale-MAE: A scale-aware masked autoencoder for multiscale geospatial representation learning | CJ Reed, R Gupta, S Li, S Brockman, C Funk, B Clipp, K Keutzer, ... | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 56 | 2023 |
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | S Li, H Singh, A Grover | arXiv preprint arXiv:2402.05892, 2024 | 14 | 2024 |
Hierarchical open-vocabulary universal image segmentation | X Wang, S Li, K Kallidromitis, Y Kato, K Kozuka, T Darrell | Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023 | 14 | 2023 |
Refine and represent: Region-to-object representation learning | A Gokul, K Kallidromitis, S Li, Y Kato, K Kozuka, T Darrell, CJ Reed | arXiv preprint arXiv:2208.11821, 2022 | 6 | 2022 |
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following | S Li, H Singh, A Grover | arXiv preprint arXiv:2312.06738, 2023 | 2 | 2023 |
Aligning Diffusion Models by Optimizing Human Utility | S Li, K Kallidromitis, A Gokul, Y Kato, K Kozuka | arXiv preprint arXiv:2404.04465, 2024 | | 2024 |
xT: Nested Tokenization for Larger Context in Large Images | R Gupta, S Li, T Zhu, J Malik, T Darrell, K Mangalam | arXiv preprint arXiv:2403.01915, 2024 | | 2024 |