title,authors,arxiv,github,hf_paper,hf_space,hf_model,hf_dataset,arxiv_id,n_authors,n_linked_authors Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels,"Xuchen You, Shouvanik Chakrabarti, Boyang Chen, Xiaodi Wu",http://arxiv.org/abs/2303.14844,,https://huggingface.co/papers/2303.14844,,,,2303.14844,4,0 BiBench: Benchmarking and Analyzing Network Binarization,"Haotong Qin, Mingyuan Zhang, Yifu Ding, Aoyu Li, Zhongang Cai, Ziwei Liu, Fisher Yu, Xianglong Liu",http://arxiv.org/abs/2301.11233,https://github.com/htqin/BiBench,https://huggingface.co/papers/2301.11233,,,,2301.11233,8,1 Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning,"Taoan Huang, Aaron Ferber, Yuandong Tian, Bistra Dilkina, Benoit Steiner",http://arxiv.org/abs/2302.01578,,https://huggingface.co/papers/2302.01578,,,,2302.01578,5,1 Optimizing DDPM Sampling with Shortcut Fine-Tuning,"Ying Fan, Kangwook Lee",http://arxiv.org/abs/2301.13362,,https://huggingface.co/papers/2301.13362,,,,2301.13362,2,2 Evolving Semantic Prototype Improves Generative Zero-Shot Learning,"Shiming Chen, Wenjin Hou, Ziming Hong, Xiaohan Ding, Yibing Song, Xinge You, Tongliang Liu, Kun Zhang",http://arxiv.org/abs/2306.06931,,https://huggingface.co/papers/2306.06931,,,,2306.06931,8,1 Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning,"Yihao Sun, Jiaji Zhang, Chengxing Jia, Haoxin Lin, Junyin Ye, Yang Yu",,,,,,,,, CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification,"Nan Yin, Li Shen, Mengzhu Wang, Long Lan, Zeyu Ma, Chong Chen, Xian-Sheng Hua, Xiao Luo, Xiao Luo",http://arxiv.org/abs/2306.04979,,https://huggingface.co/papers/2306.04979,,,,2306.04979,8,0 GFlowNet-EM for Learning Compositional Latent Variable Models,"Edward Hu, Nikolay Malkin, Moksh Jain, Katie Everett, Alexandros Graikos, Yoshua Bengio",http://arxiv.org/abs/2302.06576,,https://huggingface.co/papers/2302.06576,,,,2302.06576,6,1 Optimal LP Rounding and Linear-Time Approximation Algorithms for Clustering Edge-Colored Hypergraphs,Nate Veldt,http://arxiv.org/abs/2208.06506,,https://huggingface.co/papers/2208.06506,,,,2208.06506,1,0 CLUSTSEG: Clustering for Universal Segmentation,"James Liang, Tianfei Zhou, Dongfang Liu, Wenguan Wang",http://arxiv.org/abs/2305.02187,,https://huggingface.co/papers/2305.02187,,,,2305.02187,4,0 Optimizing Mode Connectivity for Class Incremental Learning,"Haitao Wen, haoyang cheng, Heqian Qiu, Lanxiao Wang, Lili Pan, Hongliang Li",,,,,,,,, "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition","Shentong Mo, Pedro Morgado",http://arxiv.org/abs/2305.19458,,https://huggingface.co/papers/2305.19458,,,,2305.19458,2,1 UPSCALE: Unconstrained Channel Pruning,"Alvin Wan, Hanxiang Hao, Kaushik Patnaik, Yueyang Xu, Omer Hadad, David Güera, Zhile Ren, Qi Shan",,,,,,,,, Rethinking Explaining Graph Neural Networks via Non-parametric Subgraph Matching,"Fang Wu, Siyuan Li, Yinghui Jiang, Xurui Jin, Dragomir Radev, Zhangming Niu, Stan Z Li",,,,,,,,, A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests,"Bohang Zhang, Guhao Feng, Yiheng Du, Di He, Liwei Wang",http://arxiv.org/abs/2302.07090,,https://huggingface.co/papers/2302.07090,,,,2302.07090,5,0 Which Tricks are Important for Learning to Rank?,"Ivan Lyzhin, Aleksei Ustimenko, Andrey Gulin, Liudmila Prokhorenkova",http://arxiv.org/abs/2204.01500,,https://huggingface.co/papers/2204.01500,,,,2204.01500,4,0 Near-Optimal $\Phi$-Regret Learning in Extensive-Form Games,"Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm",,,,,,,,, Long Horizon Temperature Scaling,"Andy Shih, Dorsa Sadigh, Stefano Ermon",http://arxiv.org/abs/2302.03686,,https://huggingface.co/papers/2302.03686,,,,2302.03686,3,0 Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization,"Chris Junchi Li, Angela Yuan, Gauthier Gidel, Quanquan Gu, Michael Jordan",,,,,,,,, Opponent-Limited Online Search for Imperfect Information Games,"Weiming Liu, Haobo Fu, Qiang Fu, Wei Yang",,,,,,,,, The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms,"Anirudh Vemula, Yuda Song, Aarti Singh, J. Bagnell, Sanjiban Choudhury",http://arxiv.org/abs/2303.00694,,https://huggingface.co/papers/2303.00694,,,,2303.00694,5,0 A Nearly-Optimal Bound for Fast Regression with $\ell_\infty$ Guarantee,"Zhao Song, Mingquan Ye, Junze Yin, Lichen Zhang",http://arxiv.org/abs/2302.00248,,https://huggingface.co/papers/2302.00248,,,,2302.00248,4,0 On the Optimality of Misspecified Kernel Ridge Regression,"Haobo Zhang, Yicheng Li, Weihao Lu, Qian Lin",http://arxiv.org/abs/2305.07241,,https://huggingface.co/papers/2305.07241,,,,2305.07241,4,0 Robust Perception through Equivariance,"Chengzhi Mao, Lingyu Zhang, Abhishek Joshi, Junfeng Yang, Hao Wang, Carl Vondrick",http://arxiv.org/abs/2212.06079,,https://huggingface.co/papers/2212.06079,,,,2212.06079,6,1 Causal Strategic Classification: A Tale of Two Shifts,"Guy Horowitz, Nir Rosenfeld",http://arxiv.org/abs/2302.06280,,https://huggingface.co/papers/2302.06280,,,,2302.06280,2,0 Bit Allocation using Optimization,"Tongda Xu, Han Gao, Chenjian Gao, Yuanyuan Wang, Dailan He, Jinyong Pi, Jixiang Luo, Ziyu Zhu, Mao Ye, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang",http://arxiv.org/abs/2209.09422,https://github.com/tongdaxu/Bit-Allocation-Using-Optimization,https://huggingface.co/papers/2209.09422,,,,2209.09422,13,1 Estimating Possible Causal Effects with Latent Variables via Adjustment,"Tian-Zuo Wang, Tian Qin, Zhi-Hua Zhou",,,,,,,,, Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining,"Zekun Qi, Runpei Dong, Guofan Fan, Zheng Ge, Xiangyu Zhang, Kaisheng Ma, Li Yi",,,,,,,,, Model-Free Robust Average-Reward Reinforcement Learning,"Yue Wang, Alvaro Velasquez, George Atia, Ashley Prater-Bennette, Shaofeng Zou",http://arxiv.org/abs/2305.10504,,https://huggingface.co/papers/2305.10504,,,,2305.10504,5,0 "Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning",Thomas Miconi,http://arxiv.org/abs/2112.08588,,https://huggingface.co/papers/2112.08588,,,,2112.08588,1,1 Moderately Distributional Exploration for Domain Generalization,"Rui Dai, Yonggang Zhang, zhen fang, Bo Han, Xinmei Tian",http://arxiv.org/abs/2304.13976,,https://huggingface.co/papers/2304.13976,,,,2304.13976,5,0 A General Representation Learning Framework with Generalization Performance Guarantees,"Junbiao Cui, Jianqing Liang, Qin Yue, Jiye Liang",,,,,,,,, On the Impact of Knowledge Distillation for Model Interpretability,"Hyeongrok Han, Siwon Kim, Hyun-Soo Choi, Sungroh Yoon",http://arxiv.org/abs/2305.15734,,https://huggingface.co/papers/2305.15734,,,,2305.15734,4,2 Submodular Order Functions and Assortment Optimization,Rajan Udwani,http://arxiv.org/abs/2107.02743,,https://huggingface.co/papers/2107.02743,,,,2107.02743,1,0 Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression,"JUNFAN LI, Shizhong Liao",http://arxiv.org/abs/2306.08320,,https://huggingface.co/papers/2306.08320,,,,2306.08320,2,0 Is Learning Summary Statistics Necessary for Likelihood-free Inference?,"Yanzhi Chen, Michael Gutmann, Adrian Weller",,,,,,,,, FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels,"Guillaume Staerman, Cédric Allain, Alexandre Gramfort, Thomas Moreau",http://arxiv.org/abs/2210.04635,,https://huggingface.co/papers/2210.04635,,,,2210.04635,4,1 Learning Preconditioner for Conjugate Gradient PDE Solvers,"Yichen Li, Peter Yichen Chen, Tao Du, Wojciech Matusik",http://arxiv.org/abs/2305.16432,,https://huggingface.co/papers/2305.16432,,,,2305.16432,4,0 Muse: Text-To-Image Generation via Masked Generative Transformers,"Huiwen Chang, Han Zhang, Jarred Barber, Aaron Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan",http://arxiv.org/abs/2301.00704,,https://huggingface.co/papers/2301.00704,,,,2301.00704,12,2 Men Also Do Laundry: Multi-Attribute Bias Amplification,"Dora Zhao, Jerone Andrews, Alice Xiang",http://arxiv.org/abs/2210.11924,,https://huggingface.co/papers/2210.11924,,,,2210.11924,3,0 TIPS: Topologically Important Path Sampling for Anytime Neural Networks,"Guihong Li, Kartikeya Bhardwaj, Yuedong Yang, Radu Marculescu",http://arxiv.org/abs/2305.08021,,https://huggingface.co/papers/2305.08021,,,,2305.08021,4,1 Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation,"Jeffrey Willette, Seanie Lee, Bruno Andreis, Kenji Kawaguchi, Juho Lee, Sung Ju Hwang",http://arxiv.org/abs/2208.12401,,https://huggingface.co/papers/2208.12401,,,,2208.12401,6,1 Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling,"Xiaohui Chen, JIAXING HE, Xu Han, Liping Liu",http://arxiv.org/abs/2305.04111,,https://huggingface.co/papers/2305.04111,,,,2305.04111,4,1 Optimal Arms Identification with Knapsacks,"Shaoang Li, Lan Zhang, Yingqi Yu, Xiangyang Li",,,,,,,,, Continual Learners are Incremental Model Generalizers,"Jaehong Yoon, Sung Ju Hwang, Yue Cao",,,,,,,,, From Hypergraph Energy Functions to Hypergraph Neural Networks,"Yuxin Wang, Quan Gan, Xipeng Qiu, Xuanjing Huang, David Wipf",http://arxiv.org/abs/2306.09623,https://github.com/yxzwang/PhenomNN,https://huggingface.co/papers/2306.09623,,,,2306.09623,5,0 Fundamental Tradeoffs in Learning with Prior Information,Anirudha Majumdar,http://arxiv.org/abs/2304.13479,,https://huggingface.co/papers/2304.13479,,,,2304.13479,1,0 ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation,"Kaiwen Zhou, Kaizhi Zheng, Connor Pryor, Yilin Shen, Hongxia Jin, Lise Getoor, Xin Eric Wang",http://arxiv.org/abs/2301.13166,,https://huggingface.co/papers/2301.13166,,,,2301.13166,7,1 Tight and fast generalization error bound of graph embedding in metric space.,"Atsushi Suzuki, Atsushi Nitanda, Taiji Suzuki, Jing Wang, Feng Tian, Kenji Yamanishi",,,,,,,,, Two-Scale Gradient Descent Ascent Dynamics Finds Mixed Nash Equilibria of Continuous Games: A Mean-Field Perspective,Yulong Lu,http://arxiv.org/abs/2212.08791,,https://huggingface.co/papers/2212.08791,,,,2212.08791,1,0 "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation","Wenqing Zheng, S P Sharan, Ajay Jaiswal, Kevin Wang, Yihan Xi, Dejia Xu, Zhangyang “Atlas” Wang",http://arxiv.org/abs/2305.00909,https://github.com/VITA-Group/ChainCoder,https://huggingface.co/papers/2305.00909,,,,2305.00909,7,3 Scaling of Class-wise Training Losses for Post-hoc Calibration,"Seungjin Jung, Seungmo Seo, Yonghyun Jeong, Jongwon Choi",,,,,,,,, SpeedDETR: Speed-aware Transformers for End-to-end Object Detection,"Peiyan Dong, Zhenglun Kong, Xin Meng, PENG ZHANG, hao tang, Yanzhi Wang, Chih-Hsien Chou",,,,,,,,, Learning to Decouple Complex Systems,"Zihan Zhou, Tianshu Yu",http://arxiv.org/abs/2302.01581,,https://huggingface.co/papers/2302.01581,,,,2302.01581,2,0 Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice,"Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Menard, Mohammad Gheshlaghi Azar, Remi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvari, Wataru Kumagai, Yutaka Matsuo",http://arxiv.org/abs/2305.13185,,https://huggingface.co/papers/2305.13185,,,,2305.13185,15,2 Automatically marginalized MCMC in probabilistic programming,"Jinlin Lai, Javier Burroni, Hui Guan, Daniel Sheldon",http://arxiv.org/abs/2302.00564,,https://huggingface.co/papers/2302.00564,,,,2302.00564,4,1 Nugget: Neural Agglomerative Embeddings of Text,"Guanghui Qin, Benjamin Van Durme",,,,,,,,, Optimal Shrinkage for Distributed Second-Order Optimization,"Fangzhao Zhang, Mert Pilanci",,,,,,,,, Defects of Convolutional Decoder Networks in Frequency Representation,"Ling Tang, Wen Shen, Zhanpeng Zhou, YueFeng Chen, Quanshi Zhang",http://arxiv.org/abs/2210.09020,,https://huggingface.co/papers/2210.09020,,,,2210.09020,5,2 "Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases","Xiaoxia Wu, Cheng Li, Reza Yazdani Aminabadi, Zhewei Yao, Yuxiong He",,,,,,,,, Graph Neural Networks can Recover the Hidden Features Solely from the Graph Structure,Ryoma Sato,http://arxiv.org/abs/2301.10956,,https://huggingface.co/papers/2301.10956,,,,2301.10956,1,1 Quantum Lower Bounds for Finding Stationary Points of Nonconvex Functions,"Chenyi Zhang, Tongyang Li",http://arxiv.org/abs/2212.03906,,https://huggingface.co/papers/2212.03906,,,,2212.03906,2,0 Synthetic data for model selection,"Alon Shshan, Nadav Bhonker, Igor Kviatkovsky, Igor Kviatkovsky, Matan Fintz, Gerard Medioni",http://arxiv.org/abs/2105.00717,,https://huggingface.co/papers/2105.00717,,,,2105.00717,5,0 Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models,"Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao",,,,,,,,, Learning Affinity with Hyperbolic Representation for Spatial Propagation,"Jin-Hwi Park, JAESUNG CHOE, Inhwan Bae, HAE-GON JEON",,,,,,,,, Revisiting Discriminative vs. Generative Classifiers: Theory and Implications,"Chenyu Zheng, Guoqiang Wu, Fan Bao, Yue Cao, Chongxuan Li, Jun Zhu",http://arxiv.org/abs/2302.02334,https://github.com/ML-GSAI/Revisiting-Dis-vs-Gen-Classifiers,https://huggingface.co/papers/2302.02334,,,,2302.02334,6,1 Differentially Private Stochastic Convex Optimization under a Quantile Loss Function,"Du Chen, Geoffrey Chua",,,,,,,,, Continual Vision-Language Representation Learning with Off-Diagonal Information,"zixuan ni, Longhui Wei, Siliang Tang, Yueting Zhuang, Qi Tian",http://arxiv.org/abs/2305.07437,,https://huggingface.co/papers/2305.07437,,,,2305.07437,5,0 Learning to Learn from APIs: Black-Box Data-Free Meta-Learning,"Zixuan Hu, Li Shen, Zhenyi Wang, Baoyuan Wu, Chun Yuan, Dacheng Tao",http://arxiv.org/abs/2305.18413,,https://huggingface.co/papers/2305.18413,,,,2305.18413,6,1 Learning Functional Distributions with Private Labels,"Changlong Wu, Yifan Wang, Ananth Grama, Wojciech Szpankowski",,,,,,,,, A Kernel Stein Test of Goodness of Fit for Sequential Models,"Jerome Baum, Heishiro Kanagawa, Arthur Gretton",http://arxiv.org/abs/2210.10741,,https://huggingface.co/papers/2210.10741,,,,2210.10741,3,0 DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation,"Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Scott Yih, Daniel Fried, Sida Wang, Tao Yu",,,,,,,,, Interpolation for Robust Learning: Data Augmentation on Geodesics,"Jiacheng Zhu, Jielin Qiu, Aritra Guha, Zhuolin Yang, XuanLong Nguyen, Bo Li, Ding Zhao",http://arxiv.org/abs/2302.02092,,https://huggingface.co/papers/2302.02092,,,,2302.02092,7,2 2D-Shapley: A Framework for Fragmented Data Valuation,"Zhihong Liu, Hoang Anh Just, Xiangyu Chang, Xi Chen, Ruoxi Jia",,,,,,,,, MetricGAN-OKD: Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement,"Wooseok Shin, Byung Hoon Lee, Jin Sob Kim, Hyun Joon Park, Sung Won Han",,,,,,,,, Everyone's Preference Changes Differently: A Weighted Multi-Interest Model For Retrieval,"hui shi, Yupeng Gu, Yitong Zhou, Bo Zhao, Sicun Gao, Jishen Zhao",,,,,,,,, Are Equivariant Equilibrium Approximators Beneficial?,"Zhijian Duan, Yunxuan Ma, Xiaotie Deng",http://arxiv.org/abs/2301.11481,,https://huggingface.co/papers/2301.11481,,,,2301.11481,3,0 Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing,"Jikai Jin, Zhiyuan Li, Kaifeng Lyu, Simon Du, Jason Lee",http://arxiv.org/abs/2301.11500,,https://huggingface.co/papers/2301.11500,,,,2301.11500,5,0 CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling,"Yansen Wang, XINYANG JIANG, Kan Ren, Caihua Shan, Xufang Luo, Dongqi Han, Kaitao Song, Yifei Shen, Dongsheng Li",,,,,,,,, On the Power of Foundation Models,Yang Yuan,http://arxiv.org/abs/2211.16327,,https://huggingface.co/papers/2211.16327,,,,2211.16327,1,0 Rigid body flows for sampling molecular crystal structures,"Jonas Köhler, Michele Invernizzi, Pim de Haan, Frank Noe",http://arxiv.org/abs/2301.11355,,https://huggingface.co/papers/2301.11355,,,,2301.11355,4,1 PixelAsParam: A Gradient View on Diffusion Sampling with Guidance,"Anh-Dung Dinh, Daochang Liu, Chang Xu",,,,,,,,, Constant Matters: Fine-grained Error Bound on Differentially Private Continual Observation,"Hendrik Fichtenberger, Monika Henzinger, Jalaj Upadhyay",,,,,,,,, Target-Aware Generative Augmentations for Single-Shot Adaptation,"Kowshik Thopalli, Rakshith Subramanyam, Pavan Turaga, Jayaraman J. Thiagarajan",http://arxiv.org/abs/2305.13284,https://github.com/Rakshith-2905/SiSTA,https://huggingface.co/papers/2305.13284,,,,2305.13284,4,2 Underspecification Presents Challenges for Credibility in Modern Machine Learning,"Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne, Rajiv Raman, Kim Ramasamy, Rory sayres, Jessica Schrouff, Martin Seneviratne, Shannon Sequeira, Harini Suresh, Victor Veitch, Maksym Vladymyrov, Xuezhi Wang, Kellie Webster, Steve Yadlowsky, Taedong Yun, Xiaohua Zhai, D. Sculley",http://arxiv.org/abs/2011.03395,,https://huggingface.co/papers/2011.03395,,,,2011.03395,40,3 Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation,"Ruijiang Dong, Feng Liu, Haoang Chi, Tongliang Liu, Mingming Gong, Gang Niu, Masashi Sugiyama, Bo Han",,,,,,,,, Large Language Models Struggle to Learn Long-Tail Knowledge,"Nikhil Kandpal, Haikang Deng, Adam Roberts, Eric Wallace, Colin Raffel",http://arxiv.org/abs/2211.08411,,https://huggingface.co/papers/2211.08411,,,,2211.08411,5,3 Adaptive Annealed Importance Sampling with Constant Rate Progress,"Shirin Goshtasbpour, Victor Cohen, Perez Fernando",,,,,,,,, A Coupled Flow Approach to Imitation Learning,"Gideon Freund, Elad Sarafian, Sarit Kraus",http://arxiv.org/abs/2305.00303,,https://huggingface.co/papers/2305.00303,,,,2305.00303,3,3 Learning to Optimize Differentiable Games,"Xuxi Chen, Nelson Vadori, Tianlong Chen, Zhangyang “Atlas” Wang",,,,,,,,, Robust Satisficing MDPs,"Haolin RUAN, Siyu Zhou, Zhi Chen, Chin Pang Ho",,,,,,,,, Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations,"Weixin Liang, Yining Mao, Yongchan Kwon, Xinyu Yang, James Zou",http://arxiv.org/abs/2305.02995,,https://huggingface.co/papers/2305.02995,,,,2305.02995,5,1 Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and MDPs,"Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang",,,,,,,,, Set-membership Belief State-based Reinforcement Learning for POMDPs,"Wei Wei, Lijun Zhang, Lin Li, Huizhong Song, Jiye Liang",,,,,,,,, CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling,"Jun Zhang, Shuyang Jiang, Jiangtao Feng, Lin Zheng, Lingpeng Kong",http://arxiv.org/abs/2210.07661,,https://huggingface.co/papers/2210.07661,,,,2210.07661,5,2 Policy Contrastive Imitation Learning,"Jialei Huang, Zhao-Heng Yin, Yingdong Hu, Yang Gao",,,,,,,,, Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity,"Risheng Liu, Yaohua Liu, Wei Yao, Shangzhi Zeng, Jin Zhang",http://arxiv.org/abs/2302.03407,,https://huggingface.co/papers/2302.03407,,,,2302.03407,5,1 Building Neural Networks on Matrix Manifolds: A Gyrovector Space Approach,"Xuan Son Nguyen, Shuo Yang",http://arxiv.org/abs/2305.04560,,https://huggingface.co/papers/2305.04560,,,,2305.04560,2,0 Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction,"Jianyi Zhang, Ang Li, Minxue Tang, Jingwei Sun, Xiang Chen, Fan Zhang, Changyou Chen, Yiran Chen, Hai Li",,,,,,,,, Disentangled Multiplex Graph Representation Learning ,"Yujie Mo, Yajie Lei, Jialie SHEN, Xiaoshuang Shi, Heng Tao Shen, Xiaofeng Zhu",,,,,,,,, Shedding a PAC-Bayesian Light on Adaptive Sliced-Wasserstein Distances,"Ruben Ohana, Kimia Nadjahi, alain rakotomamonjy, Ralaivola Liva",http://arxiv.org/abs/2206.03230,,https://huggingface.co/papers/2206.03230,,,,2206.03230,4,1 Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language,"Philipp Seidl, Andreu Vall, Sepp Hochreiter, Günter Klambauer",http://arxiv.org/abs/2303.03363,,https://huggingface.co/papers/2303.03363,,,,2303.03363,4,0 PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search,"Haibin Wang, Ce Ge, Hesen Chen, XIuyu Sun",http://arxiv.org/abs/2304.14636,https://github.com/tinyvision/PreNAS,https://huggingface.co/papers/2304.14636,,,,2304.14636,4,0 Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks,"Xu Chu, Yujie Jin, Xin Wang, Shanghang Zhang, Yasha Wang, Wenwu Zhu, Hong Mei",,,,,,,,, Learning Distributions over Quantum Measurement Outcomes,"Weiyuan Gong, Scott Aaronson",http://arxiv.org/abs/2209.03007,,https://huggingface.co/papers/2209.03007,,,,2209.03007,2,0 The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing,"Xingyu Xu, Yandi Shen, Yuejie Chi, Cong Ma",http://arxiv.org/abs/2302.01186,,https://huggingface.co/papers/2302.01186,,,,2302.01186,4,0 Minimal Width for Universal Property of Deep RNN,"Chang hoon Song, Geonho Hwang, Jun ho Lee, Myungjoo Kang",http://arxiv.org/abs/2211.13866,,https://huggingface.co/papers/2211.13866,,,,2211.13866,4,0 Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction,"Yuchao Lin, Keqiang Yan, Youzhi Luo, Yi Liu, Xiaoning Qian, Shuiwang Ji",,,,,,,,, IncDSI: Incrementally Updatable Document Retrieval,"Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, Kilian Weinberger",,,,,,,,, Normalizing Flows for Interventional Density Estimation,"Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel",http://arxiv.org/abs/2209.06203,,https://huggingface.co/papers/2209.06203,,,,2209.06203,3,1 The Fast Johnson-Lindenstrauss Transform Is Even Faster,"Ora Nova Fandina, Mikael Møller Høgsgaard, Mikael Høgsgaard, Kasper Green Larsen",http://arxiv.org/abs/2204.01800,,https://huggingface.co/papers/2204.01800,,,,2204.01800,3,0 Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features,"Chieh Lin, Hung-Yu Tseng, Hsin-Ying Lee, Maneesh Singh, Ming-Hsuan Yang",http://arxiv.org/abs/2206.01202,,https://huggingface.co/papers/2206.01202,,,,2206.01202,5,1 What can online reinforcement learning with function approximation benefit from general coverage conditions?,"Fanghui Liu, Luca Viano, Volkan Cevher",http://arxiv.org/abs/2304.12886,,https://huggingface.co/papers/2304.12886,,,,2304.12886,3,0 On the Correctness of Automatic Differentiation for Neural Networks with Machine-Representable Parameters,"Wonyeol Lee, Sejun Park, Alex Aiken",http://arxiv.org/abs/2301.13370,,https://huggingface.co/papers/2301.13370,,,,2301.13370,3,0 FeDXL: Provable Federated Learning for Deep X-Risk Optimization,"Zhishuai Guo, Rong Jin, Jiebo Luo, Tianbao Yang",,,,,,,,, Causal Proxy Models for Concept-based Model Explanations,"Zhengxuan Wu, Karel D'Oosterlinck, Atticus Geiger, Amir Zur, Christopher Potts",http://arxiv.org/abs/2209.14279,https://github.com/frankaging/Causal-Proxy-Model,https://huggingface.co/papers/2209.14279,,,,2209.14279,5,4 Grounding Language Models to Images for Multimodal Inputs and Outputs,"Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried",http://arxiv.org/abs/2301.13823,,https://huggingface.co/papers/2301.13823,,,,2301.13823,3,1 Improved Online Conformal Prediction via Strongly Adaptive Online Learning,"Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Yu Bai",http://arxiv.org/abs/2302.07869,,https://huggingface.co/papers/2302.07869,,,,2302.07869,4,1 A Gromov--Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening,"Yifan Chen, Rentian Yao, Yun Yang, Jie Chen",http://arxiv.org/abs/2306.08854,https://github.com/ychen-stat-ml/GW-Graph-Coarsening,https://huggingface.co/papers/2306.08854,,,,2306.08854,4,0 Approximately Optimal Core Shapes for Tensor Decompositions,"Mehrdad Ghadiri, Matthew Fahrbach, Thomas Fu, Vahab Mirrokni",http://arxiv.org/abs/2302.03886,,https://huggingface.co/papers/2302.03886,,,,2302.03886,4,1 Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting,"Hilaf Hasson, Danielle Robinson, Yuyang Wang, Gaurav Gupta, Youngsuk Park",http://arxiv.org/abs/2305.15786,,https://huggingface.co/papers/2305.15786,,,,2305.15786,5,1 Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy,"xiyao wang, Wichayaporn Wongkamjan, Ruonan Jia, Furong Huang",http://arxiv.org/abs/2207.12141,,https://huggingface.co/papers/2207.12141,,,,2207.12141,3,3 Towards Deep Attention in Graph Neural Networks: Problems and Remedies,"Soo Yong Lee, Fanchen Bu, Jaemin Yoo, Kijung Shin",http://arxiv.org/abs/2306.02376,https://github.com/syleeheal/AERO-GNN,https://huggingface.co/papers/2306.02376,,,,2306.02376,4,1 SAAL: Sharpness-Aware Active Learning,"Yoon-Yeong Kim, Youngjae Cho, JoonHo Jang, Byeonghu Na, Yeongmin Kim, Kyungwoo Song, Wanmo Kang, IL CHUL MOON",,,,,,,,, Meta Learning of Interface Conditions for Multi-Domain Physics-Informed Neural Networks,"Shibo Li, Michael Penwarden, Yiming Xu, Conor Tillinghast, Akil Narayan, Mike Kirby, Shandian Zhe",http://arxiv.org/abs/2210.12669,,https://huggingface.co/papers/2210.12669,,,,2210.12669,4,0 Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning,"Jiatai Huang, Yan Dai, Longbo Huang",http://arxiv.org/abs/2301.10500,,https://huggingface.co/papers/2301.10500,,,,2301.10500,3,0 Shape-Guided Dual-Memory Learning for 3D Anomaly Detection,"YU-MIN CHU, Chieh Liu, Ting-I Hsieh, Hwann-Tzong Chen, Tyng-Luh Liu",,,,,,,,, SRATTA: Sample Re-ATTribution Attack of Secure Aggregation in Federated Learning.,"Tanguy MARCHAND, Regis Loeb, Ulysse Marteau-Ferey, Jean Ogier du Terrail, Arthur Pignet",,,,,,,,, Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games,"Dylan Foster, Noah Golowich, Sham Kakade",http://arxiv.org/abs/2303.12287,,https://huggingface.co/papers/2303.12287,,,,2303.12287,3,0 Implicit Jacobian regularization weighted with impurity of probability output,"Sungyoon Lee, Jinseong Park, Jaewook Lee",,,,,,,,, Why do Nearest Neighbor Language Models Work?,"Frank Xu, Uri Alon, Graham Neubig",http://arxiv.org/abs/2301.02828,https://github.com/frankxu2004/knnlm-why,https://huggingface.co/papers/2301.02828,,,,2301.02828,3,3 Revisiting Weighted Aggregation in Federated Learning with Neural Networks,"Zexi Li, Tao Lin, Xinyi Shang, Chao Wu",http://arxiv.org/abs/2302.10911,,https://huggingface.co/papers/2302.10911,,,,2302.10911,4,1 Sliced-Wasserstein on Symmetric Positive Definite Matrices for M/EEG Signals,"Clément Bonet, Benoît Malézieux, alain rakotomamonjy, Lucas Drumetz, Thomas Moreau, Matthieu Kowalski, Nicolas Courty",http://arxiv.org/abs/2303.05798,,https://huggingface.co/papers/2303.05798,,,,2303.05798,7,2 VA-learning as a more efficient alternative to Q-learning,"Yunhao Tang, Remi Munos, Mark Rowland, Michal Valko",,,,,,,,, Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models,"Rui Li, ST John, Arno Solin",http://arxiv.org/abs/2306.04201,,https://huggingface.co/papers/2306.04201,,,,2306.04201,3,1 HarsanyiNet: Computing Accurate Shapley Values in a Single Forward Propagation,"Lu Chen, Siyu Lou, Keyan Zhang, JIN HUANG, Quanshi Zhang",http://arxiv.org/abs/2304.01811,,https://huggingface.co/papers/2304.01811,,,,2304.01811,5,0 Invariance in Policy Optimisation and Partial Identifiability in Reward Learning,"Joar Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave",http://arxiv.org/abs/2203.07475,,https://huggingface.co/papers/2203.07475,,,,2203.07475,5,2 On the Convergence Rate of Gaussianization with Random Rotations,"Felix Draxler, Lars Kühmichel, Jens Müller, Armand Rousselot, Christoph Schnörr, Ullrich Koethe",,,,,,,,, Nonparametric Iterative Machine Teaching,"CHEN ZHANG, Xiaofeng Cao, Weiyang Liu, Ivor Tsang, James Kwok",http://arxiv.org/abs/2306.03007,,https://huggingface.co/papers/2306.03007,,,,2306.03007,5,2 Boosting Graph Contrastive Learning via Graph Contrastive Saliency,"Chunyu Wei, Yu Wang, Bing Bai, Kai Ni, David Brady, LU FANG",,,,,,,,, Neural Wasserstein Gradient Flows for Discrepancies with Riesz Kernels,"Fabian Altekrüger, Johannes Hertrich, Gabriele Steidl",,,,,,,,, Improving Expert Predictions with Conformal Prediction,"Eleni Straitouri, Luke Lequn Wang, Nastaran Okati, Manuel Gomez-Rodriguez",,,,,,,,, Graph Switching Dynamical Systems,"Yongtuo Liu, Sara Magliacane, Miltiadis (Miltos) Kofinas, Efstratios Gavves",http://arxiv.org/abs/2306.00370,,https://huggingface.co/papers/2306.00370,,,,2306.00370,4,1 Achieving High Accuracy with PINNs via Energy Natural Gradient Descent,"Marius Zeinhofer, Johannes Müller",,,,,,,,, On Strengthening and Defending Graph Reconstruction Attack with Markov Chain Approximation,"Zhanke Zhou, Chenyu Zhou, Xuan Li, Jiangchao Yao, QUANMING YAO, Bo Han",http://arxiv.org/abs/2306.09104,https://github.com/tmlr-group/MC-GRA,https://huggingface.co/papers/2306.09104,,,,2306.09104,6,0 "Global Convergence of Sub-gradient Method for Robust Matrix Recovery: Small Initialization, Noisy Measurements, and Over-parameterization","Jianhao Ma, Salar Fattahi",http://arxiv.org/abs/2202.08788,,https://huggingface.co/papers/2202.08788,,,,2202.08788,2,0 Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN,"Siyuan Li, Di Wu, Fang Wu, Zelin Zang, Stan Z Li",http://arxiv.org/abs/2205.13943,,https://huggingface.co/papers/2205.13943,,,,2205.13943,5,1 Doubly Adversarial Federated Bandits,"Jialin Yi, Milan Vojnovic",http://arxiv.org/abs/2301.09223,,https://huggingface.co/papers/2301.09223,,,,2301.09223,2,1 LipsNet: A Smooth and Robust Neural Network with Adaptive Lipschitz Constant for High Accuracy Optimal Control,"Xujie Song, Jingliang Duan, Wenxuan Wang, Shengbo Li, Chen Chen, Bo Cheng, Bo Zhang, Junqing Wei, Xiaoming (Simon) Wang",,,,,,,,, A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning,"Benjamin Eysenbach, Matthieu Geist, Ruslan Salakhutdinov, Sergey Levine",,,,,,,,, WL meet VC,"Christopher Morris, Floris Geerts, Jan M Tönshoff, Martin Grohe",http://arxiv.org/abs/2301.11039,,https://huggingface.co/papers/2301.11039,,,,2301.11039,4,0 Differential Privacy has Bounded Impact on Fairness in Classification,"Paul Mangold, Michaël Perrot, Aurélien Bellet, Marc Tommasi",http://arxiv.org/abs/2210.16242,,https://huggingface.co/papers/2210.16242,,,,2210.16242,4,0 Differentiable Multi-Target Causal Bayesian Experimental Design,"Panagiotis Tigas, Yashas Annadani, Desi Ivanova, Andrew Jesson, Yarin Gal, Adam Foster, Stefan Bauer",http://arxiv.org/abs/2302.10607,,https://huggingface.co/papers/2302.10607,,,,2302.10607,7,1 Learning the Dynamics of Sparsely Observed Interacting Systems,"Linus Bleistein, Adeline Fermanian, Anne-Sophie Jannot, Agathe Guilloux",http://arxiv.org/abs/2301.11647,,https://huggingface.co/papers/2301.11647,,,,2301.11647,4,0 How Jellyfish Characterise Alternating Group Equivariant Neural Networks,Edward Pearce-Crump,http://arxiv.org/abs/2301.10152,,https://huggingface.co/papers/2301.10152,,,,2301.10152,1,1 On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs,"Romain Laroche, Remi Tachet des Combes",,,,,,,,, Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings,"Masatoshi Uehara, Ayush Sekhari, Jason Lee, Nathan Kallus, Wen Sun",http://arxiv.org/abs/2206.12081,,https://huggingface.co/papers/2206.12081,,,,2206.12081,5,0 Uncovering Adversarial Risks of Test-Time Adaptation,"Tong Wu, Feiran Jia, Xiangyu Qi, Jiachen Wang, Vikash Sehwag, Saeed Mahloujifar, Prateek Mittal",http://arxiv.org/abs/2301.12576,,https://huggingface.co/papers/2301.12576,,,,2301.12576,7,4 Offline Reinforcement Learning with Closed-Form Policy Improvement Operators,"Jiachen Li, Edwin Zhang, Ming Yin, Jerry Bai, Yu-Xiang Wang, William Wang",http://arxiv.org/abs/2211.15956,,https://huggingface.co/papers/2211.15956,,,,2211.15956,6,1 Taxonomy-Structured Domain Adaptation,"Tianyi Liu, Zihao Xu, Hao He, Guang-Yuan Hao, Guang-He Lee, Hao Wang",http://arxiv.org/abs/2306.07874,https://github.com/Wang-ML-Lab/TSDA,https://huggingface.co/papers/2306.07874,,,,2306.07874,6,1 Latent Traversals in Generative Models as Potential Flows,"Yue Song, T. Anderson Keller, Nicu Sebe, Max Welling",http://arxiv.org/abs/2304.12944,,https://huggingface.co/papers/2304.12944,,,,2304.12944,4,0 Fast Rates for Maximum Entropy Exploration,"Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Remi Munos, Alexey Naumov, Pierre Perrault, Yunhao Tang, Michal Valko, Pierre Menard",http://arxiv.org/abs/2303.08059,,https://huggingface.co/papers/2303.08059,,,,2303.08059,10,2 MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Pose,"Yang Fu, Ishan Misra, Xiaolong Wang",http://arxiv.org/abs/2210.07181,,https://huggingface.co/papers/2210.07181,,,,2210.07181,3,0 Are Large Kernels Better Teachers than Transformers for ConvNets?,"Tianjin Huang, Lu Yin, Zhenyu Zhang, Li Shen, Meng Fang, Mykola Pechenizkiy, Zhangyang “Atlas” Wang, Shiwei Liu",,,,,,,,, Learning in POMDPs is Sample-Efficient with Hindsight Observability,"Jonathan Lee, Alekh Agarwal, Christoph Dann, Tong Zhang",http://arxiv.org/abs/2301.13857,,https://huggingface.co/papers/2301.13857,,,,2301.13857,4,1 Quantum Ridgelet Transform: Winning Lottery Ticket of Neural Networks with Quantum Computation,"Hayata Yamasaki, Sathyawageeswar Subramanian, Satoshi Hayakawa, Sho Sonoda",http://arxiv.org/abs/2301.11936,,https://huggingface.co/papers/2301.11936,,,,2301.11936,4,0 MixFlows: principled variational inference via mixed flows,"Zuheng Xu, Naitong Chen, Trevor Campbell",http://arxiv.org/abs/2205.07475,,https://huggingface.co/papers/2205.07475,,,,2205.07475,3,0 Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems,"Chawin Sitawarin, Florian Tramer, Nicholas Carlini",http://arxiv.org/abs/2210.03297,https://github.com/google-research/preprocessor-aware-black-box-attack,https://huggingface.co/papers/2210.03297,,,,2210.03297,3,2 Efficient Sequence Transduction by Jointly Predicting Tokens and Durations,"Hainan Xu, Fei Jia, Somshubra Majumdar, He Huang, Shinji Watanabe, Boris Ginsburg",http://arxiv.org/abs/2304.06795,https://github.com/NVIDIA/NeMo,https://huggingface.co/papers/2304.06795,,,,2304.06795,6,3 Adaptive Identification of Populations with Treatment Benefit in Clinical Trials: Machine Learning Challenges and Solutions,"Alicia Curth, Alihan Hüyük, Mihaela van der Schaar",http://arxiv.org/abs/2208.05844,,https://huggingface.co/papers/2208.05844,,,,2208.05844,3,0 Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication,"Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Ding, Zhangyang “Atlas” Wang",,,,,,,,, A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification,"Jiachen Sun, Jiongxiao Wang, Weili Nie, Zhiding Yu, Zhuoqing Morley Mao, Chaowei Xiao",,,,,,,,, COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects,"Anlan Yu, Ning Lyu, Jieming Yin, Zhiyuan Yan, Wujie Wen",,,,,,,,, A Closer Look at Self-Supervised Lightweight Vision Transformers,"Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu",http://arxiv.org/abs/2205.14443,https://github.com/wangsr126/mae-lite,https://huggingface.co/papers/2205.14443,,,,2205.14443,5,1 Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space,"Anas Barakat, Ilyas Fatkhullin, Niao He",http://arxiv.org/abs/2306.01854,,https://huggingface.co/papers/2306.01854,,,,2306.01854,3,0 Leveraging Offline Data in Online Reinforcement Learning,"Andrew Wagenmaker, Aldo Pacchiano",http://arxiv.org/abs/2211.04974,,https://huggingface.co/papers/2211.04974,,,,2211.04974,2,0 Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression,"Mo Zhou, Rong Ge",http://arxiv.org/abs/2302.00257,,https://huggingface.co/papers/2302.00257,,,,2302.00257,2,0 NeuralSlice: Neural 3D Triangle Mesh Reconstruction via Slicing 4D Tetrahedral Meshes,"Chenbo Jiang, Jie Yang, Shwai He, Yu-Kun Lai, Lin Gao",,,,,,,,, Adversarial Parameter Attack on Deep Neural Networks,"Lijia Yu, Yihan Wang, Xiao-Shan Gao",http://arxiv.org/abs/2203.10502,,https://huggingface.co/papers/2203.10502,,,,2203.10502,3,0 Contrastive Learning Meets Homophily: Two Birds with One Stone,"Dongxiao He, JiTao Zhao, Rui Guo, Zhiyong Feng, Di Jin, Yuxiao Huang, Zhen Wang, Weixiong Zhang",,,,,,,,, Mechanistic Mode Connectivity,"Ekdeep Singh Lubana, Eric Bigelow, Robert Dick, David Krueger, Hidenori Tanaka",http://arxiv.org/abs/2211.08422,,https://huggingface.co/papers/2211.08422,,,,2211.08422,5,1 Interactive Object Placement with Reinforcement Learning,"Shengping Zhang, Quanling Meng, Qinglin Liu, Liqiang Nie, Bineng Zhong, Xiaopeng Fan, Rongrong Ji",,,,,,,,, Towards credible visual model interpretation with path attribution,"Naveed Akhtar, Mohammad Jalwana",http://arxiv.org/abs/2305.14395,,https://huggingface.co/papers/2305.14395,,,,2305.14395,2,0 Controllability-Aware Unsupervised Skill Discovery,"Seohong Park, Kimin Lee, Youngwoon Lee, Pieter Abbeel",http://arxiv.org/abs/2302.05103,,https://huggingface.co/papers/2302.05103,,,,2302.05103,4,1 Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments,"Runlong Zhou, Zhang Zihan, Simon Du",http://arxiv.org/abs/2301.13446,,https://huggingface.co/papers/2301.13446,,,,2301.13446,3,1 A Generalization of ViT/MLP-Mixer to Graphs,"Xiaoxin He, Bryan Hooi, Thomas Laurent, Adam Perold, Yann LeCun, Xavier Bresson",http://arxiv.org/abs/2212.13350,https://github.com/XiaoxinHe/Graph-ViT-MLPMixer,https://huggingface.co/papers/2212.13350,,,,2212.13350,6,1 Dimension-independent Certified Neural Network Watermarks via Mollifier Smoothing,"Jiaxiang Ren, Jiayin Jin, Yang Zhou, Lingjuan Lyu, Da Yan",,,,,,,,, One-Shot Federated Conformal Prediction,"Pierre Humbert, Batiste Le Bars, Aurélien Bellet, Sylvain Arlot",http://arxiv.org/abs/2302.06322,,https://huggingface.co/papers/2302.06322,,,,2302.06322,4,0 A Robust Optimisation Perspective on Counterexample-Guided Repair of Neural Networks,"David Boetius, Stefan Leue, Tobias Sutter",http://arxiv.org/abs/2301.11342,,https://huggingface.co/papers/2301.11342,,,,2301.11342,3,1 EM-Network: Oracle Guided Self-distillation for Sequence Learning,"Ji Won Yoon, Sung Hwan Ahn, Hyeonseung Lee, Minchan Kim, Seok Min Kim, Nam Soo Kim",,,,,,,,, Refined Regret for Adversarial MDPs with Linear Function Approximation,"Yan Dai, Haipeng Luo, Chen-Yu Wei, Julian Zimmert",http://arxiv.org/abs/2301.12942,,https://huggingface.co/papers/2301.12942,,,,2301.12942,4,0 Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations,"Weiwei Lin, Chenhang He, Man-Wai Mak, Youzhi Tu",http://arxiv.org/abs/2305.08099,,https://huggingface.co/papers/2305.08099,,,,2305.08099,4,0 A Likelihood Approach to Nonparametric Estimation of a Singular Distribution Using Deep Generative Models,"Minwoo Chae, Dongha Kim, Yongdai Kim, Lizhen Lin",http://arxiv.org/abs/2105.04046,,https://huggingface.co/papers/2105.04046,,,,2105.04046,4,0 Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond,"Itai Kreisler, Mor Shpigel Nacson, Daniel Soudry, Yair Carmon",http://arxiv.org/abs/2305.13064,,https://huggingface.co/papers/2305.13064,,,,2305.13064,4,2 Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning,"Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng",http://arxiv.org/abs/2301.10886,,https://huggingface.co/papers/2301.10886,,,,2301.10886,4,1 Finding the Missing-half: Graph Complementary Learning for Homophily-prone and Heterophily-prone Graphs,"YIZHEN ZHENG, He Zhang, Vincent Lee, Yu Zheng, Xiao Wang, Shirui Pan",,,,,,,,, Topological Singularity Detection at Multiple Scales,"Julius Von Rohrscheidt, Bastian Rieck",http://arxiv.org/abs/2210.00069,,https://huggingface.co/papers/2210.00069,,,,2210.00069,2,0 Total Variation Graph Neural Networks,"Jonas B. Hansen, Filippo Maria Bianchi",http://arxiv.org/abs/2211.06218,,https://huggingface.co/papers/2211.06218,,,,2211.06218,2,1 DevFormer: A Symmetric Transformer for Context-Aware Device Placement,"Haeyeon Kim, Minsu Kim, Federico Berto, Joungho Kim, Jinkyoo Park",http://arxiv.org/abs/2205.13225,,https://huggingface.co/papers/2205.13225,,,,2205.13225,5,1 Multi-Task Structural Learning using Local Task Similarity induced Neuron Creation and Removal,"Naresh Kumar Gurulingan, Bahram Zonooz, Elahe Arani",http://arxiv.org/abs/2305.00441,,https://huggingface.co/papers/2305.00441,,,,2305.00441,3,1 Multi-Modal Classifiers for Open-Vocabulary Object Detection,"Prannay Kaul, Weidi Xie, Andrew Zisserman",http://arxiv.org/abs/2306.05493,,https://huggingface.co/papers/2306.05493,,,,2306.05493,3,2 One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training,"Sekitoshi Kanai, Shin'ya Yamaguchi, Masanori Yamada, Hiroshi Takahashi, Kentaro Ono, Yasutoshi Ida",http://arxiv.org/abs/2207.10283,,https://huggingface.co/papers/2207.10283,,,,2207.10283,6,0 Generalizing Neural Wave Functions,"Nicholas Gao, Stephan Günnemann",http://arxiv.org/abs/2302.04168,,https://huggingface.co/papers/2302.04168,,,,2302.04168,2,0 An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning,"WOOJUN KIM, Youngchul Sung",,,,,,,,, DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule,"Maor Ivgi, Oliver Hinder, Yair Carmon",http://arxiv.org/abs/2302.12022,https://github.com/formll/dog,https://huggingface.co/papers/2302.12022,,,,2302.12022,3,1 Detecting Out-of-distribution Data through In-distribution Class Prior,"Xue JIANG, Feng Liu, zhen fang, Hong Chen, Tongliang Liu, Feng Zheng, Bo Han",,,,,,,,, Regression with Sensor Data Containing Incomplete Observations,"Takayuki Katsuki, Takayuki Osogami",http://arxiv.org/abs/2304.13415,,https://huggingface.co/papers/2304.13415,,,,2304.13415,2,0 Finding Generalization Measures by Contrasting Signal and Noise,"Jiaye Teng, Bohang Zhang, Ruichen Li, Haowei He, Yequan Wang, Yan Tian, Yang Yuan",,,,,,,,, Learning Hidden Markov Models When the Locations of Missing Observations are Unknown,"BINYAMIN PERETS, Mark Kozdoba, Shie Mannor",,,,,,,,, "SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance","Amit Attia, Tomer Koren",http://arxiv.org/abs/2302.08783,,https://huggingface.co/papers/2302.08783,,,,2302.08783,2,0 A Distributional Optimization-based Framework for Confidence Bounds of Risk Measures,"Hao Liang, Zhi-Quan Luo",,,,,,,,, Neural Inverse Operators for Solving PDE Inverse Problems,"Roberto Molinaro, Yunan Yang, Björn Engquist, Siddhartha Mishra",http://arxiv.org/abs/2301.11167,,https://huggingface.co/papers/2301.11167,,,,2301.11167,4,0 GRAFENNE: Learning on Graphs with Heterogeneous and Dynamic Feature Sets,"Shubham Gupta, Sahil Manchanda, Sayan Ranu, Srikanta Bedathur",http://arxiv.org/abs/2306.03447,,https://huggingface.co/papers/2306.03447,,,,2306.03447,4,1 Bidirectional Looking with A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers,"Yineng Chen, Zuchao Li, Lefei Zhang, Bo Du, hai zhao",,,,,,,,, Speed-Oblivious Online Scheduling: Knowing (Precise) Speeds is not Necessary,"Alexander Lindermayr, Nicole Megow, Martin Rapp",http://arxiv.org/abs/2302.00985,,https://huggingface.co/papers/2302.00985,,,,2302.00985,3,0 Aligning Language Models with Preferences through $f$-divergence Minimization,"Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Nahyeon Ryu, Marc Dymetman",,,,,,,,, Fascinating Supervisory Signals and Where to Find Them: Deep Anomaly Detection with Scale Learning,"Hongzuo Xu, Yijie Wang, JuHui Wei, Songlei Jian, Yizhou Li, Ning Liu",http://arxiv.org/abs/2305.16114,,https://huggingface.co/papers/2305.16114,,,,2305.16114,6,0 Margin-based Neural Network Watermarking,"Byungjoo Kim, Suyoung Lee, Seanie Lee, Son, Sung Ju Hwang",,,,,,,,, Graph Neural Tangent Kernel: Convergence on Large Graphs,"Sanjukta Krishnagopal, Luana Ruiz",http://arxiv.org/abs/2301.10808,,https://huggingface.co/papers/2301.10808,,,,2301.10808,2,0 TabDDPM: Modelling Tabular Data with Diffusion Models,"Akim Kotelnikov, Dmitry Baranchuk, Ivan Rubachev, Artem Babenko",http://arxiv.org/abs/2209.15421,,https://huggingface.co/papers/2209.15421,,,,2209.15421,4,3 Trustworthy Policy Learning under the Counterfactual No-Harm Criterion,"Haoxuan Li, Chunyuan Zheng, Yixiao Cao, zhi geng, Yue Liu, Peng Wu",,,,,,,,, Projected Tensor Power Method for Hypergraph Community Recovery,"Jinxin Wang, Yuen-Man Pun, Xiaolu Wang, Peng Wang, Anthony Man-Cho So",,,,,,,,, Towards Understanding the Generalization of Graph Neural Networks,"Huayi Tang, Yong Liu",http://arxiv.org/abs/2305.08048,,https://huggingface.co/papers/2305.08048,,,,2305.08048,2,0 Unlocking Slot Attention by Changing Optimal Transport Costs,"Yan Zhang, David Zhang, Simon Lacoste-Julien, Gertjan Burghouts, Cees Snoek",http://arxiv.org/abs/2301.13197,,https://huggingface.co/papers/2301.13197,,,,2301.13197,5,3 Cell-Free Latent Go-Explore,"Quentin Gallouédec, Emmanuel Dellandrea",http://arxiv.org/abs/2208.14928,https://github.com/qgallouedec/lge,https://huggingface.co/papers/2208.14928,,,,2208.14928,2,1 Minimalistic Predictions to Schedule Jobs with Online Precedence Constraints,"Alexandra Lassota, Alexander Lindermayr, Nicole Megow, Jens Schlöter",http://arxiv.org/abs/2301.12863,,https://huggingface.co/papers/2301.12863,,,,2301.12863,4,0 Temporal Label Smoothing for Early Event Prediction,"Hugo Yèche, Alizée Pace, Gunnar Ratsch, Rita Kuznetsova",http://arxiv.org/abs/2208.13764,,https://huggingface.co/papers/2208.13764,,,,2208.13764,4,1 Bandit Multi-linear DR-Submodular Maximization and Its Applications on Adversarial Submodular Bandits,"Zongqi Wan, Jialin Zhang, Wei Chen, Xiaoming Sun, Zhijie Zhang",http://arxiv.org/abs/2305.12402,,https://huggingface.co/papers/2305.12402,,,,2305.12402,5,0 VectorMapNet: End-to-end Vectorized HD Map Learning,"yicheng Liu, Tianyuan Yuan, Yue Wang, Yilun Wang, Hang Zhao",http://arxiv.org/abs/2206.08920,,https://huggingface.co/papers/2206.08920,,,,2206.08920,5,0 Relevant Walk Search for Explaining Graph Neural Networks,"Ping Xiong, Thomas Schnake, Michael Gastegger, Grégoire Montavon, Klaus-robert Mueller, Shinichi Nakajima",,,,,,,,, Shapley Based Residual Decomposition for Instance Analysis,"Tommy Liu, Amanda Barnard",http://arxiv.org/abs/2305.18818,,https://huggingface.co/papers/2305.18818,,,,2305.18818,2,0 Alternating Local Enumeration (TnALE): Solving Tensor Network Structure Search with Fewer Evaluations,"Chao Li, Junhua Zeng, Chunmei Li, Cesar F Caiafa, Qibin Zhao",http://arxiv.org/abs/2304.12875,,https://huggingface.co/papers/2304.12875,,,,2304.12875,5,0 Understanding and Defending Patched-based Adversarial Attacks for Vision Transformer,"Liang Liu, Yanan Guo, Youtao Zhang, Jun Yang",,,,,,,,, On the Generalization of Multi-modal Contrastive Learning,"Qi Zhang, Yifei Wang, Yisen Wang",http://arxiv.org/abs/2306.04272,https://github.com/PKU-ML/CLIP-Help-SimCLR,https://huggingface.co/papers/2306.04272,,,,2306.04272,3,0 DIFF2: Differential Private Optimization via Gradient Differences for Nonconvex Distributed Learning,"Tomoya Murata, Taiji Suzuki",http://arxiv.org/abs/2302.03884,,https://huggingface.co/papers/2302.03884,,,,2302.03884,2,0 On the Robustness of Text Vectorizers,"Rémi Catellier, Samuel Vaiter, Damien Garreau",http://arxiv.org/abs/2303.07203,,https://huggingface.co/papers/2303.07203,,,,2303.07203,3,1 XTab: Cross-table Pretraining for Tabular Transformers,"Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran",http://arxiv.org/abs/2305.06090,,https://huggingface.co/papers/2305.06090,,,,2305.06090,6,1 FREDIS: A Fusion Framework of Refinement and Disambiguation for Unreliable Partial Label Learning,"Congyu Qiao, Ning Xu, JIAQI LYU, yi ren, Xin Geng",,,,,,,,, Explainable Data-Driven Optimization: From Context to Decision and Back Again,"Alexandre Forel, Axel Parmentier, Thibaut Vidal",http://arxiv.org/abs/2301.10074,,https://huggingface.co/papers/2301.10074,,,,2301.10074,3,0 Robust Weak Supervision with Variational Auto-Encoders,"Francesco Tonolini, Nikolaos Aletras, Yunlong Jiao, Gabriella Kazai",,,,,,,,, Weakly Supervised Disentangled Generative Causal Representation Learning,"Xinwei Shen, Furui Liu, Hanze Dong, Qing Lian, Zhitang Chen, Tong Zhang",http://arxiv.org/abs/2010.02637,,https://huggingface.co/papers/2010.02637,,,,2010.02637,6,1 Are Diffusion Models Vulnerable to Membership Inference Attacks?,"Jinhao Duan, Fei Kong, Shiqi Wang, Xiaoshuang Shi, Kaidi Xu",http://arxiv.org/abs/2302.01316,https://github.com/jinhaoduan/SecMI,https://huggingface.co/papers/2302.01316,,,,2302.01316,5,0 Parameter-Level Soft-Masking for Continual Learning,"Tatsuya Konishi, Tatsuya Konishi, Mori Kurokawa, Chihiro Ono, Zixuan Ke, Gyuhak Kim, Bing Liu",,,,,,,,, ILLUME: Rationalizing Vision-Language Models through Human Interactions,"Manuel Brack, Patrick Schramowski, Björn Deiseroth, Kristian Kersting",http://arxiv.org/abs/2208.08241,,https://huggingface.co/papers/2208.08241,,,,2208.08241,4,2 MultiRobustBench: Benchmarking Robustness Against Multiple Attacks,"Sihui Dai, Saeed Mahloujifar, Chong Xiang, Vikash Sehwag, Pin-Yu Chen, Prateek Mittal",http://arxiv.org/abs/2302.10980,,https://huggingface.co/papers/2302.10980,,,,2302.10980,6,3 Sequential Kernelized Independence Testing,"Aleksandr Podkopaev, Patrick Bloebaum, Shiva Kasiviswanathan, Aaditya Ramdas",http://arxiv.org/abs/2212.07383,,https://huggingface.co/papers/2212.07383,,,,2212.07383,4,0 Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation,"Orin Levy, Alon Cohen, Asaf Cassel, Yishay Mansour",http://arxiv.org/abs/2303.01464,,https://huggingface.co/papers/2303.01464,,,,2303.01464,4,0 Spurious Valleys and Clustering Behavior of Neural Networks,Samuele Pollaci,,,,,,,,, "Reasons for the Superiority of Stochastic Estimators over Deterministic Ones: Robustness, Consistency and Perceptual Quality","Guy Ohayon, Theo Adrai, Michael Elad, Tomer Michaeli",http://arxiv.org/abs/2211.08944,,https://huggingface.co/papers/2211.08944,,,,2211.08944,4,0 Implicit Graph Neural Networks: A Monotone Operator Viewpoint,"Justin Baker, Qingsong Wang, Cory Hauck, Bao Wang",,,,,,,,, $\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation,"CHENGYUE WU, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo",,,,,,,,, Revisiting Bellman Errors for Offline Model Selection,"Joshua Zitovsky, Daniel de Marchi, Rishabh Agarwal, Michael Kosorok",http://arxiv.org/abs/2302.00141,,https://huggingface.co/papers/2302.00141,,,,2302.00141,4,2 The Numerical Stability of Hyperbolic Representation Learning,"Gal Mishne, Zhengchao Wan, Yusu Wang, Sheng Yang",http://arxiv.org/abs/2211.00181,,https://huggingface.co/papers/2211.00181,,,,2211.00181,4,0 Fully-Adaptive Composition in Differential Privacy,"Justin Whitehouse, Aaditya Ramdas, Ryan Rogers, Steven Wu",,,,,,,,, Revisiting Pseudo-Label for Single-Positive Multi-Label Learning,"biao liu, Ning Xu, JIAQI LYU, Xin Geng",,,,,,,,, "Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds","Yeqing Lin, Mohammed AlQuraishi",http://arxiv.org/abs/2301.12485,https://github.com/aqlaboratory/genie,https://huggingface.co/papers/2301.12485,,,,2301.12485,2,1 Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills,"Seongun Kim, Kyowoon Lee, Jaesik Choi",,,,,,,,, SinFusion: Training Diffusion Models on a Single Image or Video,"Yaniv Nikankin, Niv Haim, Michal Irani",http://arxiv.org/abs/2211.11743,,https://huggingface.co/papers/2211.11743,,,,2211.11743,3,1 Sequential Predictive Conformal Inference for Time Series,"Chen Xu, Yao Xie",http://arxiv.org/abs/2212.03463,,https://huggingface.co/papers/2212.03463,,,,2212.03463,2,0 Is Consensus Acceleration Possible in Decentralized Optimization over Slowly Time-Varying Networks?,"Dmitry Metelev, Alexander Rogozin, Dmitry Kovalev, Alexander Gasnikov",http://arxiv.org/abs/2301.11817,,https://huggingface.co/papers/2301.11817,,,,2301.11817,4,0 Understanding the Complexity Gains of Single-Task RL with a Curriculum,"Qiyang Li, Yuexiang Zhai, Yi Ma, Sergey Levine",http://arxiv.org/abs/2212.12809,,https://huggingface.co/papers/2212.12809,,,,2212.12809,4,0 Fast Online Value-Maximizing Prediction Sets with Conformal Cost Control,"Zhen Lin, Shubhendu Trivedi, Cao Xiao, Jimeng Sun",http://arxiv.org/abs/2302.00839,https://github.com/zlin7/FavMac,https://huggingface.co/papers/2302.00839,,,,2302.00839,4,1 Discrete Key-Value Bottleneck,"Frederik Träuble, Anirudh Goyal, Nasim Rahaman, Michael Mozer, Kenji Kawaguchi, Yoshua Bengio, Bernhard Schölkopf",http://arxiv.org/abs/2207.11240,,https://huggingface.co/papers/2207.11240,,,,2207.11240,7,0 Concurrent Shuffle Differential Privacy Under Continual Observation,"Jay Tenenbaum, Haim Kaplan, Yishay Mansour, Uri Stemmer",http://arxiv.org/abs/2301.12535,,https://huggingface.co/papers/2301.12535,,,,2301.12535,4,0 Statistical Learning under Heterogenous Distribution Shift,"Max Simchowitz, Anurag Ajay, Pulkit Agrawal, Akshay Krishnamurthy",http://arxiv.org/abs/2302.13934,,https://huggingface.co/papers/2302.13934,,,,2302.13934,4,2 The Power of Learned Locally Linear Models for Nonlinear Policy Optimization,"Daniel Pfrommer, Max Simchowitz, Tyler Westenbroek, Nikolai Matni, Stephen Tu",http://arxiv.org/abs/2305.09619,,https://huggingface.co/papers/2305.09619,,,,2305.09619,5,0 Special Properties of Gradient Descent with Large Learning Rates,"Amirkeivan Mohtashami, Martin Jaggi, Sebastian Stich",http://arxiv.org/abs/2205.15142,,https://huggingface.co/papers/2205.15142,,,,2205.15142,3,0 PAC-Bayesian Offline Contextual Bandits With Guarantees,"Otmane Sakhi, Pierre Alquier, Nicolas Chopin",http://arxiv.org/abs/2210.13132,,https://huggingface.co/papers/2210.13132,,,,2210.13132,3,1 Scalable Adaptive Computation for Iterative Generation,"Allan Jabri, David Fleet, Ting Chen",http://arxiv.org/abs/2212.11972,,https://huggingface.co/papers/2212.11972,,,,2212.11972,3,0 Fairness in Streaming Submodular Maximization over a Matroid Constraint,"Marwa El Halabi, Federico Fusco, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski",http://arxiv.org/abs/2305.15118,,https://huggingface.co/papers/2305.15118,,,,2305.15118,5,0 Cluster Explanation via Polyhedral Descriptions,"Connor Lawless, Oktay Gunluk",http://arxiv.org/abs/2210.08798,,https://huggingface.co/papers/2210.08798,,,,2210.08798,2,0 PAC-Bayesian Generalization Bounds for Adversarial Generative Models,"Sokhna Diarra Mbacke, Florence Clerc, Pascal Germain",http://arxiv.org/abs/2302.08942,,https://huggingface.co/papers/2302.08942,,,,2302.08942,3,1 Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation,"Yifei Min, Jiafan He, Jiafan He, Tianhao Wang, Quanquan Gu",http://arxiv.org/abs/2305.06446,,https://huggingface.co/papers/2305.06446,,,,2305.06446,4,1 Consistency Models,"Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever",http://arxiv.org/abs/2303.01469,,https://huggingface.co/papers/2303.01469,,,,2303.01469,4,0 A Two-Stage Active Learning Algorithm for k-Nearest Neighbors,"Nicholas Rittler, Kamalika Chaudhuri",,,,,,,,, PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation,"Eli Chien, Jiong Zhang, Cho-Jui Hsieh, Jyun-Yu Jiang, Wei-Cheng Chang, Olgica Milenkovic, Hsiang-Fu Yu",http://arxiv.org/abs/2305.12349,,https://huggingface.co/papers/2305.12349,,,,2305.12349,7,0 Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models,"Wenhao Ding, Tong Che, Ding Zhao, Marco Pavone",http://arxiv.org/abs/2305.11340,,https://huggingface.co/papers/2305.11340,,,,2305.11340,4,0 CLUTR: Curriculum Learning via Unsupervised Task Representation Learning,"Abdus Salam Azad, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Aleksandra Faust, Pieter Abbeel, Ion Stoica",http://arxiv.org/abs/2210.10243,,https://huggingface.co/papers/2210.10243,,,,2210.10243,7,2 Robust Collaborative Learning with Linear Gradient Overhead,"Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Lê-Nguyên Hoang, Rafael Pinot, John Stephan",http://arxiv.org/abs/2209.10931,https://github.com/LPD-EPFL/robust-collaborative-learning,https://huggingface.co/papers/2209.10931,,,,2209.10931,6,1 RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents,"Rafael A Rodriguez-Sanchez, Benjamin Spiegel, Jennifer Wang, Roma Patel, Stefanie Tellex, George Konidaris",http://arxiv.org/abs/2208.06448,,https://huggingface.co/papers/2208.06448,,,,2208.06448,6,1 Adversarial Cheap Talk,"Christopher Lu, Timon Willi, Alistair Letcher, Jakob Foerster",http://arxiv.org/abs/2211.11030,,https://huggingface.co/papers/2211.11030,,,,2211.11030,4,1 Algorithmic Collective Action in Machine Learning,"Moritz Hardt, Eric Mazumdar, Celestine Mendler-Dünner, Tijana Zrnic",http://arxiv.org/abs/2302.04262,,https://huggingface.co/papers/2302.04262,,,,2302.04262,4,0 Improving Graph Generation by Restricting Graph Bandwidth,"Nathaniel Diamant, Alex Tseng, Kangway Chuang, Tommaso Biancalani, Gabriele Scalia",http://arxiv.org/abs/2301.10857,,https://huggingface.co/papers/2301.10857,,,,2301.10857,5,0 UMD: Unsupervised Model Detection for X2X Backdoor Attacks,"Zhen Xiang, Zidi Xiong, Bo Li",http://arxiv.org/abs/2305.18651,,https://huggingface.co/papers/2305.18651,,,,2305.18651,3,1 Constraint Reasoning Embedded Structured Prediction,"Nan Jiang, Maosen Zhang, Willem-Jan van Hoeve, Yexiang Xue",,,,,,,,, Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation,"Jiaming Song, Qinsheng Zhang, Hongxu Yin, Morteza Mardani, Ming-Yu Liu, Jan Kautz, Yongxin Chen, Arash Vahdat",,,,,,,,, Entropy-driven Unsupervised Keypoint Representation Learning in Videos,"Ali Younes, Simone Schaub-Meyer, Georgia Chalvatzaki",http://arxiv.org/abs/2209.15404,,https://huggingface.co/papers/2209.15404,,,,2209.15404,3,1 Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?,"Boris Knyazev, DOHA HWANG, Simon Lacoste-Julien",http://arxiv.org/abs/2303.04143,,https://huggingface.co/papers/2303.04143,,,,2303.04143,3,1 Private Statistical Estimation of Many Quantiles,"Clément Lalanne, Aurélien Garivier, Rémi Gribonval",http://arxiv.org/abs/2302.06943,,https://huggingface.co/papers/2302.06943,,,,2302.06943,3,0 Learning Neural PDE Solvers with Parameter-Guided Channel Attention,"Makoto Takamoto, Francesco Alesiani, Mathias Niepert",http://arxiv.org/abs/2304.14118,,https://huggingface.co/papers/2304.14118,,,,2304.14118,3,0 Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability,"Thomy Phan, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Nüßlein, Michael Kölle, Thomas Gabor, Claudia Linnhoff-Popien",http://arxiv.org/abs/2301.01649,,https://huggingface.co/papers/2301.01649,,,,2301.01649,8,1 Geometric Latent Diffusion Models for 3D Molecule Generation,"Minkai Xu, Alexander Powers, Ron Dror, Stefano Ermon, Jure Leskovec",http://arxiv.org/abs/2305.01140,https://github.com/MinkaiXu/GeoLDM,https://huggingface.co/papers/2305.01140,,,,2305.01140,5,1 Efficient preconditioned stochastic gradient descent for estimation in latent variable models,"Charlotte Baey, Maud DELATTRE, Estelle Kuhn, Jean-Benoist Leger, Sarah Lemler",,,,,,,,, Privacy-Aware Compression for Federated Learning Through Numerical Mechanism Design,"Chuan Guo, Kamalika Chaudhuri, Pierre Stock, Michael Rabbat",http://arxiv.org/abs/2211.03942,,https://huggingface.co/papers/2211.03942,,,,2211.03942,4,1 Auxiliary Modality Learning with Generalized Curriculum Distillation,"Yu Shen, Xijun Wang, Peng Gao, Ming Lin",,,,,,,,, Training Deep Surrogate Models with Large Scale Online Learning,"Lucas Meyer, Marc Schouler, Robert Caulk, Alejandro Ribes, Bruno Raffin",,,,,,,,, The Benefits of Model-Based Generalization in Reinforcement Learning,"Kenny Young, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber",http://arxiv.org/abs/2211.02222,,https://huggingface.co/papers/2211.02222,,,,2211.02222,4,1 A new near-linear time algorithm for k-nearest neighbor search using a compressed cover tree,"Yury Elkin, Vitaliy Kurlin",,,,,,,,, PAC Generalization via Invariant Representations,"Advait Parulekar, Karthikeyan Shanmugam, Sanjay Shakkottai",http://arxiv.org/abs/2205.15196,,https://huggingface.co/papers/2205.15196,,,,2205.15196,3,1 Counterfactual Analysis in Dynamic Latent State Models,"Martin Haugh, Raghav Singal",http://arxiv.org/abs/2205.13832,,https://huggingface.co/papers/2205.13832,,,,2205.13832,2,0 Fully Dynamic Submodular Maximization over Matroids,"PAUL DUETTING, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam",http://arxiv.org/abs/2305.19918,,https://huggingface.co/papers/2305.19918,,,,2305.19918,5,0 Can Forward Gradient Match Backpropagation?,"Louis Fournier, Stéphane Rivaud SORBONNE UNIVERSITE ISIR, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon",http://arxiv.org/abs/2306.06968,,https://huggingface.co/papers/2306.06968,,,,2306.06968,5,2 Decoding Layer Saliency in Transformers,"Elizabeth Hou, Gregory Castanon",,,,,,,,, On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization,"Mudit Gaur, Vaneet Aggarwal, Mridul Agarwal",,,,,,,,, Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits,"Ronshee Chawla, Daniel Vial, Sanjay Shakkottai, R Srikant",http://arxiv.org/abs/2305.18784,,https://huggingface.co/papers/2305.18784,,,,2305.18784,4,0 Divide and Conquer Dynamic Programming: An Almost Linear Time Change Point Detection Methodology in High Dimensions,"Wanshan Li, Daren Wang, Alessandro Rinaldo",http://arxiv.org/abs/2301.10942,,https://huggingface.co/papers/2301.10942,,,,2301.10942,3,1 Automatic Data Augmentation via Invariance-Constrained Learning,"Ignacio Hounie, Luiz Chamon, Alejandro Ribeiro",http://arxiv.org/abs/2209.15031,,https://huggingface.co/papers/2209.15031,,,,2209.15031,3,0 Sequential Counterfactual Risk Minimization,"Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard",http://arxiv.org/abs/2302.12120,,https://huggingface.co/papers/2302.12120,,,,2302.12120,5,0 Sequential Monte Carlo Learning for Time Series Structure Discovery,"Feras Saad, Brian Patton, Matthew Hoffman, Rif Saurous, Vikash Mansinghka",,,,,,,,, Robust Speech Recognition via Large-Scale Weak Supervision,"Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever",http://arxiv.org/abs/2212.04356,,https://huggingface.co/papers/2212.04356,,,,2212.04356,6,0 Neural Algorithmic Reasoning with Causal Regularisation,"Beatrice Bevilacqua, Kyriacos Nikiforou, Borja Ibarz, Ioana Bica, Michela Paganini, Charles Blundell, Jovana Mitrovic, Petar Veličković",http://arxiv.org/abs/2302.10258,,https://huggingface.co/papers/2302.10258,,,,2302.10258,8,0 Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs,"Saro Passaro, Larry Zitnick",http://arxiv.org/abs/2302.03655,,https://huggingface.co/papers/2302.03655,,,,2302.03655,2,0 Probabilistic Categorical Adversarial Attack and Adversarial Training,"Han Xu, Pengfei He, Jie Ren, Yuxuan Wan, Zitao Liu, Hui Liu, Jiliang Tang",,,,,,,,, Linear optimal partial transport embedding,"Yikun Bai, Ivan Medri, Rocio Diaz Martin, Rana Muhammad Shahroz Khan, Soheil Kolouri",http://arxiv.org/abs/2302.03232,,https://huggingface.co/papers/2302.03232,,,,2302.03232,5,0 Graph Generative Model for Benchmarking Graph Neural Networks,"Minji Yoon, Yue Wu, John Palowitch, Bryan Perozzi, Ruslan Salakhutdinov",http://arxiv.org/abs/2207.04396,,https://huggingface.co/papers/2207.04396,,,,2207.04396,5,0 Differentiable and Transportable Structure Learning,"Jeroen Berrevoets, Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar",http://arxiv.org/abs/2206.06354,,https://huggingface.co/papers/2206.06354,,,,2206.06354,4,0 "Pricing Experimental Design: Causal Effect, Expected Revenue and Tail Risk","David Simchi-Levi, Chonghuan Wang",,,,,,,,, Deep Latent State Space Models for Time-Series Generation,"Linqi Zhou, Michael Poli, Winnie Xu, Stefano Massaroli, Stefano Ermon",http://arxiv.org/abs/2212.12749,,https://huggingface.co/papers/2212.12749,,,,2212.12749,5,2 Curious Replay for Model-based Adaptation,"Isaac Kauvar, Chris Doyle, Linqi Zhou, Nick Haber",,,,,,,,, HOPE: High-order Graph ODE For Modeling Interacting Dynamics,"Xiao Luo, Jingyang Yuan, Zijie Huang, Huiyu Jiang, Yifang Qin, Wei Ju, Ming Zhang, Yizhou Sun",,,,,,,,, Adversarial Classification: Necessary Conditions and Geometric Flows,"Nicolas Garcia Trillos, Ryan Murray",http://arxiv.org/abs/2011.10797,,https://huggingface.co/papers/2011.10797,,,,2011.10797,2,0 On User-Level Private Convex Optimization,"Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Raghu Meka, Chiyuan Zhang",http://arxiv.org/abs/2305.04912,,https://huggingface.co/papers/2305.04912,,,,2305.04912,6,0 Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes,"Jiafan He, Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu",,,,,,,,, Abstracting Imperfect Information Away from Two-Player Zero-Sum Games,"Samuel Sokota, Ryan D'Orazio, Chun Kai Ling, David Wu, Zico Kolter, Noam Brown",http://arxiv.org/abs/2301.09159,,https://huggingface.co/papers/2301.09159,,,,2301.09159,6,0 LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning,"Timothy Castiglia, Yi Zhou, Shiqiang Wang, Swanand Kadhe, Nathalie Baracaldo, Stacy Patterson",,,,,,,,, Global Selection of Contrastive Batches via Optimization on Sample Permutations,"Vin Sachidananda, Ziyi Yang, Chenguang Zhu",,,,,,,,, Nonparametric Density Estimation under Distribution Drift,"Alessio Mazzetto, Eli Upfal",http://arxiv.org/abs/2302.02460,,https://huggingface.co/papers/2302.02460,,,,2302.02460,2,0 Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality,"Dhruv Malik, Conor Igoe, Yuanzhi Li, Aarti Singh",http://arxiv.org/abs/2305.02955,,https://huggingface.co/papers/2305.02955,,,,2305.02955,4,0 Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions,"Boxiang Lyu, Zhe Feng, Zach Robertson, Sanmi Koyejo",http://arxiv.org/abs/2306.01799,,https://huggingface.co/papers/2306.01799,,,,2306.01799,4,1 Generative Graph Dictionary Learning,"Zhichen Zeng, Ruike Zhu, Yinglong Xia, Hanqing Zeng, Hanghang Tong",,,,,,,,, The multimarginal optimal transport formulation of adversarial multiclass classification,"Nicolas Garcia Trillos, Matt Jacobs, Jakwang Kim",http://arxiv.org/abs/2204.12676,,https://huggingface.co/papers/2204.12676,,,,2204.12676,3,0 Recasting Self-Attention with Holographic Reduced Representations,"Mohammad Mahmudul Alam, Edward Raff, Stella Biderman, Tim Oates, James Holt",http://arxiv.org/abs/2305.19534,https://github.com/NeuromorphicComputationResearchProgram/Hrrformer,https://huggingface.co/papers/2305.19534,,,,2305.19534,5,1 ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical Imaging,"Alessandro Fontanella, Antreas Antoniou, Wenwen Li, Joanna Wardlaw, Grant Mair, Emanuele Trucco, Amos Storkey",http://arxiv.org/abs/2303.15421,,https://huggingface.co/papers/2303.15421,,,,2303.15421,7,0 Benign Overfitting in Deep Neural Networks under Lazy Training,"Zhenyu Zhu, Fanghui Liu, Grigorios Chrysos, Francesco Locatello, Volkan Cevher",http://arxiv.org/abs/2305.19377,,https://huggingface.co/papers/2305.19377,,,,2305.19377,5,0 A Hybrid Quantum-Classical Approach based on the Hadamard Transform for the Convolutional Layer,"Hongyi Pan, Xin Zhu, Salih Furkan Atici, Ahmet Cetin",http://arxiv.org/abs/2305.17510,,https://huggingface.co/papers/2305.17510,,,,2305.17510,4,0 Coder Reviewer Reranking for Code Generation,"Tianyi Zhang, Tao Yu, Tatsunori Hashimoto, Mike Lewis, Scott Yih, Daniel Fried, Sida Wang",http://arxiv.org/abs/2211.16490,,https://huggingface.co/papers/2211.16490,,,,2211.16490,7,0 Causal Isotonic Calibration for Heterogeneous Treatment Effects,"Lars van der Laan, Ernesto Ulloa-Perez, Marco Carone, Alex Luedtke",http://arxiv.org/abs/2302.14011,,https://huggingface.co/papers/2302.14011,,,,2302.14011,4,0 Approximation Algorithms for Fair Range Clustering,"Sedjro Salomon Hotegni, Sepideh Mahabadi, Ali Vakilian",http://arxiv.org/abs/2306.06778,,https://huggingface.co/papers/2306.06778,,,,2306.06778,3,1 Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration,"Blaise Delattre, Quentin Barthélemy, Alexandre Araujo, Alexandre Allauzen",http://arxiv.org/abs/2305.16173,https://github.com/blaisedelattre/lip4conv,https://huggingface.co/papers/2305.16173,,,,2305.16173,4,0 Markovian Gaussian Process Variational Autoencoders,"Harrison Zhu, Carles Balsells Rodas, Yingzhen Li",http://arxiv.org/abs/2207.05543,,https://huggingface.co/papers/2207.05543,,,,2207.05543,3,0 Likelihood Adjusted Semidefinite Programs for Clustering Heterogeneous Data,"Yubo Zhuang, Xiaohui Chen, Yun Yang",http://arxiv.org/abs/2209.15097,,https://huggingface.co/papers/2209.15097,,,,2209.15097,3,0 Dimensionality Reduction for General KDE Mode Finding,"Xinyu Luo, Christopher Musco, Cas Widdershoven",http://arxiv.org/abs/2305.18755,,https://huggingface.co/papers/2305.18755,,,,2305.18755,3,0 SAM operates far from home: eigenvalue regularization as a dynamical phenomenon,"Atish Agarwala, Yann Nicolas Dauphin",http://arxiv.org/abs/2302.08692,,https://huggingface.co/papers/2302.08692,,,,2302.08692,2,0 Proximal Causal Learning of Conditional Average Treatment Effects,"Erik Sverdrup, Yifan Cui",http://arxiv.org/abs/2301.10913,,https://huggingface.co/papers/2301.10913,,,,2301.10913,2,0 Towards Robust Graph Incremental Learning on Evolving Graphs,"Junwei Su, Difan Zou, Zijun Zhang, Chuan Wu",,,,,,,,, Incentivizing Exploration with Linear Contexts and Combinatorial Actions,Mark Sellke,http://arxiv.org/abs/2306.01990,,https://huggingface.co/papers/2306.01990,,,,2306.01990,1,0 Interpretable Neural-Symbolic Concept Reasoning,"Pietro Barbiero, Gabriele Ciravegna, Francesco Giannini, Mateo Espinosa Zarlenga, Lucie Charlotte Magister, Alberto Tonda, Pietro Lió, Frederic Precioso, Mateja Jamnik, Giuseppe Marra",http://arxiv.org/abs/2304.14068,,https://huggingface.co/papers/2304.14068,,,,2304.14068,10,0 Coordinate Descent Methods for Fractional Minimization,Ganzhao Yuan,http://arxiv.org/abs/2201.12691,,https://huggingface.co/papers/2201.12691,,,,2201.12691,1,0 Controlling Posterior Collapse by an Inverse Lipschitz Constraint on the Decoder Network,"Yuri Kinoshita, Kenta Oono, Kenji Fukumizu, Yuichi Yoshida, Shin-ichi Maeda",http://arxiv.org/abs/2304.12770,,https://huggingface.co/papers/2304.12770,,,,2304.12770,5,0 How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control,"Jacopo Teneggi, Matthew Tivnan, Web Stayman, Jeremias Sulam",http://arxiv.org/abs/2302.03791,,https://huggingface.co/papers/2302.03791,,,,2302.03791,4,1 On the Stepwise Nature of Self-Supervised Learning,"James B Simon, Maksis Knutins, Liu Ziyin, Daniel Geisz, Abraham Fetterman, Joshua Albrecht",http://arxiv.org/abs/2303.15438,,https://huggingface.co/papers/2303.15438,,,,2303.15438,6,1 A Kernel-Based View of Language Model Fine-Tuning,"Sadhika Malladi, Alexander Wettig, Dingli Yu, Danqi Chen, Sanjeev Arora",http://arxiv.org/abs/2210.05643,,https://huggingface.co/papers/2210.05643,,,,2210.05643,5,0 The Impact of Exploration on Convergence and Performance of Multi-Agent Q-Learning Dynamics,"Aamal Hussain, Francesco Belardinelli, Dario Paccagnan",,,,,,,,, Data-Copying in Generative Models: A Formal Framework,"Robi Bhattacharjee, Sanjoy Dasgupta, Kamalika Chaudhuri",http://arxiv.org/abs/2302.13181,,https://huggingface.co/papers/2302.13181,,,,2302.13181,3,0 Deep Regression Unlearning,"Ayush Tarun, Vikram Chundawat, Murari Mandal, Mohan Kankanhalli",http://arxiv.org/abs/2210.08196,https://github.com/ayu987/deep-regression-unlearning,https://huggingface.co/papers/2210.08196,,,,2210.08196,4,1 DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models ,"Liangbin Xie, Xintao Wang, Xiangyu Chen, Gen Li, Ying Shan, Jiantao Zhou, Chao Dong",,,,,,,,, B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding,"Miruna Oprescu, Jacob Dorn, Marah Ghoummaid, Andrew Jesson, Nathan Kallus, Uri Shalit",,,,,,,,, How to address monotonicity for model risk management?,"Dangxing Chen, Weicheng Ye",http://arxiv.org/abs/2305.00799,,https://huggingface.co/papers/2305.00799,,,,2305.00799,2,0 Learning Deep Time-index Models for Time Series Forecasting,"Gerald Woo, Chenghao Liu, Doyen Sahoo, Akshat Kumar, Steven Hoi",http://arxiv.org/abs/2207.06046,https://github.com/salesforce/DeepTime,https://huggingface.co/papers/2207.06046,,,,2207.06046,5,2 Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning,"gang Ding, Wanpeng Zhang, Junpeng Yue, XJ Wang, Tiejun Huang, Zongqing Lu",http://arxiv.org/abs/2210.13942,,https://huggingface.co/papers/2210.13942,,,,2210.13942,6,0 Eventual Discounting Temporal Logic Counterfactual Experience Replay,"Cameron Voloshin, Abhinav Verma, Yisong Yue",http://arxiv.org/abs/2303.02135,,https://huggingface.co/papers/2303.02135,,,,2303.02135,3,0 Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints,"Jiayang Li, Jing Yu, Boyi Liu, Yu Nie, Zhaoran Wang",http://arxiv.org/abs/2302.09734,,https://huggingface.co/papers/2302.09734,,,,2302.09734,5,0 Wrapped Cauchy Distributed Angular Softmax for Long-Tailed Visual Recognition,Boran Han,http://arxiv.org/abs/2305.18732,,https://huggingface.co/papers/2305.18732,,,,2305.18732,1,1 Policy Gradient in Robust MDPs with Global Convergence Guarantee,"Qiuhao Wang, Chin Pang Ho, Marek Petrik",http://arxiv.org/abs/2212.10439,,https://huggingface.co/papers/2212.10439,,,,2212.10439,3,0 Addressing Budget Allocation and Revenue Allocation in Data Market Environment Using an Adaptive Sampling Algorithm,"Boxin Zhao, Boxiang Lyu, Raul Castro Fernandez, Mladen Kolar",,,,,,,,, Theoretical Bounds on the Network Community Profile from Low-rank Semi-definite Programming,"Yufan Huang, C. Seshadhri, David Gleich",http://arxiv.org/abs/2303.14550,,https://huggingface.co/papers/2303.14550,,,,2303.14550,3,0 Learning Mixtures of Gaussians with Censored Data,"Wai Ming Tai, Bryon Aragam",http://arxiv.org/abs/2305.04127,,https://huggingface.co/papers/2305.04127,,,,2305.04127,2,0 An Information-Theoretic Analysis of Nonstationary Bandit Learning,"Seungki Min, Daniel Russo",http://arxiv.org/abs/2302.04452,,https://huggingface.co/papers/2302.04452,,,,2302.04452,2,0 Inverse Reinforcement Learning without Reinforcement Learning,"Gokul Swamy, David Wu, Sanjiban Choudhury, J. Bagnell, Steven Wu",http://arxiv.org/abs/2303.14623,,https://huggingface.co/papers/2303.14623,,,,2303.14623,4,0 Efficient Online Reinforcement Learning with Offline Data,"Philip Ball, Laura Smith, Ilya Kostrikov, Sergey Levine",http://arxiv.org/abs/2302.02948,https://github.com/ikostrikov/rlpd,https://huggingface.co/papers/2302.02948,,,,2302.02948,4,1 When Sparsity Meets Contrastive Models: Less Graph Data Can Bring Better Class-Balanced Representations,"Chunhui Zhang, Chao Huang, Yijun Tian, Qianlong Wen, Zhongyu Ouyang, Youhuan Li, Yanfang Ye, Chuxu Zhang",,,,,,,,, Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning,"Yulai Zhao, Zhuoran Yang, Zhaoran Wang, Jason Lee",http://arxiv.org/abs/2305.04819,,https://huggingface.co/papers/2305.04819,,,,2305.04819,4,3 Learning Globally Smooth Functions on Manifolds,"Juan Cervino, Luiz Chamon, Benjamin Haeffele, Rene Vidal, Alejandro Ribeiro",http://arxiv.org/abs/2210.00301,,https://huggingface.co/papers/2210.00301,,,,2210.00301,5,0 Poisoning Generative Replay in Continual Learning to Promote Forgetting,"Siteng Kang, Zhan Shi, Xinhua Zhang",,,,,,,,, Toward Large Kernel Models,"Amirhesam Abedsoltan, Misha Belkin, Parthe Pandit",http://arxiv.org/abs/2302.02605,,https://huggingface.co/papers/2302.02605,,,,2302.02605,3,0 ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models,"Qinglong Tian, Xin Zhang, Jiwei Zhao",http://arxiv.org/abs/2305.19123,,https://huggingface.co/papers/2305.19123,,,,2305.19123,3,0 Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute,"Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Joshua Ainslie, Sumit Sanghai, Fei Sha, William Cohen",http://arxiv.org/abs/2301.10448,,https://huggingface.co/papers/2301.10448,,,,2301.10448,7,0 Mixture Proportion Estimation Beyond Irreducibility,"Yilun Zhu, Aaron Fjeldsted, Darren Holland, George Landon, Azaree Lintereur, Clay Scott",http://arxiv.org/abs/2306.01253,,https://huggingface.co/papers/2306.01253,,,,2306.01253,6,0 Difference-in-Differences Meets Tree-based Methods: Heterogeneous Treatment Effects Estimation with Unmeasured Confounding,"Caizhi Tang, Huiyuan Wang, Xinyu Li, Qing Cui, Longfei Li, JUN ZHOU",,,,,,,,, Test-Time Style Shifting: Handling Arbitrary Styles in Domain Generalization,"Jungwuk Park, Dong-Jun Han, Soyeong Kim, Jaekyun Moon",http://arxiv.org/abs/2306.04911,,https://huggingface.co/papers/2306.04911,,,,2306.04911,4,0 Distributional Offline Policy Evaluation with Predictive Error Guarantees,"Runzhe Wu, Masatoshi Uehara, Wen Sun",http://arxiv.org/abs/2302.09456,,https://huggingface.co/papers/2302.09456,,,,2302.09456,3,0 FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization,"Jung Hyun Lee, Jeonghoon Kim, Se Jung Kwon, Dongsoo Lee",http://arxiv.org/abs/2306.00317,,https://huggingface.co/papers/2306.00317,,,,2306.00317,4,1 GREAD: Graph Neural Reaction-Diffusion Networks,"Jeongwhan Choi, Seoyoung Hong, Noseong Park, Sung-Bae Cho",http://arxiv.org/abs/2211.14208,,https://huggingface.co/papers/2211.14208,,,,2211.14208,4,1 Towards Trustworthy Explanation: On Causal Rationalization,"Wenbo Zhang, TONG WU, Yunlong Wang, Yong Cai, Hengrui Cai",,,,,,,,, Improved Algorithms for White-Box Adversarial Streams,"Ying Feng, David Woodruff",,,,,,,,, Towards Understanding Ensemble Distillation in Federated Learning,"Sejun Park, Kihun Hong, Ganguk Hwang",,,,,,,,, Does Sparsity Help in Learning Misspecified Linear Bandits?,"Jialin Dong, Lin Yang",http://arxiv.org/abs/2303.16998,,https://huggingface.co/papers/2303.16998,,,,2303.16998,2,0 On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline,"Nicklas Hansen, Zhecheng Yuan, Yanjie Ze, Tongzhou Mu, Aravind Rajeswaran, Hao Su, Huazhe Xu, Xiaolong Wang",http://arxiv.org/abs/2212.05749,,https://huggingface.co/papers/2212.05749,,,,2212.05749,8,2 PLay: Parametrically Conditioned Layout Generation using Latent Diffusion,"Chin-Yi Cheng, Forrest Huang, Gang Li, Yang Li",http://arxiv.org/abs/2301.11529,,https://huggingface.co/papers/2301.11529,,,,2301.11529,4,0 CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis,"Chaejeong Lee, Jayoung Kim, Noseong Park",http://arxiv.org/abs/2304.12654,,https://huggingface.co/papers/2304.12654,,,,2304.12654,3,0 Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling,"Xinlu Zhang, Shiyang Li, Zhiyu Chen, Xifeng Yan, Linda Petzold",http://arxiv.org/abs/2210.12156,,https://huggingface.co/papers/2210.12156,,,,2210.12156,5,1 EF21-P and Friends: Improved Theoretical Communication Complexity for Distributed Optimization with Bidirectional Compression,"Kaja Gruntkowska, Alexander Tyurin, Peter Richtarik",,,,,,,,, Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies,"Ilyas Fatkhullin, Anas Barakat, Anastasia Kireeva, Niao He",http://arxiv.org/abs/2302.01734,,https://huggingface.co/papers/2302.01734,,,,2302.01734,4,0 Learning to Bid in Repeated First-Price Auctions with Budgets,"Qian Wang, Zongjun Yang, Xiaotie Deng, Yuqing Kong",http://arxiv.org/abs/2304.13477,,https://huggingface.co/papers/2304.13477,,,,2304.13477,4,0 On the Convergence of Federated Averaging with Cyclic Client Participation,"Yae Jee Cho, PRANAY SHARMA, Gauri Joshi, Zheng Xu, Satyen Kale, Tong Zhang",http://arxiv.org/abs/2302.03109,,https://huggingface.co/papers/2302.03109,,,,2302.03109,6,0 Estimating Joint Treatment Effects by Combining Multiple Experiments,"Yonghan Jung, Jin Tian, Elias Bareinboim",,,,,,,,, From Robustness to Privacy and Back,"Hilal Asi, Jonathan Ullman, Lydia Zakynthinou",http://arxiv.org/abs/2302.01855,,https://huggingface.co/papers/2302.01855,,,,2302.01855,3,0 MALTS: Matching After Learning to Stretch,"Harsh Parikh, Cynthia Rudin, Alexander Volfovsky",http://arxiv.org/abs/1811.07415,,https://huggingface.co/papers/1811.07415,,,,1811.07415,3,0 "System identification of neural systems: If we got it right, would we know?","Yena Han, Tomaso A Poggio, Brian Cheung",http://arxiv.org/abs/2302.06677,,https://huggingface.co/papers/2302.06677,,,,2302.06677,3,0 Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits,"Heyang Zhao, Dongruo Zhou, Jiafan He, Jiafan He, Quanquan Gu",http://arxiv.org/abs/2202.13603,,https://huggingface.co/papers/2202.13603,,,,2202.13603,4,1 Federated Hypergradient Computation via Aggregated Iterative Differentiation,"Peiyao Xiao, Kaiyi Ji",,,,,,,,, Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning,"Yuxin Tang, Zhimin Ding, Dimitrije Jankov, Binhang Yuan, Daniel Bourgeois, Chris Jermaine",http://arxiv.org/abs/2306.00088,,https://huggingface.co/papers/2306.00088,,,,2306.00088,6,0 Understanding Backdoor Attacks through the Adaptability Hypothesis,"Xun Xian, Ganghua Wang, Jayanth Srinivasa, Ashish Kundu, Xuan Bi, Mingyi Hong, Jie Ding",,,,,,,,, Towards Robust and Safe Reinforcement Learning with Benign Off-policy Data,"Zuxin Liu, Zijian Guo, Zhepeng Cen, Huan Zhang, Yihang Yao, Hanjiang Hu, Ding Zhao",,,,,,,,, Atari-5: Distilling the Arcade Learning Environment down to Five Games,"Matthew Aitchison, Penny Sweetser, Marcus Hutter",,,,,,,,, "Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC","Yilun Du, Conor Durkan, Robin Strudel, Josh Tenenbaum, Sander Dieleman, Rob Fergus, Jascha Sohl-Dickstein, Arnaud Doucet, Will Grathwohl",http://arxiv.org/abs/2302.11552,,https://huggingface.co/papers/2302.11552,,,,2302.11552,9,1 Tighter Analysis for ProxSkip,"Zhengmian Hu, Heng Huang",,,,,,,,, """Why did the Model Fail?"": Attributing Model Performance Changes to Distribution Shifts","Haoran Zhang, Harvineet Singh, Marzyeh Ghassemi, Shalmali Joshi",http://arxiv.org/abs/2210.10769,,https://huggingface.co/papers/2210.10769,,,,2210.10769,4,1 Fast Rates in Time-Varying Strongly Monotone Games,"Yu-Hu Yan, Peng Zhao, Zhi-Hua Zhou",,,,,,,,, Understanding Oversquashing in GNNs through the Lens of Effective Resistance,"Mitchell Black, Zhengchao Wan, Amir Nayyeri, Yusu Wang",http://arxiv.org/abs/2302.06835,,https://huggingface.co/papers/2302.06835,,,,2302.06835,4,0 Traversing Between Modes in Function Space for Fast Ensembling,"Eunggu Yun, Hyungi Lee, Giung Nam, Juho Lee",,,,,,,,, Optimal Sets and Solution Paths of ReLU Networks,"Aaron Mishkin, Mert Pilanci",http://arxiv.org/abs/2306.00119,,https://huggingface.co/papers/2306.00119,,,,2306.00119,2,1 Towards a Persistence Diagram that is Robust to Noise and Varied Densities,"Hang Zhang, Kaifeng Zhang, Kai Ming Ting, Ye Zhu",,,,,,,,, Understanding and Generalizing Contrastive Learning from the Inverse Optimal Transport Perspective,"Liangliang Shi, Gu Zhang, Haoyu Zhen, Jintao Fan, Junchi Yan",,,,,,,,, On the Impact of Algorithmic Recourse on Social Segregation,"Ruijiang Gao, Himabindu Lakkaraju",,,,,,,,, A Universal Unbiased Method for Classification from Aggregate Observations,"Zixi Wei, LEI FENG, Bo Han, Tongliang Liu, Gang Niu, Xiaofeng Zhu, Heng Tao Shen",,,,,,,,, "Neural Latent Aligner: Cross-trial Alignment for Learning Representations of Complex, Naturalistic Neural Data","Cheol Jun Cho, Edward Chang, Gopala Anumanchipalli",,,,,,,,, Cyclic Block Coordinate Descent With Variance Reduction for Composite Nonconvex Optimization,"Xufeng Cai, Chaobing Song, Stephen Wright, Jelena Diakonikolas",http://arxiv.org/abs/2212.05088,,https://huggingface.co/papers/2212.05088,,,,2212.05088,4,1 Meta-Learning the Inductive Bias of Simple Neural Circuits,"Will Dorrell, Maria Yuffa, Peter Latham",,,,,,,,, Secure Federated Correlation Test and Entropy Estimation,"Qi Pang, Lun Wang, Shuai Wang, Wenting Zheng, Dawn Song",,,,,,,,, SpENCNN: Orchestrating Encoding and Sparsity for Fast Homomorphically Encrypted Neural Network Inference,"Ran Ran, Xinwei Luo, Wei Wang, Tao Liu, Gang Quan, Xiaolin Xu, Caiwen Ding, Wujie Wen",,,,,,,,, Streaming Active Learning with Deep Neural Networks,"Akanksha Saran, Safoora Yousefi, Akshay Krishnamurthy, John Langford, Jordan Ash",http://arxiv.org/abs/2303.02535,,https://huggingface.co/papers/2303.02535,,,,2303.02535,5,1 Hiding Data Helps: On the Benefits of Masking for Sparse Coding,"Muthu Chidambaram, Chenwei Wu, Yu Cheng, Rong Ge",http://arxiv.org/abs/2302.12715,,https://huggingface.co/papers/2302.12715,,,,2302.12715,4,0 Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path,"Qiwei Di, Jiafan He, Jiafan He, Dongruo Zhou, Quanquan Gu",,,,,,,,, Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling,"Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox",http://arxiv.org/abs/2301.12050,,https://huggingface.co/papers/2301.12050,,,,2301.12050,7,0 Learning Regions of Interest for Bayesian Optimization with Adaptive Level-Set Estimation,"Fengxue Zhang, Jialin Song, James Bowden, Alexander Ladd, Yisong Yue, Thomas Desautels, Yuxin Chen",,,,,,,,, Biases in Evaluation of Molecular Optimization Methods and Bias Reduction Strategies,"Hiroshi Kajino, Kohei Miyaguchi, Takayuki Osogami",,,,,,,,, Stabilizing GANs' Training with Brownian Motion Controller,"Tianjiao Luo, Ziyu Zhu, Jianfei Chen, Jun Zhu",,,,,,,,, Hierarchical Neural Coding for Controllable CAD Model Generation,"Xiang Xu, Pradeep Kumar Jayaraman, Joseph G Lambourne, Karl Willis, Yasutaka Furukawa",,,,,,,,, One-sided Matrix Completion from Two Observations Per Row,"Steven Cao, Percy Liang, Greg Valiant",http://arxiv.org/abs/2306.04049,,https://huggingface.co/papers/2306.04049,,,,2306.04049,3,0 Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation,"Zechu Li, Tao Chen, Zhang-Wei Hong, Anurag Ajay, Pulkit Agrawal",,,,,,,,, Efficient List-Decodable Regression using Batches,"Abhimanyu Das, Ayush Jain, Weihao Kong, Rajat Sen",http://arxiv.org/abs/2211.12743,,https://huggingface.co/papers/2211.12743,,,,2211.12743,4,0 Proper Scoring Rules for Survival Analysis,Hiroki Yanagisawa,http://arxiv.org/abs/2305.00621,,https://huggingface.co/papers/2305.00621,,,,2305.00621,1,0 GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks,"Yuwen Li, Miao Xiong, Bryan Hooi",http://arxiv.org/abs/2306.00015,,https://huggingface.co/papers/2306.00015,,,,2306.00015,3,0 Large Language Models Can Be Easily Distracted by Irrelevant Context,"Haoyue Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou",http://arxiv.org/abs/2302.00093,,https://huggingface.co/papers/2302.00093,,,,2302.00093,8,1 Temporally Consistent Transformers for Video Generation,"Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel",http://arxiv.org/abs/2210.02396,,https://huggingface.co/papers/2210.02396,,,,2210.02396,4,1 Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback,"Wonyoung Kim, Garud Iyengar, Assaf Zeevi",http://arxiv.org/abs/2301.13791,,https://huggingface.co/papers/2301.13791,,,,2301.13791,3,0 Scaling Laws for Generative Mixed-Modal Language Models,"Armen Aghajanyan, LILI YU, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, Susan Zhang, Stephen Roller, Naman Goyal, Omer Levy, Luke Zettlemoyer",http://arxiv.org/abs/2301.03728,,https://huggingface.co/papers/2301.03728,,,,2301.03728,10,0 Anchor Sampling for Federated Learning with Partial Client Participation,"Feijie Wu, Song Guo, Zhihao Qu, Shiqi He, Ziming Liu, Jing Gao",http://arxiv.org/abs/2206.05891,,https://huggingface.co/papers/2206.05891,,,,2206.05891,6,0 Learning to Initiate and Reason in Event-Driven Cascading Processes,"Yuval Atzmon, Eli Meirom, Shie Mannor, Gal Chechik",,,,,,,,, GOAT: A Global Transformer on Large-scale Graphs,"Kezhi Kong, Jiuhai Chen, John Kirchenbauer, Renkun Ni, C. Bayan Bruss, Tom Goldstein",,,,,,,,, One-Shot Compression of Large Edge-Exchangeable Graphs using Bits-Back Coding,"Daniel Severo, James Townsend, Ashish Khisti, Alireza Makhzani",,,,,,,,, Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback,"Tal Lancewicki, Aviv Rosenberg, Dmitry Sotnikov",http://arxiv.org/abs/2305.07911,,https://huggingface.co/papers/2305.07911,,,,2305.07911,3,0 Restoration based Generative Models,"Jaemoo Choi, Yesom Park, Myungjoo Kang",http://arxiv.org/abs/2303.05456,,https://huggingface.co/papers/2303.05456,,,,2303.05456,3,0 Accelerated Infeasibility Detection of Constrained Optimization and Fixed-Point Iterations,"Jisun Park, Ernest Ryu",http://arxiv.org/abs/2303.15876,,https://huggingface.co/papers/2303.15876,,,,2303.15876,2,0 Integrating Prior Knowledge in Contrastive Learning with Kernel,"Benoit Dufumier, Carlo Alberto Barbano, Robin Louiset, Edouard Duchesnay, Pietro Gori",http://arxiv.org/abs/2206.01646,,https://huggingface.co/papers/2206.01646,,,,2206.01646,5,0 Adaptive Compositional Continual Meta-Learning,"Bin Wu, Jinyuan Fang, xiangxiang Zeng, Shangsong Liang, Qiang Zhang",,,,,,,,, Bandits with Knapsacks: Advice on Time-Varying Demands,"Lixing Lyu, Wang Chi Cheung",,,,,,,,, Emergent Agentic Transformer from Chain of Hindsight Experience,"Hao Liu, Pieter Abbeel",http://arxiv.org/abs/2305.16554,,https://huggingface.co/papers/2305.16554,,,,2305.16554,2,1 Graph Neural Networks with Learnable and Optimal Polynomial Bases,"Yuhe Guo, Zhewei Wei",http://arxiv.org/abs/2302.12432,,https://huggingface.co/papers/2302.12432,,,,2302.12432,2,0 Hyperbolic Diffusion Embedding and Distance for Hierarchical Representation Learning,"Ya-Wei Eileen Lin, Ronald Coifman, Gal Mishne, Ronen Talmon",http://arxiv.org/abs/2305.18962,,https://huggingface.co/papers/2305.18962,,,,2305.18962,4,0 Optimizing Hyperparameters with Conformal Quantile Regression,"David Salinas, Jacek Golebiowski, Aaron Klein, Matthias Seeger, Cedric Archambeau",http://arxiv.org/abs/2305.03623,,https://huggingface.co/papers/2305.03623,,,,2305.03623,5,1 Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards,"Yulian Wu, Xingyu Zhou, Sayak Ray Chowdhury, Di Wang",http://arxiv.org/abs/2306.01121,,https://huggingface.co/papers/2306.01121,,,,2306.01121,4,0 SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient,"Max Ryabinin, Tim Dettmers, Michael Diskin, Alexander Borzunov",http://arxiv.org/abs/2301.11913,,https://huggingface.co/papers/2301.11913,,,,2301.11913,4,1 Improved Active Multi-Task Representation Learning via Lasso,"Yiping Wang, Yifang Chen, Kevin Jamieson, Simon Du",http://arxiv.org/abs/2306.02556,,https://huggingface.co/papers/2306.02556,,,,2306.02556,4,0 SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification,"Pranjal Aggarwal, Ameet Deshpande, Karthik Narasimhan",,,,,,,,, Emergence of Sparse Representations from Noise,"Trenton Bricken, Rylan Schaeffer, Bruno Olshausen, Gabriel Kreiman",,,,,,,,, GFlowOut: Dropout with Generative Flow Networks,"Dianbo Liu, Moksh Jain, Bonaventure F. P. Dossou, Qianli Shen, Salem Lahlou, Anirudh Goyal, Nikolay Malkin, Chris Emezue, Dinghuai Zhang, Nadhir Hassen, Xu Ji, Kenji Kawaguchi, Yoshua Bengio",http://arxiv.org/abs/2210.12928,,https://huggingface.co/papers/2210.12928,,,,2210.12928,13,1 End-to-End Learning for Stochastic Optimization: A Bayesian Perspective,"Yves Rychener, Daniel Kuhn, Tobias Sutter",http://arxiv.org/abs/2306.04174,,https://huggingface.co/papers/2306.04174,,,,2306.04174,3,0 Towards Practical Preferential Bayesian Optimization with Skew Gaussian Processes,"Shion Takeno, Masahiro Nomura, Masayuki Karasuyama",http://arxiv.org/abs/2302.01513,,https://huggingface.co/papers/2302.01513,,,,2302.01513,3,0 Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling,"Yunfan Li, Yiran Wang, Yu Cheng, Lin Yang",http://arxiv.org/abs/2306.09554,,https://huggingface.co/papers/2306.09554,,,,2306.09554,4,0 A Theoretical Analysis of the Learning Dynamics under Class Imbalance,"Emanuele Francazi, Marco Baity-Jesi, Aurelien Lucchi",http://arxiv.org/abs/2207.00391,,https://huggingface.co/papers/2207.00391,,,,2207.00391,3,1 Dynamic Constrained Submodular Optimization with Polylogarithmic Update Time,"Kiarash Banihashem, Leyla Biabani, Samira Goudarzi, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, Morteza Monemizadeh",http://arxiv.org/abs/2305.15192,,https://huggingface.co/papers/2305.15192,,,,2305.15192,6,0 Out-of-Domain Robustness via Targeted Augmentations,"Irena Gao, Shiori Sagawa, Pang Wei Koh, Tatsunori Hashimoto, Percy Liang",http://arxiv.org/abs/2302.11861,,https://huggingface.co/papers/2302.11861,,,,2302.11861,5,1 An Instrumental Variable Approach to Confounded Off-Policy Evaluation,"Yang Xu, Jin Zhu, Chengchun Shi, Shikai Luo, Rui Song",http://arxiv.org/abs/2212.14468,,https://huggingface.co/papers/2212.14468,,,,2212.14468,5,0 Adaptive Smoothing Gradient Learning for Spiking Neural Networks,"Ziming Wang, Runhao Jiang, Shuang Lian, Rui Yan, Huajin Tang",,,,,,,,, The Catalog Problem: Clustering and Ordering Variable-Sized Sets,"Mateusz Jurewicz, Graham Taylor, Leon Derczynski",,,,,,,,, On Second-Order Scoring Rules for Epistemic Uncertainty Quantification,"Viktor Bengs, Eyke Hüllermeier, Willem Waegeman",http://arxiv.org/abs/2301.12736,,https://huggingface.co/papers/2301.12736,,,,2301.12736,3,1 The Power of Uniform Sampling for k-Median,"Lingxiao Huang, Shaofeng Jiang, Jianing Lou",,,,,,,,, Training Normalizing Flows from Dependent Data,"Matthias Kirchler, Christoph Lippert, Marius Kloft",http://arxiv.org/abs/2209.14933,,https://huggingface.co/papers/2209.14933,,,,2209.14933,3,0 Learning to Suggest Breaks: Sustainable Optimization of Long-Term User Engagement,"Eden Saig, Nir Rosenfeld",http://arxiv.org/abs/2211.13585,,https://huggingface.co/papers/2211.13585,,,,2211.13585,2,1 A Model-free Closeness-of-influence Test for Features in Supervised Learning,"Mohammad Mehrabi, Ryan A Rossi",,,,,,,,, What Can Be Learnt With Wide Convolutional Neural Networks?,"Francesco Cagnetta, Alessandro Favero, Matthieu Wyart",http://arxiv.org/abs/2208.01003,,https://huggingface.co/papers/2208.01003,,,,2208.01003,3,0 Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity,"Eduard Gorbunov, Adrien Taylor, Samuel Horváth, Gauthier Gidel",http://arxiv.org/abs/2210.13831,,https://huggingface.co/papers/2210.13831,,,,2210.13831,4,1 Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees,"Anastasiia Koloskova, Hadrien Hendrikx, Sebastian Stich",http://arxiv.org/abs/2305.01588,,https://huggingface.co/papers/2305.01588,,,,2305.01588,3,0 Bag of Tricks for Training Data Extraction from Language Models,"Weichen Yu, Tianyu Pang, Qian Liu, Chao Du, Bingyi Kang, Yan Huang, Min Lin, Shuicheng YAN",http://arxiv.org/abs/2302.04460,https://github.com/weichen-yu/LM-Extraction,https://huggingface.co/papers/2302.04460,,,,2302.04460,8,4 Quantum 3D Graph Learning with Applications to Molecule Embedding,"Ge Yan, Huaijin Wu, Junchi Yan",,,,,,,,, Efficient Learning of Mesh-Based Physical Simulation with Bi-Stride Multi-Scale Graph Neural Network,"Yadi Cao, Menglei Chai, Minchen Li, Chenfanfu Jiang",,,,,,,,, Phase-aware Adversarial Defense for Improving Adversarial Robustness,"Dawei Zhou, Nannan Wang, Heng Yang, Xinbo Gao, Tongliang Liu",,,,,,,,, LeadFL: Client Self-Defense against Model Poisoning in Federated Learning,"Chaoyi Zhu, Stefanie Roos, Lydia Y. Chen",,,,,,,,, Subset Selection Based On Multiple Rankings in the Presence of Bias: Effectiveness of Fairness Constraints for Multiwinner Voting Score Functions,"Niclas Boehmer, L. Elisa Celis, Lingxiao Huang, Anay Mehrotra, Nisheeth K. Vishnoi",http://arxiv.org/abs/2306.09835,,https://huggingface.co/papers/2306.09835,,,,2306.09835,5,0 FedAvg Converges to Zero Training Loss Linearly for Overparameterized Multi-Layer Neural Networks,"Bingqing Song, Prashant Khanduri, xinwei zhang, Jinfeng Yi, Mingyi Hong",,,,,,,,, Linkless Link Prediction via Relational Distillation,"Zhichun Guo, William Shiao, Shichang Zhang, Yozen Liu, Nitesh Chawla, Neil Shah, Tong Zhao",http://arxiv.org/abs/2210.05801,,https://huggingface.co/papers/2210.05801,,,,2210.05801,7,0 High-Probability Bounds for Stochastic Optimization and Variational Inequalities: the Case of Unbounded Variance,"Abdurakhmon Sadiev, Marina Danilova, Eduard Gorbunov, Samuel Horváth, Gauthier Gidel, Pavel Dvurechenskii, Alexander Gasnikov, Peter Richtarik",http://arxiv.org/abs/2302.00999,,https://huggingface.co/papers/2302.00999,,,,2302.00999,8,0 Neural Status Registers,"Lukas Faber, Roger Wattenhofer",http://arxiv.org/abs/2004.07085,,https://huggingface.co/papers/2004.07085,,,,2004.07085,2,0 Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning,"Antonio Sclocchi, Mario Geiger, Matthieu Wyart",http://arxiv.org/abs/2301.13703,,https://huggingface.co/papers/2301.13703,,,,2301.13703,3,1 Importance Weighted Variational Bayes for Protein Sequence Design,"Zhenqiao Song, Lei Li",,,,,,,,, Transformers as Algorithms: Generalization and Stability in In-context Learning,"Yingcong Li, Muhammed Ildiz, Dimitris Papailiopoulos, Samet Oymak",http://arxiv.org/abs/2301.07067,,https://huggingface.co/papers/2301.07067,,,,2301.07067,4,1 Towards Quantum Machine Learning for Constrained Combinatorial Optimization: a Quantum QAP Solver,"Xinyu Ye, Ge Yan, Junchi Yan",,,,,,,,, A Picture of the Space of Typical Learnable Tasks,"Rahul Ramesh, Jialin Mao, Itay Griniasty, Rubing Yang, Han Kheng Teoh, Mark Transtrum, James Sethna, Pratik Chaudhari",http://arxiv.org/abs/2210.17011,,https://huggingface.co/papers/2210.17011,,,,2210.17011,8,0 Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time,"Toon Vanderschueren, Alicia Curth, Wouter Verbeke, Mihaela van der Schaar",http://arxiv.org/abs/2306.04255,,https://huggingface.co/papers/2306.04255,,,,2306.04255,4,0 AudioLDM: Text-to-Audio Generation with Latent Diffusion Models,"Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley",http://arxiv.org/abs/2301.12503,,https://huggingface.co/papers/2301.12503,,,,2301.12503,8,1 Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature,"Khang Nguyen, Nong Hieu, Vinh NGUYEN, Nhat Ho, Stanley Osher, TAN NGUYEN",http://arxiv.org/abs/2211.15779,,https://huggingface.co/papers/2211.15779,,,,2211.15779,6,1 Lifelong Language Pretraining with Distribution-Specialized Experts,"Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cui",http://arxiv.org/abs/2305.12281,,https://huggingface.co/papers/2305.12281,,,,2305.12281,7,1 Delay-agnostic Asynchronous Coordinate Update Algorithm,"Xuyang Wu, Changxin Liu, Sindri Magnússon, Mikael Johansson",http://arxiv.org/abs/2305.08535,,https://huggingface.co/papers/2305.08535,,,,2305.08535,4,1 Prototype-oriented unsupervised anomaly detection for multivariate time series,"yuxin li, Wenchao Chen, Bo Chen, Dongsheng Wang, Long Tian, Mingyuan Zhou",,,,,,,,, ClimaX: A foundation model for weather and climate,"Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover",http://arxiv.org/abs/2301.10343,,https://huggingface.co/papers/2301.10343,,,,2301.10343,5,1 One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale,"Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, Jun Zhu",http://arxiv.org/abs/2303.06555,,https://huggingface.co/papers/2303.06555,,,,2303.06555,10,0 Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference,"Yan Xu, Deqian Kong, Dehong Xu, Ziwei Ji, Bo Pang, Pascale FUNG, Ying Nian Wu",http://arxiv.org/abs/2306.01153,,https://huggingface.co/papers/2306.01153,,,,2306.01153,7,1 Sequential Multi-Dimensional Self-Supervised Learning for Clinical Time Series,"Aniruddh Raghu, Payal Chandak, Ridwan Alam, John Guttag, Collin Stultz",,,,,,,,, Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments,"Daniel Jarrett, Corentin Tallec, Florent Altché, Thomas Mesnard, Remi Munos, Michal Valko",,,,,,,,, Go Beyond Imagination: Maximizing Episodic Reachability with World Models,"Yao Fu, Run Peng, Honglak Lee",,,,,,,,, Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models,"Guanhua Zhang, Jiabao Ji, Yang Zhang, Mo Yu, Tommi Jaakkola, Shiyu Chang",http://arxiv.org/abs/2304.03322,https://github.com/UCSB-NLP-Chang/CoPaint,https://huggingface.co/papers/2304.03322,,,,2304.03322,6,2 Chemically Transferable Generative Backmapping of Coarse-Grained Proteins,"Soojung Yang, Rafael Gomez-Bombarelli",http://arxiv.org/abs/2303.01569,,https://huggingface.co/papers/2303.01569,,,,2303.01569,2,0 MonoFlow: Rethinking Divergence GANs via the Perspective of Differential Equations,"Mingxuan Yi, Zhanxing Zhu, Song Liu",,,,,,,,, Propensity Matters: Measuring and Enhancing Balancing for Recommendation,"Haoxuan Li, Yanghao Xiao, Chunyuan Zheng, Peng Wu, Peng Cui",,,,,,,,, Directed Chain Generative Adversarial Networks,"Ming Min, Ruimeng Hu, Tomoyuki Ichiba",http://arxiv.org/abs/2304.13131,,https://huggingface.co/papers/2304.13131,,,,2304.13131,3,0 Nearly-Linear Time and Streaming Algorithms for Outlier-Robust PCA,"Ilias Diakonikolas, Daniel Kane, Ankit Pensia, Thanasis Pittas",http://arxiv.org/abs/2305.02544,,https://huggingface.co/papers/2305.02544,,,,2305.02544,4,0 Under-Counted Tensor Completion with Neural Incorporation of Attributes,"Shahana Ibrahim, Xiao Fu, Rebecca Hutchinson, Eugene Seo",http://arxiv.org/abs/2306.03273,,https://huggingface.co/papers/2306.03273,,,,2306.03273,4,0 Multi-class Graph Clustering via Approximated Effective $p$-Resistance,"Shota Saito, Mark Herbster",,,,,,,,, Last Switch Dependent Bandits with Monotone Payoff Functions,"Ayoub Foussoul, Vineet Goyal, Orestis Papadigenopoulos, Assaf Zeevi",http://arxiv.org/abs/2306.00338,,https://huggingface.co/papers/2306.00338,,,,2306.00338,4,0 Meta-learning Parameterized Skills,"Haotian Fu, Shangqun Yu, Saket Tiwari, Michael L. Littman, George Konidaris",http://arxiv.org/abs/2206.03597,,https://huggingface.co/papers/2206.03597,,,,2206.03597,5,1 Leveraging Label Non-Uniformity for Node Classification in Graph Neural Networks,"Feng Ji, See Hian Lee, Meng HanYang, Kai Zhao, Wee Peng Tay, Jielong Yang",http://arxiv.org/abs/2305.00139,,https://huggingface.co/papers/2305.00139,,,,2305.00139,6,0 Tuning Computer Vision Models With Task Rewards,"André Susano Pinto, Alexander Kolesnikov, Yuge Shi, Lucas Beyer, Xiaohua Zhai",http://arxiv.org/abs/2302.08242,,https://huggingface.co/papers/2302.08242,,,,2302.08242,5,1 State and parameter learning with PARIS particle Gibbs,"Gabriel Cardoso, Yazid Janati el idrissi, Sylvain Le Corff, Eric Moulines, Jimmy Olsson",http://arxiv.org/abs/2301.00900,,https://huggingface.co/papers/2301.00900,,,,2301.00900,5,0 An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning,"Jaesik Yoon, Yi-Fu Wu, Heechul Bae, Sungjin Ahn",http://arxiv.org/abs/2302.04419,,https://huggingface.co/papers/2302.04419,,,,2302.04419,4,0 Nested Elimination: A Simple Algorithm for Best-Item Identification From Choice-Based Feedback,"Junwen Yang, Yifan Feng",,,,,,,,, The Edge of Orthogonality: A Simple View of What Makes BYOL Tick,"Pierre Richemond, Allison Tam, Yunhao Tang, Florian Strub, Bilal Piot, Feilx Hill",http://arxiv.org/abs/2302.04817,,https://huggingface.co/papers/2302.04817,,,,2302.04817,6,0 Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows,"Chao Du, Tianbo Li, Tianyu Pang, Shuicheng YAN, Min Lin",http://arxiv.org/abs/2305.02164,,https://huggingface.co/papers/2305.02164,,,,2305.02164,5,3 Geometric Clifford Algebra Networks,"David Ruhe, Jayesh K. Gupta, Steven De Keninck, Max Welling, Johannes Brandstetter",http://arxiv.org/abs/2302.06594,,https://huggingface.co/papers/2302.06594,,,,2302.06594,5,1 Federated Linear Contextual Bandits with User-level Differential Privacy,"Ruiquan Huang, Huanyu Zhang, Meisam Hejazinia, Luca Melis, Milan Shen, Jing Yang",http://arxiv.org/abs/2306.05275,,https://huggingface.co/papers/2306.05275,,,,2306.05275,6,0 ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs,"Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy",http://arxiv.org/abs/2302.01275,,https://huggingface.co/papers/2302.01275,,,,2302.01275,6,0 Extrapolated Random Tree for Regression,"Yuchao Cai, Yuheng Ma, Yiwei Dong, Hanfang Yang",,,,,,,,, "Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues","Morgane Goibert, Clément Calauzènes, Ekhine IRUROZKI, Stephan Clemencon",http://arxiv.org/abs/2303.12878,,https://huggingface.co/papers/2303.12878,,,,2303.12878,4,1 Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic,"Terufumi Morishita, Gaku Morio, Atsuki Yamaguchi, Yasuhiro Sogawa",,,,,,,,, TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation,"Zhaoyan Liu, Noël Vouitsis, Satya Krishna Gorti, Jimmy Ba, Gabriel Loaiza-Ganem",,,,,,,,, Algorithmic Stability of Heavy-Tailed SGD with General Loss Functions,"Anant Raj, Lingjiong Zhu, Mert Gurbuzbalaban, Umut Simsekli",http://arxiv.org/abs/2301.11885,,https://huggingface.co/papers/2301.11885,,,,2301.11885,4,0 Continual Task Allocation in Meta-Policy Network via Sparse Prompting,"Yijun Yang, Tianyi Zhou, Jing Jiang, Guodong Long, Yuhui Shi",http://arxiv.org/abs/2305.18444,,https://huggingface.co/papers/2305.18444,,,,2305.18444,5,1 Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data,"Hien Dang, Tho Tran Huu, TAN NGUYEN, Stanley Osher, Hung Tran-The, Nhat Ho",http://arxiv.org/abs/2301.00437,,https://huggingface.co/papers/2301.00437,,,,2301.00437,6,0 Causal Bounds in Quasi-Markovian Graphs,"Madhumitha Shridharan, Garud Iyengar",,,,,,,,, "On Over-Squashing in Message Passing Neural Networks: The Impact of Width, Depth, and Topology","Francesco Di Giovanni, Lorenzo Giusti, Federico Barbero, Giulia Luise, Pietro Lió, Michael Bronstein",http://arxiv.org/abs/2302.02941,,https://huggingface.co/papers/2302.02941,,,,2302.02941,6,0 Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning,"Nader Asadi, MohammadReza Davari, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky",http://arxiv.org/abs/2303.14771,,https://huggingface.co/papers/2303.14771,,,,2303.14771,5,0 High Fidelity Image Counterfactuals with Probabilistic Causal Models,"Fabio De Sousa Ribeiro, Tian Xia, Miguel Monteiro, Nick Pawlowski, Ben Glocker",,,,,,,,, Learn to Accumulate Evidence from All Training Samples: Theory and Practice,"Deep Pandey, Qi Yu",,,,,,,,, Quantitative Universal Approximation Bounds for Deep Belief Networks,"Julian Sieber, Johann Gehringer",http://arxiv.org/abs/2208.09033,,https://huggingface.co/papers/2208.09033,,,,2208.09033,2,0 Generalization Bounds using Data-Dependent Fractal Dimensions,"Benjamin Dupuis, George Deligiannidis, Umut Simsekli",,,,,,,,, Evidential Interactive Learning for Medical Image Captioning,"Ervine Zheng, Qi Yu",,,,,,,,, Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data,"Yonggui Yan, Jie Chen, Pin-Yu Chen, Xiaodong Cui, Songtao Lu, Yangyang Xu",http://arxiv.org/abs/2302.14252,,https://huggingface.co/papers/2302.14252,,,,2302.14252,6,0 A Reinforcement Learning Framework for Dynamic Mediation Analysis,"Lin Ge, Jitao Wang, Chengchun Shi, Zhenke Wu, Rui Song",http://arxiv.org/abs/2301.13348,,https://huggingface.co/papers/2301.13348,,,,2301.13348,5,0 Beyond Lipschitz Smoothness: A Tighter Analysis for Nonconvex Optimization,"Zhengmian Hu, Xidong Wu, Heng Huang",,,,,,,,, Fractional Denoising for 3D Molecular Pre-training,"Shikun Feng, Yuyan Ni, Yanyan Lan, Zhiming Ma, Wei-Ying Ma",,,,,,,,, Effective and Efficient Structural Inference with Reservoir Computing,"Aoran Wang, Tsz Pan Tong, Jun Pang",,,,,,,,, Text-To-4D Dynamic Scene Generation,"Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman",http://arxiv.org/abs/2301.11280,,https://huggingface.co/papers/2301.11280,,,,2301.11280,11,2 Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning,"Evan Liu, Sahaana Suri, Tong Mu, Allan Zhou, Chelsea Finn",http://arxiv.org/abs/2306.08400,,https://huggingface.co/papers/2306.08400,,,,2306.08400,5,0 Vector Quantized Wasserstein Auto-Encoder,"Tung-Long Vuong, Trung Le, He Zhao, Chuanxia Zheng, Mehrtash Harandi, Jianfei Cai, Dinh Phung",http://arxiv.org/abs/2302.05917,,https://huggingface.co/papers/2302.05917,,,,2302.05917,7,0 MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation,"Xingang Peng, Jiaqi Guan, qiang liu, Jianzhu Ma",http://arxiv.org/abs/2305.07508,,https://huggingface.co/papers/2305.07508,,,,2305.07508,4,0 Multi-Fidelity Covariance Estimation in the Log-Euclidean Geometry,"Aimee Maurais, Terrence Alsup, Benjamin Peherstorfer, Youssef Marzouk",http://arxiv.org/abs/2301.13749,,https://huggingface.co/papers/2301.13749,,,,2301.13749,4,0 Active Learning based Structural Inference,"Aoran Wang, Jun Pang",,,,,,,,, Multi-Objective Population Based Training,"Arkadiy Dushatskiy, Alexander Chebykin, Tanja Alderliesten, Peter A.N Bosman",http://arxiv.org/abs/2306.01436,,https://huggingface.co/papers/2306.01436,,,,2306.01436,4,1 Contextual Conservative Interleaving Bandits,Kei Takemura,,,,,,,,, Unsupervised Skill Discovery for Learning Shared Structures across Changing Environments,"Sang-Hyun Lee, Seung-Woo Seo",,,,,,,,, MANSA: Learning Fast and Slow in Multi-Agent Systems,"David Mguni, Taher Jafferjee, Haojun Chen, Jianhong Wang, LONG FEI, Xidong Feng, Stephen Mcaleer, Feifei Tong, Jun Wang, Yaodong Yang",http://arxiv.org/abs/2302.05910,,https://huggingface.co/papers/2302.05910,,,,2302.05910,10,0 Team Belief DAG: Generalizing the Sequence Form to Team Games for Fast Computation of Correlated Team Max-Min Equilibria via Regret Minimization,"Brian Zhang, Gabriele Farina, Tuomas Sandholm",,,,,,,,, Kernel Sufficient Dimension Reduction and Variable Selection for Compositional Data via Amalgamation,"Junyoung Park, Jeongyoun Ahn, Cheolwoo Park",,,,,,,,, Data Poisoning Attacks Against Multimodal Encoders,"Ziqing Yang, Xinlei He, Zheng Li, Michael Backes, Mathias Humbert, Pascal Berrang, Yang Zhang",http://arxiv.org/abs/2209.15266,,https://huggingface.co/papers/2209.15266,,,,2209.15266,7,0 FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation,"Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",,,,,,,,, Certified Robust Neural Networks: Generalization and Corruption Resistance,"Amine Bennouna, Ryan Lucas, Bart Van Parys",http://arxiv.org/abs/2303.02251,https://github.com/RyanLucas3/HR_Neural_Networks,https://huggingface.co/papers/2303.02251,,,,2303.02251,3,1 "Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective","Michael Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel",,,,,,,,, Anti-Exploration by Random Network Distillation,"Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov",http://arxiv.org/abs/2301.13616,,https://huggingface.co/papers/2301.13616,,,,2301.13616,4,2 Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes,"Liam Hodgkinson, Chris van der Heide, Fred Roosta, Michael Mahoney",http://arxiv.org/abs/2210.07612,,https://huggingface.co/papers/2210.07612,,,,2210.07612,4,1 Sampling-Based Accuracy Testing of Posterior Estimators for General Inference,"Pablo Lemos, Adam Coogan, Laurence Perreault-Levasseur, Yashar Hezaveh",http://arxiv.org/abs/2302.03026,,https://huggingface.co/papers/2302.03026,,,,2302.03026,4,1 Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models,"Parth Sangani, Arjun Kashettiwar, Pritish Chakraborty, Bhuvan Gangula, Sivasubramanian Durga, Ganesh Ramakrishnan, Rishabh Iyer, Abir De",,,,,,,,, A General Theory for Federated Optimization with Asynchronous and Heterogeneous Clients Updates,"Yann Fraboni, Richard Vidal, Laetitia Kameni, Marco Lorenzi",http://arxiv.org/abs/2206.10189,,https://huggingface.co/papers/2206.10189,,,,2206.10189,4,0 Principled Offline RL in the Presence of Rich Exogenous Information,"Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm Seijen, Remi Tachet des Combes, John Langford",,,,,,,,, Reflected Diffusion Models,"Aaron Lou, Stefano Ermon",http://arxiv.org/abs/2304.04740,,https://huggingface.co/papers/2304.04740,,,,2304.04740,2,1 Speeding Up Bellman Ford via Minimum Violation Permutations,"Silvio Lattanzi, Ola Svensson, Sergei Vassilvitskii",,,,,,,,, STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition,"Yucheng Lu, Shivani Agrawal, Suvinay Subramanian, Oleg Rybakov, Chris De Sa, Amir Yazdanbakhsh",http://arxiv.org/abs/2302.01172,,https://huggingface.co/papers/2302.01172,,,,2302.01172,6,1 The Regret of Exploration and the Control of Bad Episodes in Reinforcement Learning,"Bruno Gaujal, Victor Boone",,,,,,,,, Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks,"Yiwei Lu, Gautam Kamath, Yaoliang Yu",http://arxiv.org/abs/2303.03592,,https://huggingface.co/papers/2303.03592,,,,2303.03592,3,0 GuardHFL: Privacy Guardian for Heterogeneous Federated Learning,"Hanxiao Chen, Meng Hao, Hongwei Li, Kangjie Chen, Guowen Xu, Tianwei Zhang, Xilin Zhang",,,,,,,,, Performative Reinforcement Learning,"Debmalya Mandal, Stelios Triantafyllou, Goran Radanovic",http://arxiv.org/abs/2207.00046,,https://huggingface.co/papers/2207.00046,,,,2207.00046,3,0 Collaborative Causal Inference with Fair Incentives,"Rui Qiao, Xinyi Xu, Bryan Kian Hsiang Low",,,,,,,,, Communication-Constrained Bandits under Additive Gaussian Noise,"Prathamesh Mayekar, Jonathan Scarlett, Vincent Tan",http://arxiv.org/abs/2304.12680,,https://huggingface.co/papers/2304.12680,,,,2304.12680,3,0 Discover-Then-Rank Unlabeled Support Vectors in the Dual Space for Multi-Class Active Learning,"Dayou Yu, Weishi Shi, Qi Yu",,,,,,,,, Expectation-Complete Graph Representations with Homomorphisms,"Pascal Welke, Maximilian Thiessen, Fabian Jogl, Thomas Gärtner",http://arxiv.org/abs/2306.05838,,https://huggingface.co/papers/2306.05838,,,,2306.05838,4,1 Constrained Phi-Equilibria,"Matteo Castiglioni, Martino Bernasconi, Alberto Marchesi, Francesco Trovò, Nicola Gatti",http://arxiv.org/abs/2301.13600,,https://huggingface.co/papers/2301.13600,,,,2301.13600,5,0 Paging with Succinct Predictions,"Antonios Antoniadis, Joan Boyar, Marek Elias, Lene M Favrholdt, Ruben Hoeksma, Kim S. Larsen, Adam Polak, Bertrand Simon",http://arxiv.org/abs/2210.02775,,https://huggingface.co/papers/2210.02775,,,,2210.02775,8,0 What do CNNs Learn in the First Layer and Why? A Linear Systems Perspective,"Rhea Chowers, Yair Weiss",http://arxiv.org/abs/2206.02454,,https://huggingface.co/papers/2206.02454,,,,2206.02454,2,1 Streaming Submodular Maximization with Differential Privacy,"Anamay Chaturvedi, Huy Nguyen, Thy Nguyen",http://arxiv.org/abs/2210.14315,,https://huggingface.co/papers/2210.14315,,,,2210.14315,3,0 Optimization for Amortized Inverse Problems,"Tianci Liu, Tong Yang, Quan Zhang, Qi Lei",http://arxiv.org/abs/2210.13983,,https://huggingface.co/papers/2210.13983,,,,2210.13983,4,0 UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers,"Dachuan Shi, Chaofan Tao, Ying Jin, Zhendong Yang, Chun Yuan, Jiaqi Wang",http://arxiv.org/abs/2301.13741,https://github.com/sdc17/UPop,https://huggingface.co/papers/2301.13741,,,,2301.13741,6,0 On the Within-Group Fairness of Screening Classifiers,"Nastaran Okati, Stratis Tsirtsis, Manuel Gomez-Rodriguez",,,,,,,,, DiscoBAX - Discovery of optimal intervention sets in genomic experiment design,"Clare Lyle, Arash Mehrjou, Pascal Notin, Andrew Jesson, Stefan Bauer, Yarin Gal, Patrick Schwab",,,,,,,,, "A Fast, Well-Founded Approximation to the Empirical Neural Tangent Kernel","Mohamad Amin Mohamadi, Won Bae, Danica J Sutherland",http://arxiv.org/abs/2206.12543,,https://huggingface.co/papers/2206.12543,,,,2206.12543,3,0 Deterministic equivalent and error universality of deep random features learning,"Dominik Schröder, Hugo Cui, Daniil Dmitriev, Bruno Loureiro",http://arxiv.org/abs/2302.00401,,https://huggingface.co/papers/2302.00401,,,,2302.00401,4,1 Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation,"Yeonsung Jung, Hajin Shim, June Yong Yang, Eunho Yang",http://arxiv.org/abs/2112.01021,,https://huggingface.co/papers/2112.01021,,,,2112.01021,4,0 PaLM-E: An Embodied Multimodal Language Model,"Danny Driess, Pete Florence, Klaus Greff, Marc Toussaint, Igor Mordatch, Andy Zeng, Vincent Vanhoucke, Mehdi S. M. Sajjadi, Corey Lynch, Ayzaan Wahid, brian ichter, Fei Xia, Pierre Sermanet, Yevgen Chebotar, Jonathan Tompson, Wenlong Huang, Sergey Levine, Tianhe (Kevin) Yu, Karol Hausman, Quan Vuong, Aakanksha Chowdhery, Daniel Duckworth",,,,,,,,, Near-Optimal Cryptographic Hardness of Agnostically Learning Halfspaces and ReLU Regression under Gaussian Marginals,"Ilias Diakonikolas, Daniel Kane, Lisheng Ren",http://arxiv.org/abs/2302.06512,,https://huggingface.co/papers/2302.06512,,,,2302.06512,3,0 End-to-end Differentiable Clustering with Associative Memories,"Bishwajit Saha, Dmitry Krotov, Mohammed Zaki, Parikshit Ram",http://arxiv.org/abs/2306.03209,,https://huggingface.co/papers/2306.03209,,,,2306.03209,4,0 Effective Structured Prompting by Meta-Learning and Representative Verbalizer,"Weisen Jiang, Yu Zhang, James Kwok",http://arxiv.org/abs/2306.00618,,https://huggingface.co/papers/2306.00618,,,,2306.00618,3,0 Monotonic Location Attention for Length Generalization,"Jishnu Ray Chowdhury, Cornelia Caragea",http://arxiv.org/abs/2305.20019,,https://huggingface.co/papers/2305.20019,,,,2305.20019,2,1 Out-of-Distribution Generalization of Federated Learning via Implicit Invariant Relationships,"Yaming Guo, Kai Guo, Xiaofeng Cao, Tieru Wu, Yi Chang",,,,,,,,, NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition,"Xinquan Huang, Wenlei Shi, Qi Meng, Yue Wang, Xiaotian Gao, Jia Zhang, Tie-Yan Liu",http://arxiv.org/abs/2302.10255,,https://huggingface.co/papers/2302.10255,,,,2302.10255,7,0 Input Perturbation Reduces Exposure Bias in Diffusion Models,"Mang Ning, Enver Sangineto, Angelo Porrello, Simone Calderara, Rita Cucchiara",http://arxiv.org/abs/2301.11706,https://github.com/forever208/DDPM-IP,https://huggingface.co/papers/2301.11706,,,,2301.11706,5,1 Flash: Concept Drift Adaptation in Federated Learning,"Kunjal Panchal, Sunav Choudhary, Subrata Mitra, Koyel Mukherjee, Somdeb Sarkhel, Saayan Mitra, Hui Guan",,,,,,,,, Conformal Prediction Sets for Graph Neural Networks,"Soroush H. Zargarbashi, Simone Antonelli, Aleksandar Bojchevski",,,,,,,,, Probabilistic Attention-to-Influence Neural Models for Event Sequences,"Xiao Shou, DEBARUN BHATTACHARJYA, Tian Gao, Dharmashankar Subramanian, Oktie Hassanzadeh, Kristin Bennett",,,,,,,,, Nearly-tight Bounds for Deep Kernel Learning,"Yi-Fan Zhang, Min-Ling Zhang",,,,,,,,, Generalized Disparate Impact for Configurable Fairness Solutions in ML,"Luca Giuliani, Eleonora Misino, Michele Lombardi",http://arxiv.org/abs/2305.18504,,https://huggingface.co/papers/2305.18504,,,,2305.18504,3,1 Thompson Sampling with Less Exploration is Fast and Optimal,"Tianyuan Jin, XIANGLIN YANG, Xiaokui Xiao, Pan Xu",,,,,,,,, Do Machine Learning Models Learn Statistical Rules Inferred from Data?,"Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong",http://arxiv.org/abs/2303.01433,https://github.com/DebugML/sqrl,https://huggingface.co/papers/2303.01433,,,,2303.01433,4,1 Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations,"Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao",,,,,,,,, Hypothesis Transfer Learning with Surrogate Classification Losses: Generalization Bounds through Algorithmic Stability,"Anass Aghbalou, Guillaume Staerman",,,,,,,,, Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows,"Seobin Park, Dongjin Kim, Sungyong Baik, Tae Hyun Kim",,,,,,,,, Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction,"Goergii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets",http://arxiv.org/abs/2202.00441,,https://huggingface.co/papers/2202.00441,,,,2202.00441,6,0 In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Boudiaf, tom denton, Bart van Merrienboer, Vincent Dumoulin, Eleni Triantafillou",http://arxiv.org/abs/2302.06658,,https://huggingface.co/papers/2302.06658,,,,2302.06658,5,1 GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,2 Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,, Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0 SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1 Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,, Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1 A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,2 Trompt: Towards a Better Deep Neural Network for Tabular Data,"Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Ting-Wei Chen, Tien-Hao Chang",http://arxiv.org/abs/2305.18446,,https://huggingface.co/papers/2305.18446,,,,2305.18446,5,0 Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits,"Yunlong Hou, Vincent Tan, Zixin Zhong",http://arxiv.org/abs/2301.13393,,https://huggingface.co/papers/2301.13393,,,,2301.13393,3,0 Online Restless Bandits with Unobserved States,"BOWEN JIANG, Bo Jiang, Jian Li, TAO LIN, Xinbing Wang, Chenghu Zhou",,,,,,,,, Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization,"Yimeng Chen, Tianyang Hu, Fengwei Zhou, Zhiming Ma, Zhenguo Li",http://arxiv.org/abs/2306.02595,,https://huggingface.co/papers/2306.02595,,,,2306.02595,5,0 The case for 4-bit precision: k-bit Inference Scaling Laws,"Tim Dettmers, Luke Zettlemoyer",,,,,,,,, Network Effects in Performative Prediction Games,"Xiaolu Wang, Chung-Yiu Yau, Hoi To Wai",,,,,,,,, Uncertainty Estimation by Fisher Information-based Evidential Deep Learning,"Danruo Deng, Guangyong Chen, Yang YU, Furui Liu, Pheng Ann Heng",http://arxiv.org/abs/2303.02045,,https://huggingface.co/papers/2303.02045,,,,2303.02045,5,0 Robust and Scalable Bayesian Online Changepoint Detection,"Matias Altamirano, Francois-Xavier Briol, Jeremias Knoblauch",http://arxiv.org/abs/2302.04759,,https://huggingface.co/papers/2302.04759,,,,2302.04759,3,0 Covariate balancing using the integral probability metric for causal inference,"Insung Kong, Yuha Park, Joonhyuk Jung, Kwonsang Lee, Yongdai Kim",http://arxiv.org/abs/2305.13715,,https://huggingface.co/papers/2305.13715,,,,2305.13715,5,0 Provable Benefit of Mixup for Finding Optimal Decision Boundaries,"Junsoo Oh, Chulhee Yun",http://arxiv.org/abs/2306.00267,,https://huggingface.co/papers/2306.00267,,,,2306.00267,2,0 Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts,"Dirk van der Hoeven, Ciara Pike-Burke, Hao Qiu, Nicolò Cesa-Bianchi",,,,,,,,, Learning the Right Layers a Data-Driven Layer-Aggregation Strategy for Semi-Supervised Learning on Multilayer Graphs,"Sara Venturini, Andrea Cristofari, Francesco Rinaldi, Francesco Tudisco",,,,,,,,, Towards Stable and Efficient Adversarial Training against $l_1$ Bounded Adversarial Attacks,"Yulun Jiang, Chen Liu, Zhichao Huang, Mathieu Salzmann, Sabine Süsstrunk",,,,,,,,, Recovery Bounds on Class-Based Optimal Transport: A Sum-of-Norms Regularization Framework,"Arman Rahbar, Ashkan Panahi, Morteza Haghir Chehreghani, Devdatt Dubhashi, Hamid Krim",http://arxiv.org/abs/1903.03850,,https://huggingface.co/papers/1903.03850,,,,1903.03850,5,0 Robust Explanation for Free or At the Cost of Faithfulness,"Zeren Tan, Yang Tian",,,,,,,,, Adversarial Learning of Distributional Reinforcement Learning,"Yang Sui, Yukun Huang, Hongtu Zhu, Fan Zhou",,,,,,,,, Reinforcement Learning Can Be More Efficient with Multiple Rewards,"Christoph Dann, Yishay Mansour, Mehryar Mohri",,,,,,,,, On Generalizations of Some Distance Based Classifiers for HDLSS Data,"Sarbojit Roy, Soham Sarkar, Subhajit Dutta, Anil K. Ghosh",http://arxiv.org/abs/1902.03295,,https://huggingface.co/papers/1902.03295,,,,1902.03295,4,0 A Category-theoretical Meta-analysis of Definitions of Disentanglement,"Yivan Zhang, Masashi Sugiyama",http://arxiv.org/abs/2305.06886,,https://huggingface.co/papers/2305.06886,,,,2305.06886,2,0 When does Privileged information Explain Away Label Noise?,"Guillermo Ortiz Jimenez, Mark Collier, Anant Nawalgaria, Alexander D'Amour, Jesse Berent, Rodolphe Jenatton, Efi Kokiopoulou",http://arxiv.org/abs/2303.01806,,https://huggingface.co/papers/2303.01806,,,,2303.01806,7,0 On the Initialization of Graph Neural Networks,"Jiahang Li, Yakun Song, Xiang song, David Wipf",,,,,,,,, Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption,"Vasilii Feofanov, Malik TIOMOKO, Aladin Virmaux",,,,,,,,, Margin-based sampling in high dimensions: When being active is less efficient than staying passive,"Alexandru Tifrea, Jacob Clarysse, Fanny Yang",http://arxiv.org/abs/2212.00772,,https://huggingface.co/papers/2212.00772,,,,2212.00772,3,0 Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources,"Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang",http://arxiv.org/abs/2306.08364,,https://huggingface.co/papers/2306.08364,,,,2306.08364,4,0 InGram: Inductive Knowledge Graph Embedding via Relation Graphs,"Jaejun Lee, Chanyoung Chung, Joyce Whang",http://arxiv.org/abs/2305.19987,,https://huggingface.co/papers/2305.19987,,,,2305.19987,3,0 Nearly Optimal Competitive Ratio for Online Allocation Problems with Two-sided Resource Constraints and Finite Requests,"Qixin Zhang, Wenbing Ye, Zaiyi Chen, Haoyuan Hu, Enhong Chen, Yu Yang",,,,,,,,, Differentially Private Distributed Bayesian Linear Regression with MCMC,"Barış Alparslan, Sinan Yıldırım, Ilker Birbil",http://arxiv.org/abs/2301.13778,,https://huggingface.co/papers/2301.13778,,,,2301.13778,3,0 Bilevel Optimization with Coupled Decision-dependent Distributions,Songtao Lu,,,,,,,,, On Heterogeneous Treatment Effects in Heterogeneous Causal Graphs,"Richard Watson, Hengrui Cai, Xinming An, Samuel McLean, Rui Song",http://arxiv.org/abs/2301.12383,,https://huggingface.co/papers/2301.12383,,,,2301.12383,5,1 Near-Optimal Quantum Coreset Construction Algorithms for Clustering,"Yecheng Xue, Xiaoyu Chen, Tongyang Li, Shaofeng Jiang",http://arxiv.org/abs/2306.02826,,https://huggingface.co/papers/2306.02826,,,,2306.02826,4,0 Rethinking Weak Supervision in Helping Contrastive Representation Learning,"Jingyi Cui, Weiran Huang, Yifei Wang, Yisen Wang",,,,,,,,, "Neuro-Symbolic Continual Learning: Knowledge, Reasoning Shortcuts and Concept Rehearsal","Emanuele Marconato, Gianpaolo Bontempo, ELISA FICARRA, Simone Calderara, Andrea Passerini, Stefano Teso",,,,,,,,, Robust Situational Reinforcement Learning in Face of Context Disturbances,"Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian",,,,,,,,, Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation,"Shengcao Cao, Mengtian Li, James Hays, Deva Ramanan, Yu-Xiong Wang, Liangyan Gui",,,,,,,,, A Critical View of Vision-Based Long-Term Dynamics Prediction Under Environment Misalignment,"Hanchen Xie, Jiageng Zhu, Mahyar Khayatkhoei, Jiazhi Li, Mohamed Hussein, Wael AbdAlmageed",http://arxiv.org/abs/2305.07648,,https://huggingface.co/papers/2305.07648,,,,2305.07648,6,0 Feature Programming for Multivariate Time Series Prediction,"Alex Reneau, Jerry Yao-Chieh Hu, Ammar Gilani, Han Liu",http://arxiv.org/abs/2306.06252,,https://huggingface.co/papers/2306.06252,,,,2306.06252,6,0 Neural signature kernels as infinite-width-depth-limits of controlled ResNets,"Nicola Muca Cirone, Maud Lemercier, Cristopher Salvi",http://arxiv.org/abs/2303.17671,,https://huggingface.co/papers/2303.17671,,,,2303.17671,3,0 Distribution Free Domain Generalization,"Peifeng Tong, Wu Su, He Li, Jialin Ding, zhan haoxiang, Song Chen",,,,,,,,, Coin Sampling: Gradient-Based Bayesian Inference without Learning Rates,"Louis Sharrock, Chris Nemeth",http://arxiv.org/abs/2301.11294,,https://huggingface.co/papers/2301.11294,,,,2301.11294,2,1 Naive imputation implicitly regularizes high-dimensional linear models,"Alexis Ayme, Claire Boyer, Aymeric Dieuleveut, Erwan Scornet",http://arxiv.org/abs/2301.13585,,https://huggingface.co/papers/2301.13585,,,,2301.13585,4,0 Explaining Reinforcement Learning with Shapley Values,"Daniel Beechey, Thomas M. S. Smith, Ozgur Simsek",http://arxiv.org/abs/2306.05810,,https://huggingface.co/papers/2306.05810,,,,2306.05810,3,0 Fisher Information Embedding for Node and Graph Learning,"Dexiong Chen, Paolo Pellizzoni, Karsten Borgwardt",http://arxiv.org/abs/2305.07580,https://github.com/BorgwardtLab/fisher_information_embedding,https://huggingface.co/papers/2305.07580,,,,2305.07580,3,1 Shortest Edit Path Crossover: A Theory-driven Solution to the Permutation Problem in Evolutionary Neural Architecture Search,"Xin Qiu, Risto Miikkulainen",http://arxiv.org/abs/2210.14016,,https://huggingface.co/papers/2210.14016,,,,2210.14016,2,1 Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning,"Nicolas Castanet, Olivier Sigaud, sylvain lamprier",http://arxiv.org/abs/2206.06719,,https://huggingface.co/papers/2206.06719,,,,2206.06719,3,1 A Unifying Framework to the Analysis of Interaction Methods using Synergy Functions,"Daniel Lundstrom, Meisam Razaviyayn",,,,,,,,, Predicting Ordinary Differential Equations with Transformers,"Sören Becker, Michal Klein, Alexander Neitz, Giambattista Parascandolo, Niki Kilbertus",,,,,,,,, SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models,"Shenghua Wan, Yucen Wang, Minghao Shao, Ruying Chen, De-Chuan Zhan",,,,,,,,, Optimal Convergence Rates for Agnostic Nystrom Kernel Learning,"Jian Li, Yong Liu, Weiping Wang",,,,,,,,, Forward-Backward Gaussian Variational Inference via JKO in the Bures-Wasserstein Space,"Michael Diao, Krishna Balasubramanian, Sinho Chewi, Adil Salim",http://arxiv.org/abs/2304.05398,,https://huggingface.co/papers/2304.05398,,,,2304.05398,4,0 DUET: 2D Structured and Approximately Equivariant Representations,"Xavi Suau, Federico Danieli, T. Anderson Keller, Arno Blaas, Chen Huang, Jason Ramapuram, Dan Busbridge, Luca Zappella",,,,,,,,, The Persistent Laplacian for Data Science: Evaluating Higher-Order Persistent Spectral Representations of Data,"Thomas Davies, Zhengchao Wan, Ruben Sanchez-Garcia",,,,,,,,, Towards Understanding Generalization of Macro-AUC in Multi-label Learning,"Guoqiang Wu, Chongxuan Li, Yilong Yin",http://arxiv.org/abs/2305.05248,,https://huggingface.co/papers/2305.05248,,,,2305.05248,3,0 The Acquisition of Physical Knowledge in Generative Neural Networks,"Luca M. Schulze Buschoff, Eric Schulz, Marcel Binz",,,,,,,,, Sampling random graph homomorphisms and applications to network data analysis,"Hanbaek Lyu, Facundo Memoli, David Sivakoff",http://arxiv.org/abs/1910.09483,,https://huggingface.co/papers/1910.09483,,,,1910.09483,3,0 K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs,"Andrea Coletta, Svitlana Vyetrenko, Tucker Balch",,,,,,,,, Identification of the Adversary from a Single Adversarial Example,"Minhao Cheng, Rui Min, Haochen Sun, Pin-Yu Chen",,,,,,,,, Unveiling the Latent Space Geometry of Push-Forward Generative Models,"Thibaut Issenhuth, Ugo Tanielian, Jeremie Mary, David Picard",http://arxiv.org/abs/2207.10541,,https://huggingface.co/papers/2207.10541,,,,2207.10541,4,1 Investigating the Role of Model-Based Learning in Exploration and Transfer,"Jacob C Walker, Eszter Vértes, Yazhe Li, Gabriel Dulac-Arnold, Ankesh Anand, Jessica Hamrick, Theophane Weber",http://arxiv.org/abs/2302.04009,,https://huggingface.co/papers/2302.04009,,,,2302.04009,7,0 "Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models","Phillip Rust, Anders Søgaard",,,,,,,,, Provable Copyright Protection for Generative Models,"Nikhil Vyas, Sham Kakade, Boaz Barak",http://arxiv.org/abs/2302.10870,,https://huggingface.co/papers/2302.10870,,,,2302.10870,3,1 Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems,"Atsushi Nitanda, Kazusato Oko, Denny Wu, Nobuhito Takenouchi, Taiji Suzuki",http://arxiv.org/abs/2303.02957,,https://huggingface.co/papers/2303.02957,,,,2303.02957,5,0 Online Local Differential Private Quantile Inference via Self-normalization,"Yi Liu, Qirui Hu, Lei Ding, Linglong Kong",,,,,,,,, Learnability and Algorithm for Continual Learning,"Gyuhak Kim, Changnan Xiao, Tatsuya Konishi, Bing Liu",,,,,,,,, A Conditional Normalizing Flow for Accelerated Multi-Coil MR Imaging,"Jeffrey Wen, Rizwan Ahmad, Phillip Schniter",http://arxiv.org/abs/2306.01630,https://github.com/jwen307/mri_cnf,https://huggingface.co/papers/2306.01630,,,,2306.01630,3,1 Compressing Tabular Data via Latent Variable Estimation,"Andrea Montanari, Eric Weiner",http://arxiv.org/abs/2302.09780,,https://huggingface.co/papers/2302.09780,,,,2302.09780,2,0 Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs,"Kaiwen Zheng, Cheng Lu, Jianfei Chen, Jun Zhu",http://arxiv.org/abs/2305.03935,,https://huggingface.co/papers/2305.03935,,,,2305.03935,4,0 Generalized-Smooth Nonconvex Optimization is As Efficient As Smooth Nonconvex Optimization,"Ziyi Chen, Yi Zhou, Yingbin LIANG, Zhaosong Lu",http://arxiv.org/abs/2303.02854,,https://huggingface.co/papers/2303.02854,,,,2303.02854,4,0 Efficient displacement convex optimization with particle gradient descent,"Hadi Daneshmand, Jason Lee, Chi Jin",http://arxiv.org/abs/2302.04753,,https://huggingface.co/papers/2302.04753,,,,2302.04753,3,0 Representer Point Selection for Explaining Regularized High-dimensional Models,"Che-Ping Tsai, Jiong Zhang, Hsiang-Fu Yu, Eli Chien, Cho-Jui Hsieh, Pradeep Ravikumar",http://arxiv.org/abs/2305.20002,,https://huggingface.co/papers/2305.20002,,,,2305.20002,6,0 Generative Pretraining for Offline Model-based Optimization,"Satvik Mashkaria, Siddarth Krishnamoorthy, Aditya Grover",,,,,,,,, Provable Multi-instance Deep AUC Maximization with Stochastic Pooling,"Dixian Zhu, Bokun Wang, Zhi Chen, Yaxing Wang, Milan Sonka, Xiaodong Wu, Tianbao Yang",http://arxiv.org/abs/2305.08040,,https://huggingface.co/papers/2305.08040,,,,2305.08040,7,0 RGE: A Repulsive Graph Rectification for Node Classification via Influence,"Jaeyun Song, Sungyub Kim, Eunho Yang",,,,,,,,, On the Relationship Between Explanation and Prediction: A Causal View,"Amir-Hossein Karimi, Krikamol Muandet, Simon Kornblith, Bernhard Schölkopf, Been Kim",http://arxiv.org/abs/2212.06925,,https://huggingface.co/papers/2212.06925,,,,2212.06925,5,0 MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation,"Omer Bar-Tal, Lior Yariv, Yaron Lipman, Tali Dekel",http://arxiv.org/abs/2302.08113,,https://huggingface.co/papers/2302.08113,,,,2302.08113,4,1 LIV: Language-Image Representations and Rewards for Robotic Control,"Yecheng Jason Ma, Vikash Kumar, Amy Zhang, Osbert Bastani, Dinesh Jayaraman",http://arxiv.org/abs/2306.00958,,https://huggingface.co/papers/2306.00958,,,,2306.00958,7,1 Improving $\ell_1$-Certified Robustness via Randomized Smoothing by Leveraging Box Constraints,"Václav Voráček, Matthias Hein",,,,,,,,, Representation-Driven Reinforcement Learning,"Ofir Nabati, Guy Tennenholtz, Shie Mannor",http://arxiv.org/abs/2305.19922,,https://huggingface.co/papers/2305.19922,,,,2305.19922,3,0 Distribution Free Prediction Sets for Node Classification,Jase Clarkson,http://arxiv.org/abs/2211.14555,,https://huggingface.co/papers/2211.14555,,,,2211.14555,1,0 Generalized Reductions: Making any Hierarchical Clustering Fair and Balanced with Low Cost,"Marina Knittel, Max Springer, John P Dickerson, MohammadTaghi Hajiaghayi",http://arxiv.org/abs/2205.14198,,https://huggingface.co/papers/2205.14198,,,,2205.14198,4,1 Few-Sample Feature Selection via Feature Manifold Learning,"David Cohen, Tal Shnitzer, Yuval Kluger, Ronen Talmon",,,,,,,,, Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces,"Javier E. Santos, Zachary Fox, Nicholas Lubbers, Yen Ting Lin",http://arxiv.org/abs/2305.11089,,https://huggingface.co/papers/2305.11089,,,,2305.11089,4,0 Fair Neighbor Embedding,"Jaakko Peltonen, Wen Xu, Timo Nummenmaa, Jyrki Nummenmaa",,,,,,,,, Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup,"Muthu Chidambaram, Xiang Wang, Chenwei Wu, Rong Ge",http://arxiv.org/abs/2210.13512,,https://huggingface.co/papers/2210.13512,,,,2210.13512,4,0 Fast as CHITA: Neural Network Pruning with Combinatorial Optimization,"Riade Benbaki, Wenyu Chen, Xiang Meng, Hussein Hazimeh, Natalia Ponomareva, Zhe Zhao, Rahul Mazumder",http://arxiv.org/abs/2302.14623,,https://huggingface.co/papers/2302.14623,,,,2302.14623,7,0 Revisiting Data-Free Knowledge Distillation with Poisoned Teachers,"Junyuan Hong, Yi Zeng, Shuyang Yu, Lingjuan Lyu, Ruoxi Jia, Jiayu Zhou",http://arxiv.org/abs/2306.02368,https://github.com/illidanlab/ABD,https://huggingface.co/papers/2306.02368,,,,2306.02368,6,1 RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution,"Pengyi Li, Hongyao Tang, Jianye Hao, Yan Zheng, Xian Fu",,,,,,,,, Toward Efficient Grad-Based Value Estimation,Arsalan Sharifnassab,,,,,,,,, Coupled Variational Autoencoder,"Xiaoran Hao, Patrick Shafto, Patrick Shafto",http://arxiv.org/abs/2306.02565,,https://huggingface.co/papers/2306.02565,,,,2306.02565,2,0 Federated Heavy Hitter Recovery under Linear Sketching,"Adria Gascon, Peter Kairouz, Ziteng Sun, Ananda Suresh",,,,,,,,, Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion,"Ashok Cutkosky, Harsh Mehta, Francesco Orabona",http://arxiv.org/abs/2302.03775,,https://huggingface.co/papers/2302.03775,,,,2302.03775,3,1 Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization,"Christopher Liao, Theodoros Tsiligkaridis, Brian Kulis",http://arxiv.org/abs/2210.01908,https://github.com/Chris210634/metric-learning-using-contextual-similarity,https://huggingface.co/papers/2210.01908,,,,2210.01908,3,1 User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems,"Marc Finzi, Anudhyan Boral, Leonardo Zepeda-Nunez, Andrew Wilson, Fei Sha",http://arxiv.org/abs/2306.07526,,https://huggingface.co/papers/2306.07526,,,,2306.07526,5,0 Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?,"Juliusz Ziomek, Haitham Bou Ammar",http://arxiv.org/abs/2301.12844,,https://huggingface.co/papers/2301.12844,,,,2301.12844,2,1 Accelerated Primal-Dual Methods for Convex-Strongly-Concave Saddle Point Problems,"Mohammad Khalafi, Digvijay Boob",http://arxiv.org/abs/2209.04604,,https://huggingface.co/papers/2209.04604,,,,2209.04604,2,0 "Extending Kernel PCA through Dualization: Sparsity, Robustness and Fast Algorithms","Francesco Tonin, Alex Lambert, Panagiotis Patrinos, Johan Suykens",http://arxiv.org/abs/2306.05815,,https://huggingface.co/papers/2306.05815,,,,2306.05815,4,0 Improved Analysis of Score-based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions,"Hongrui Chen, Holden Lee, Jianfeng Lu",http://arxiv.org/abs/2211.01916,,https://huggingface.co/papers/2211.01916,,,,2211.01916,3,1 Properties of the Mallows Model Depending on the Number of Alternatives: A Warning for an Experimentalist,"Niclas Boehmer, Piotr Faliszewski, Sonja Kraiczy",,,,,,,,, Linear CNNs Discover the Statistical Structure of the Dataset Using only the Most Dominant Frequencies,"Hannah Pinson, Joeri Lenaerts, Vincent Ginis",http://arxiv.org/abs/2303.02034,,https://huggingface.co/papers/2303.02034,,,,2303.02034,3,0 On Preemption and Learning in Stochastic Scheduling,"Nadav Merlis, Hugo Richard, Flore Sentenac, Corentin Odic, Mathieu Molina, Vianney Perchet",http://arxiv.org/abs/2205.15695,,https://huggingface.co/papers/2205.15695,,,,2205.15695,6,0 Joint Implicit Neural Representations for Global-Scale Species Mapping,"Elijah Cole, Grant Horn, Christian Lange, Alexander Shepard, Patrick Leary, Pietro Perona, Scott Loarie, Oisin Mac Aodha",,,,,,,,, TGRL: Teacher Guided Reinforcement Learning Algorithm,"Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, Pulkit Agrawal",,,,,,,,, Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts,"Étienne Marcotte, Valentina Zantedeschi, Alexandre Drouin, Nicolas Chapados",http://arxiv.org/abs/2304.09836,,https://huggingface.co/papers/2304.09836,,,,2304.09836,4,1 DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning,"Zifeng Wang, Zheng Zhan, Yifan Gong, Yucai Shao, Stratis Ioannidis, Yanzhi Wang, Jennifer Dy",http://arxiv.org/abs/2305.00380,,https://huggingface.co/papers/2305.00380,,,,2305.00380,7,1 Mitigating Propagation Failures in Physics-informed Neural Networks using Retain-Resample-Release (R3) Sampling,"Arka Daw, Jie Bu, Sifan Wang, Paris Perdikaris, Anuj Karpatne",http://arxiv.org/abs/2207.02338,,https://huggingface.co/papers/2207.02338,,,,2207.02338,5,1 Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models,"Nikolaos Dimitriadis, Pascal Frossard, François Fleuret",http://arxiv.org/abs/2210.09759,,https://huggingface.co/papers/2210.09759,,,,2210.09759,3,1 Approximate Stein Classes for Truncated Density Estimation,"Daniel J. Williams, Song Liu",http://arxiv.org/abs/2306.00602,,https://huggingface.co/papers/2306.00602,,,,2306.00602,2,1 Extending Conformal Prediction to Hidden Markov Models with Exact Validity via de Finetti's Theorem for Markov Chains,"Buddhika Nettasinghe, Samrat Chatterjee, Ramakrishna Tipireddy, Mahantesh Halappanavar",http://arxiv.org/abs/2210.02271,,https://huggingface.co/papers/2210.02271,,,,2210.02271,4,0 Generative Decoding of Visual Stimuli,"Eleni Miliotou, Panagiotis Kyriakis, Jason Hinman, Andrei Irimia, Paul Bogdan",,,,,,,,, Training-Free Neural Active Learning with Initialization-Robustness Guarantees,"Apivich Hemachandra, Zhongxiang Dai, Jasraj Singh, See-Kiong Ng, Bryan Kian Hsiang Low",http://arxiv.org/abs/2306.04454,,https://huggingface.co/papers/2306.04454,,,,2306.04454,5,0 Unit Scaling: Out-of-the-Box Low-Precision Training,"Charlie Blake, Charlie Blake, Douglas Orr, Carlo Luschi",http://arxiv.org/abs/2303.11257,,https://huggingface.co/papers/2303.11257,,,,2303.11257,3,2 NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data,"LIU SONGMING, Zhongkai Hao, Chengyang Ying, Hang Su, Ze Cheng, Jun Zhu",http://arxiv.org/abs/2305.18694,https://github.com/thu-ml/NUNO,https://huggingface.co/papers/2305.18694,,,,2305.18694,6,0 Diffusion Models for Offline Black-Box Optimization,"Siddarth Krishnamoorthy, Satvik Mashkaria, Aditya Grover",,,,,,,,, The Flan Collection: Designing Data and Methods for Effective Instruction Tuning,"Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc Le, Barret Zoph, Jason Wei, Adam Roberts",http://arxiv.org/abs/2301.13688,https://github.com/google-research/FLAN/tree/main/flan/v2,https://huggingface.co/papers/2301.13688,,,,2301.13688,11,2 Compositional Score Modeling for Simulation-Based Inference,"Tomas Geffner, George Papamakarios, Andriy Mnih",http://arxiv.org/abs/2209.14249,,https://huggingface.co/papers/2209.14249,,,,2209.14249,3,0 Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeyev, Chenlai Shi, Yuhao Tan, Kseniia Dudnyk, Jian Zhou",http://arxiv.org/abs/2305.10699,,https://huggingface.co/papers/2305.10699,,,,2305.10699,5,0 Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,, Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime,"Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar",http://arxiv.org/abs/2302.14154,,https://huggingface.co/papers/2302.14154,,,,2302.14154,4,0 Double-Weighting for Covariate Shift Adaptation,"José I. Segovia-Martín, Santiago Mazuelas, Anqi Liu",http://arxiv.org/abs/2305.08637,,https://huggingface.co/papers/2305.08637,,,,2305.08637,3,0 Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints,"Donghao Li, Ruiquan Huang, Cong Shen, Jing Yang",http://arxiv.org/abs/2306.06265,,https://huggingface.co/papers/2306.06265,,,,2306.06265,4,1 PASTA: Pessimistic Assortment Optimization,"Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan Fang, Vahid Tarokh",http://arxiv.org/abs/2302.03821,,https://huggingface.co/papers/2302.03821,,,,2302.03821,6,0 Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D,"Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Wei-Ying Ma, Yanyan Lan",,,,,,,,, Off-Policy Average Reward Actor-Critic with Deterministic Policy Search,"Naman Saxena, Subhojyoti Khastagir, Shishir Nadubettu Yadukumar, Shalabh Bhatnagar",http://arxiv.org/abs/2305.12239,,https://huggingface.co/papers/2305.12239,,,,2305.12239,4,1 On Kinetic Optimal Probability Paths for Generative Models,"Neta Shaul, Ricky T. Q. Chen, Maximilian Nickel, Matthew Le, Yaron Lipman",http://arxiv.org/abs/2306.06626,,https://huggingface.co/papers/2306.06626,,,,2306.06626,5,0 General Covariance Data Augmentation for Neural PDE Solvers,"Fanaskov Vladimir, Tianchi Yu, Alexander Rudikov, Ivan Oseledets",http://arxiv.org/abs/2301.12730,,https://huggingface.co/papers/2301.12730,,,,2301.12730,4,0 Towards Bridging the Gaps between the Right to Explanation and the Right to be Forgotten,"Satyapriya Krishna, Jiaqi Ma, Himabindu Lakkaraju",http://arxiv.org/abs/2302.04288,,https://huggingface.co/papers/2302.04288,,,,2302.04288,3,0 A/B Testing in Network Data with Covariate-Adaptive Randomization,"Jialu Wang, Ping Li, Feifang Hu",,,,,,,,, Multi-Task Differential Privacy Under Distribution Skew,"Walid Krichene, Prateek Jain, Shuang Song, Abhradeep Guha Thakurta, Li Zhang",http://arxiv.org/abs/2302.07975,,https://huggingface.co/papers/2302.07975,,,,2302.07975,6,0 Node Embedding from Neural Hamiltonian Orbits in Graph Neural Networks,"Qiyu Kang, Kai Zhao, Yang Song, Sijie Wang, Wee Peng Tay",http://arxiv.org/abs/2305.18965,https://github.com/zknus/Hamiltonian-GNN,https://huggingface.co/papers/2305.18965,,,,2305.18965,5,0 LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation,"Yixiao Li, Yifan Yu, Qingru Zhang, Chen Liang, Pengcheng He, Weizhu Chen, Tuo Zhao",,,,,,,,, Measuring the Impact of Programming Language Distribution,"Gabriel Orlanski, Kefan Xiao, Xavier Garcia, Jeffrey Hui, Joshua Howland, Jonathan Malmaud, Jacob Austin, Rishabh Singh, Michele Catasta",http://arxiv.org/abs/2302.01973,,https://huggingface.co/papers/2302.01973,,,,2302.01973,9,1 Principled Acceleration of Iterative Numerical Methods Using Machine Learning,"Sohei Arisaka, Qianxiao Li",http://arxiv.org/abs/2206.08594,,https://huggingface.co/papers/2206.08594,,,,2206.08594,2,0 Graph Inductive Biases in Transformers without Message Passing,"Liheng Ma, Chen Lin, Derek Lim, Adriana Romero Soriano, Puneet Dokania, Mark Coates, Phil Torr, Ser Nam Lim",http://arxiv.org/abs/2305.17589,,https://huggingface.co/papers/2305.17589,,,,2305.17589,8,1 Alternately Optimized Graph Neural Networks,"Haoyu Han, Xiaorui Liu, Haitao Mao, MohamadAli Torkamani, Feng Shi, Victor Lee, Jiliang Tang",http://arxiv.org/abs/2206.03638,,https://huggingface.co/papers/2206.03638,,,,2206.03638,7,0 Looped Transformers as Programmable Computers,"Angeliki Giannou, Shashank Rajput, Jy-yong Sohn, Kangwook Lee, Jason Lee, Dimitris Papailiopoulos",http://arxiv.org/abs/2301.13196,,https://huggingface.co/papers/2301.13196,,,,2301.13196,6,0 Active causal structure learning with advice,"Davin Choo, Themis Gouleakis, Arnab Bhattacharyya",http://arxiv.org/abs/2305.19588,,https://huggingface.co/papers/2305.19588,,,,2305.19588,3,0 Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities,"Brian R. Bartoldson, Bhavya Kailkhura, Davis Blalock",http://arxiv.org/abs/2210.06640,,https://huggingface.co/papers/2210.06640,,,,2210.06640,3,1 "In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation","Alicia Curth, Mihaela van der Schaar",http://arxiv.org/abs/2302.02923,,https://huggingface.co/papers/2302.02923,,,,2302.02923,2,0 GLOBE-CE: A Translation Based Approach for Global Counterfactual Explanations,"Dan Ley, Saumitra Mishra, Daniele Magazzeni",,,,,,,,, Predictive Flows for Faster Ford-Fulkerson,"Sami Davies, Benjamin Moseley, Sergei Vassilvitskii, Yuyan Wang",http://arxiv.org/abs/2303.00837,,https://huggingface.co/papers/2303.00837,,,,2303.00837,4,0 "Masked Trajectory Models for Prediction, Representation, and Control","Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran",http://arxiv.org/abs/2305.02968,https://github.com/facebookresearch/mtm,https://huggingface.co/papers/2305.02968,,,,2305.02968,7,4 DRew: Dynamically Rewired Message Passing with Delay,"Benjamin Gutteridge, Xiaowen Dong, Michael Bronstein, Francesco Di Giovanni",http://arxiv.org/abs/2305.08018,,https://huggingface.co/papers/2305.08018,,,,2305.08018,4,0 SinDDM: A Single Image Denoising Diffusion Model,"Vladimir Kulikov, Shahar Yadin, Matan Kleiner, Tomer Michaeli",http://arxiv.org/abs/2211.16582,,https://huggingface.co/papers/2211.16582,,,,2211.16582,4,1 Correcting discount-factor mismatch in on-policy policy gradient methods,"Fengdi Che, Gautham Vasan, Rupam Mahmood",,,,,,,,, Iterative Approximate Cross-Validation,"Yuetian Luo, Zhimei Ren, Rina Barber",http://arxiv.org/abs/2303.02732,,https://huggingface.co/papers/2303.02732,,,,2303.02732,3,0 Conformalization of Sparse Generalized Linear Models,"Etash Guha, Eugene Ndiaye, Xiaoming Huo",,,,,,,,, Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value,"Yongchan Kwon, James Zou",,,,,,,,, Variational Mixture of HyperGenerators for Learning Distributions over Functions,"Batuhan Koyuncu, Pablo Sanchez Martin, Ignacio Peis, Pablo Olmos, Isabel Valera",http://arxiv.org/abs/2302.06223,,https://huggingface.co/papers/2302.06223,,,,2302.06223,5,0 In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation,"Julian Bitterwolf, Maximilian Müller, Matthias Hein",http://arxiv.org/abs/2306.00826,https://github.com/j-cb/NINCO,https://huggingface.co/papers/2306.00826,,,,2306.00826,3,1 Neural FIM for learning Fisher information metrics from point cloud data,"Oluwadamilola Fasina, Guillaume Huguet, Alexander Tong, Yanlei Zhang, Guy Wolf, Maximilian Nickel, Ian Adelstein, Smita Krishnaswamy",http://arxiv.org/abs/2306.06062,,https://huggingface.co/papers/2306.06062,,,,2306.06062,8,0 Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits,"Xuejie Liu, Anji Liu, Guy Van den Broeck, Yitao Liang",http://arxiv.org/abs/2302.08086,,https://huggingface.co/papers/2302.08086,,,,2302.08086,4,0 Optimal No-Regret Learning for One-Sided Lipschitz Functions,"PAUL DUETTING, Guru Guruganesh, Jon Schneider, Joshua Wang",,,,,,,,, Flexible Phase Dynamics for Bio-plausible Contrastive Learning,"Ezekiel Williams, Colin Bredenberg, Guillaume Lajoie",http://arxiv.org/abs/2302.12431,,https://huggingface.co/papers/2302.12431,,,,2302.12431,3,0 Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion,"Marin Biloš, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Stephan Günnemann",http://arxiv.org/abs/2211.02590,,https://huggingface.co/papers/2211.02590,,,,2211.02590,5,0 On the Convergence Rates of Policy Gradient Methods,Lin Xiao,,,,,,,,, Generating Private Synthetic Data with Genetic Algorithms,"Terrance Liu, Jingwu Tang, Giuseppe Vietri, Steven Wu",http://arxiv.org/abs/2306.03257,,https://huggingface.co/papers/2306.03257,,,,2306.03257,4,0 E$(n)$ Equivariant Message Passing Simplicial Networks,"Floor Eijkelboom, Rob Hesselink, Erik Bekkers",,,,,,,,, Tensor Decompositions Meet Control Theory: Learning General Mixtures of Linear Dynamical Systems,"Ainesh Bakshi, Allen Liu, Ankur Moitra, morris yau",,,,,,,,, Counterfactual Identifiability of Bijective Causal Models,"Arash Nasr-Esfahany, Mohammad Alizadeh, Devavrat Shah",http://arxiv.org/abs/2302.02228,,https://huggingface.co/papers/2302.02228,,,,2302.02228,3,1 Bridging the Gap between Neural and Classical Approaches in Abstract Geometric Reasoning with Attention-based Lattice Symmetry Priors,"Mattia Atzeni, Mrinmaya Sachan, Andreas Loukas",,,,,,,,, Linear Causal Disentanglement via Interventions,"Chandler Squires, Anna Seigal, Salil Bhate, Caroline Uhler",http://arxiv.org/abs/2211.16467,,https://huggingface.co/papers/2211.16467,,,,2211.16467,4,0 abess: A Fast Best-Subset Selection Library in Python and R,"Jin Zhu, Xueqin Wang, Liyuan Hu, Junhao Huang, Kangkang Jiang, Yanhang Zhang, Shiyun Lin, Junxian Zhu",,,,,,,,, SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process,"Zichong Li, Yanbo Xu, Simiao Zuo, Haoming Jiang, Chao Zhang, Tuo Zhao, Hongyuan Zha",,,,,,,,, User-level Private Stochastic Convex Optimization with Optimal Rates,"Raef Bassily, Ziteng Sun",,,,,,,,, Improved Online Learning Algorithms for CTR Prediction in Ad Auctions,"Zhe Feng, Christopher Liaw, Zixin Zhou",,,,,,,,, Memory-Based Meta-Learning on Non-Stationary Distributions,"Tim Genewein, Gregoire Deletang, Anian Ruoss, Li Kevin Wenliang, Elliot Catt, Vincent Dutordoir, Jordi Grau-Moya, Laurent Orseau, Marcus Hutter, Joel Veness",http://arxiv.org/abs/2302.03067,,https://huggingface.co/papers/2302.03067,,,,2302.03067,10,1 Revisiting Sampling for Combinatorial Optimization,"Haoran Sun, Katayoon Goshvadi, Azade Nova, Dale Schuurmans, Hanjun Dai",,,,,,,,, Beam Tree Recursive Cells,"Jishnu Ray Chowdhury, Cornelia Caragea",http://arxiv.org/abs/2305.19999,https://github.com/JRC1995/BeamTreeRecursiveCells,https://huggingface.co/papers/2305.19999,,,,2305.19999,2,1 Posterior Sampling for Deep Reinforcement Learning,"Remo Sasso, Michelangelo Conserva, Paulo Rauber",http://arxiv.org/abs/2305.00477,,https://huggingface.co/papers/2305.00477,,,,2305.00477,3,0 Hierarchical Clustering: A Nearly-Optimal Construction for Well-Clustered Graphs,"Steinar Laenen, Bogdan Manghiuc, He Sun",,,,,,,,, Parallel neurosymbolic integration with Concordia,"Jonathan Feldstein, Modestas Jurcius, Efthymia Tsamoura",http://arxiv.org/abs/2306.00480,,https://huggingface.co/papers/2306.00480,,,,2306.00480,3,0 PFGM++: Unlocking the Potential of Physics-Inspired Generative Models,"Yilun Xu, Ziming Liu, Yonglong Tian, Shangyuan Tong, Max Tegmark, Tommi Jaakkola",http://arxiv.org/abs/2302.04265,https://github.com/Newbeeer/pfgmpp,https://huggingface.co/papers/2302.04265,,,,2302.04265,6,1 Neural Markov Jump Processes,"Patrick Seifner, Ramses J Sanchez",http://arxiv.org/abs/2305.19744,,https://huggingface.co/papers/2305.19744,,,,2305.19744,2,1 Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling,"Tianqi Chen, Mingyuan Zhou",http://arxiv.org/abs/2305.18375,,https://huggingface.co/papers/2305.18375,,,,2305.18375,2,1 "From Perception to Programs: Regularize, Overparameterize, and Amortize","Hao Tang, Kevin Ellis",http://arxiv.org/abs/2206.05922,,https://huggingface.co/papers/2206.05922,,,,2206.05922,2,1 $H$-Consistency Bounds for Pairwise Misranking Loss Surrogates,"Anqi Mao, Mehryar Mohri, Yutao Zhong",,,,,,,,, Statistical Indistinguishability of Learning Algorithms,"Alkis Kalavasis, Amin Karbasi, Shay Moran, Grigoris Velegkas",http://arxiv.org/abs/2305.14311,,https://huggingface.co/papers/2305.14311,,,,2305.14311,4,0 Reprogramming Pretrained Language Models for Antibody Sequence Infilling,"Igor Melnyk, Vijil Chenthamarakshan, Pin-Yu Chen, Payel Das, Amit Dhurandhar, Inkit Padhi, Devleena Das",,,,,,,,, Adversarially Robust PAC Learnability of Real-Valued Functions,"Idan Attias, Steve Hanneke",http://arxiv.org/abs/2206.12977,,https://huggingface.co/papers/2206.12977,,,,2206.12977,2,0 Approximate Thompson Sampling with Logarithmic Batches: Bandits and Reinforcement Learning,"Amin Karbasi, Nikki Lijing Kuang, Yian Ma, Siddharth Mitra",,,,,,,,, Adversarial robustness of amortized Bayesian inference,"Manuel Gloeckler, Michael Deistler, Jakob Macke",http://arxiv.org/abs/2305.14984,,https://huggingface.co/papers/2305.14984,,,,2305.14984,3,0 Rethinking Backdoor Attacks,"Alaa Khaddaj, Guillaume Leclerc, Aleksandar Makelov, Kristian Georgiev, Andrew Ilyas, Hadi Salman, Aleksander Madry",,,,,,,,, Kernel QuantTree,"Diego Stucchi, Paolo Rizzo, Nicolò Folloni, Giacomo Boracchi",,,,,,,,, End-to-end Training of Deep Boltzmann Machines by Unbiased Contrastive Divergence with Local Mode Initialization,"Shohei Taniguchi, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo",http://arxiv.org/abs/2305.19684,,https://huggingface.co/papers/2305.19684,,,,2305.19684,4,0 Low Complexity Homeomorphic Projection to Ensure Neural-Network Solution Feasibility for Optimization over (Non-)Convex Set,"Enming Liang, Minghua Chen, Steven Low",,,,,,,,, General Sequential Episodic Memory Model,"Arjun Karuvally, Terrence Sejnowski, Hava Sieglemann",,,,,,,,, Extrapolative Controlled Sequence Generation via Iterative Refinement,"Vishakh Padmakumar, Richard Yuanzhe Pang, He He, Ankur Parikh",http://arxiv.org/abs/2303.04562,https://github.com/vishakhpk/iter-extrapolation,https://huggingface.co/papers/2303.04562,,,,2303.04562,4,1 A Model-Based Method for Minimizing CVaR and Beyond,"Si Yi Meng, Robert Gower",http://arxiv.org/abs/2305.17498,,https://huggingface.co/papers/2305.17498,,,,2305.17498,2,1 Future-conditioned Unsupervised Pretraining for Decision Transformer,"Zhihui Xie, Zichuan Lin, Deheng Ye, Qiang Fu, Wei Yang, Shuai Li",http://arxiv.org/abs/2305.16683,,https://huggingface.co/papers/2305.16683,,,,2305.16683,6,1 Ewald-based Long-Range Message Passing for Molecular Graphs,"Arthur Kosmala, Johannes Gasteiger, Nicholas Gao, Stephan Günnemann",http://arxiv.org/abs/2303.04791,,https://huggingface.co/papers/2303.04791,,,,2303.04791,4,0 Applied Online Algorithms with Heterogeneous Predictors,"Jessica Maghakian, Russell Lee, Mohammad Hajiesmaili, Jian Li, Ramesh Sitaraman, Zhenhua Liu",,,,,,,,, "Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation","Andi Peng, Aviv Netanyahu, Mark Ho, Tianmin Shu, Andreea Bobu, Julie Shah, Pulkit Agrawal",,,,,,,,, Equivariance with Learned Canonicalization Functions,"Sékou-Oumar Kaba, Arnab Kumar Mondal, Yan Zhang, Yoshua Bengio, Siamak Ravanbakhsh",http://arxiv.org/abs/2211.06489,,https://huggingface.co/papers/2211.06489,,,,2211.06489,5,0 Tied-Augment: Controlling Representation Similarity Improves Data Augmentation,"Emirhan Kurtulus, Zichao Li, Yann Nicolas Dauphin, Ekin Dogus Cubuk",,,,,,,,, Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach,"Prashant Khanduri, Ioannis Tsaknakis, Yihua Zhang, Jia Liu, Sijia Liu, Jiawei Zhang, Mingyi Hong",,,,,,,,, MG-GNN: Multigrid Graph Neural Networks for Learning Multilevel Domain Decomposition Methods,"Ali Taghibakhshi, Nicolas Nytko, Tareq Uz Zaman, Scott MacLachlan, Luke Olson, Matthew West",,,,,,,,, Simple and Fast Group Robustness by Automatic Feature Reweighting,"Shikai Qiu, Andres Potapczynski, Pavel Izmailov, Andrew Wilson",,,,,,,,, An SDE for Modeling SAM: Theory and Insights,"Enea Monzio Compagnoni, Luca Biggio, Antonio Orvieto, Frank Proske, Hans Kersting, Aurelien Lucchi",http://arxiv.org/abs/2301.08203,,https://huggingface.co/papers/2301.08203,,,,2301.08203,6,0 Emergent Asymmetry of Precision and Recall for Measuring Fidelity and Diversity of Generative Models in High Dimensions,"Mahyar Khayatkhoei, Wael AbdAlmageed",http://arxiv.org/abs/2306.09618,,https://huggingface.co/papers/2306.09618,,,,2306.09618,2,0 ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines,"Siyuan Chen, Pratik Fegade, Phillip Gibbons, Todd Mowry, Tianqi Chen",,,,,,,,, Continuous Spatiotemporal Transformer,"Antonio Henrique de Oliveira Fonseca, Emanuele Zappala, Josue Ortega Caro, David van Dijk",,,,,,,,, Machine Learning Force Fields with Data Cost Aware Training,"Alexander Bukharin, Tianyi Liu, Shengjie Wang, Simiao Zuo, Weihao Gao, Wen Yan, Tuo Zhao",http://arxiv.org/abs/2306.03109,https://github.com/abukharin3/asteroid,https://huggingface.co/papers/2306.03109,,,,2306.03109,7,1 Hypervolume Knowledge Gradient: A Lookahead Approach for Multi-Objective Bayesian Optimization with Partial Information,"Samuel Daulton, Maximilian Balandat, Eytan Bakshy",,,,,,,,, One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill,"Sangwoo Shin, Daehee Lee, Minjong Yoo, Woo Kyung Kim, Honguk Woo",,,,,,,,, Model-based Offline Reinforcement Learning with Count-based Conservatism,"Byeongchan Kim, Min-hwan Oh",,,,,,,,, Are Gaussian Data All You Need? The Extents and Limits of Universality in High-Dimensional Generalized Linear Estimation,"Luca Pesce, FLORENT KRZAKALA, Bruno Loureiro, Ludovic Stephan",,,,,,,,, Algorithms for bounding contribution for histogram estimation under user-level privacy,"Yuhan Liu, Ananda Suresh, Wennan Zhu, Peter Kairouz, Marco Gruteser",,,,,,,,, On Pitfalls of Test-Time Adaptation,"Hao Zhao, Hao Zhao, Yuejiang Liu, Alexandre Alahi, Tao Lin",http://arxiv.org/abs/2306.03536,https://github.com/lins-lab/ttab,https://huggingface.co/papers/2306.03536,,,,2306.03536,4,1 Generalized Implicit Follow-The-Regularized-Leader,"Keyi Chen, Francesco Orabona",http://arxiv.org/abs/2306.00201,,https://huggingface.co/papers/2306.00201,,,,2306.00201,2,0 Simplified Temporal Consistency Reinforcement Learning,"Yi Zhao, Wenshuai Zhao, Rinu Boney, Kannala Juho, Joni Pajarinen",http://arxiv.org/abs/2306.09466,,https://huggingface.co/papers/2306.09466,,,,2306.09466,5,0 "Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data","Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang",http://arxiv.org/abs/2302.07194,,https://huggingface.co/papers/2302.07194,,,,2302.07194,4,0 A Robust Test for the Stationarity Assumption in Sequential Decision Making,"Jitao Wang, Chengchun Shi, Zhenke Wu",,,,,,,,, Maximal Initial Learning Rates in Deep ReLU Networks,"Gaurav Iyer, Boris Hanin, David Rolnick",http://arxiv.org/abs/2212.07295,,https://huggingface.co/papers/2212.07295,,,,2212.07295,3,0 FAENet: Frame Averaging Equivariant GNNs for Materials Modeling,"ALEXANDRE DUVAL, Victor Schmidt, Alex Hernandez-Garcia, Fragkiskos Malliaros, Yoshua Bengio, Santiago Miret, David Rolnick",,,,,,,,, Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels,"Alexander Immer, Tycho van der Ouderaa, Mark van der Wilk, Gunnar Ratsch, Bernhard Schölkopf",http://arxiv.org/abs/2306.03968,,https://huggingface.co/papers/2306.03968,,,,2306.03968,5,2 Single Point-Based Distributed Zeroth-Order Optimization with a Non-Convex Stochastic Objective Function,"Elissa Mhanna, Mohamad Assaad",,,,,,,,, On the Convergence of Gradient Flow on Multi-layer Linear Models,"Hancheng Min, Rene Vidal, Enrique Mallada",,,,,,,,, Towards Understanding and Reducing Graph Structural Noise for GNNs,"Mingze Dong, Yuval Kluger",,,,,,,,, High Probability Convergence of Stochastic Gradient Methods ,"Zijian Liu, Ta Duy Nguyen, Thien Nguyen, Alina Ene, Huy Nguyen",,,,,,,,, Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model,"Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang",http://arxiv.org/abs/2303.08613,,https://huggingface.co/papers/2303.08613,,,,2303.08613,4,2 SLAMB: Accelerated Large Batch Training with Sparse Communication,"Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, TingWen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis",,,,,,,,, Efficient Quantum Algorithms for Quantum Optimal Control,"Xiantao Li, Chunhao Wang",http://arxiv.org/abs/2304.02613,,https://huggingface.co/papers/2304.02613,,,,2304.02613,2,0 Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation,"Aditya Mate, Bryan Wilder, Aparna Taneja, Milind Tambe",http://arxiv.org/abs/2302.02570,,https://huggingface.co/papers/2302.02570,,,,2302.02570,4,1 Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization,"Jian Cao, Myeongjong Kang, Felix Jimenez, Huiyan Sang, Florian Schaefer, Matthias Katzfuss",http://arxiv.org/abs/2301.13303,,https://huggingface.co/papers/2301.13303,,,,2301.13303,6,0 Efficient exploration via epistemic-risk-seeking policy gradients,Brendan O'Donoghue,,,,,,,,, Probing the Deep Neural Manifold of Reinforcement Learning to Expose Volatility,"Ezgi Korkmaz, Jonah Brown-Cohen",,,,,,,,, Data-Derived Weak Universal Consistency,"Narayana Santhanam, Venkatachalam Anantharam, Wojciech Szpankowski",,,,,,,,, Auxiliary Learning as an Asymmetric Bargaining Game,"Aviv Shamsian, Aviv Navon, Neta Glazer, Kenji Kawaguchi, Gal Chechik, Ethan Fetaya",http://arxiv.org/abs/2301.13501,,https://huggingface.co/papers/2301.13501,,,,2301.13501,6,1 The Unreasonable Effectiveness of Few-shot Learning for Machine Translation,"Xavier Garcia, Yamini Bansal, Colin Cherry, George Foster, Maxim Krikun, Melvin Johnson, Orhan Firat",http://arxiv.org/abs/2302.01398,,https://huggingface.co/papers/2302.01398,,,,2302.01398,8,0 From Relational Pooling to Subgraph GNNs: A Universal Framework for More Expressive Graph Neural Networks,"Cai Zhou, Xiyuan Wang, Muhan Zhang",http://arxiv.org/abs/2305.04963,,https://huggingface.co/papers/2305.04963,,,,2305.04963,3,0 Optimizing the Collaboration Structure in Cross-Silo Federated Learning,"Wenxuan Bao, Haohan Wang, Jun Wu, Jingrui He",http://arxiv.org/abs/2306.06508,,https://huggingface.co/papers/2306.06508,,,,2306.06508,4,0 Adaptive Whitening in Neural Populations with Gain-modulating Interneurons,"Lyndon Duong, David Lipshutz, David Heeger, Dmitri Chklovskii, Eero Simoncelli",http://arxiv.org/abs/2301.11955,,https://huggingface.co/papers/2301.11955,,,,2301.11955,5,1 Identifiability and Generalizability in Constrained Inverse Reinforcement Learning,"Andreas Schlaginhaufen, Maryam Kamgarpour",http://arxiv.org/abs/2306.00629,,https://huggingface.co/papers/2306.00629,,,,2306.00629,2,0 Towards Constituting Mathematical Structures for Learning to Optimize,"Jialin Liu, Xiaohan Chen, Zhangyang “Atlas” Wang, Wotao Yin, HanQin Cai",http://arxiv.org/abs/2305.18577,,https://huggingface.co/papers/2305.18577,,,,2305.18577,5,0 Hyperparameters in Reinforcement Learning and How To Tune Them,"Theresa Eimer, Marius Lindauer, Roberta Raileanu",http://arxiv.org/abs/2306.01324,https://github.com/facebookresearch/how-to-autorl,https://huggingface.co/papers/2306.01324,,,,2306.01324,3,1 On Bridging the Gap between Mean Field and Finite Width in Deep Random Multilayer Perceptron with Batch Normalization,"Amir Joudaki, Hadi Daneshmand, Francis Bach",,,,,,,,, Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers,"Sitan Chen, Giannis Daras, Alexandros Dimakis",http://arxiv.org/abs/2303.03384,,https://huggingface.co/papers/2303.03384,,,,2303.03384,3,0 Exact Inference in High-order Structured Prediction,"Chuyang Ke, Jean Honorio",http://arxiv.org/abs/2302.03236,,https://huggingface.co/papers/2302.03236,,,,2302.03236,2,0 When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,1 Doubly Optimal No-Regret Learning in Monotone Games,"Yang Cai, Weiqiang Zheng",http://arxiv.org/abs/2301.13120,,https://huggingface.co/papers/2301.13120,,,,2301.13120,2,0 Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows,"Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Solja\v{c}i\'{c}",,,,,,,,, Feature learning in deep classifiers through Intermediate Neural Collapse,"Akshay Rangamani, Marius Lindegaard, Tomer Galanti, Tomaso Poggio",,,,,,,,, On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning,"Hojoon Lee, Koanho Lee, Dongyoon Hwang, Hyunho Lee, Byungkun Lee, Jaegul Choo",http://arxiv.org/abs/2306.05637,https://github.com/dojeon-ai/SimTPR,https://huggingface.co/papers/2306.05637,,,,2306.05637,6,0 BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning,"Kishaan Jeeveswaran, Prashant Bhat, Bahram Zonooz, Elahe Arani",http://arxiv.org/abs/2305.04769,,https://huggingface.co/papers/2305.04769,,,,2305.04769,4,2 Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories,"Zixuan Zhang, Minshuo Chen, Mengdi Wang, Wenjing Liao, Tuo Zhao",,,,,,,,, FARE: Provably Fair Representation Learning with Practical Certificates,"Nikola Jovanović, Mislav Balunovic, Dimitar I. Dimitrov, Martin Vechev",http://arxiv.org/abs/2210.07213,,https://huggingface.co/papers/2210.07213,,,,2210.07213,4,1 Fast Federated Machine Unlearning with Nonlinear Functional Theory,"Tianshi Che, Yang Zhou, Zijie Zhang, Lingjuan Lyu, Ji Liu, Da Yan, Dejing Dou, Jun Huan",,,,,,,,, Learning Unforeseen Robustness from Out-of-distribution Data Using Equivariant Domain Translator,"Sicheng Zhu, Bang An, Furong Huang, Sanghyun Hong",,,,,,,,, Task-Specific Skill Localization in Fine-tuned Language Models,"Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora",http://arxiv.org/abs/2302.06600,,https://huggingface.co/papers/2302.06600,,,,2302.06600,4,1 Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments,"Haixu Ma, Donglin Zeng, Yufeng Liu",,,,,,,,, Dynamic IMLE for Few-shot Pretraining-free Generative Modelling,"Mehran Aghabozorgi, Shichong Peng, Ke Li",,,,,,,,, Subset-Based Instance Optimality in Private Estimation,"Travis Dick, Alex Kulesza, Ziteng Sun, Ananda Suresh",http://arxiv.org/abs/2303.01262,,https://huggingface.co/papers/2303.01262,,,,2303.01262,4,0 How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding,"Yuchen Li, Yuanzhi Li, Andrej Risteski",http://arxiv.org/abs/2303.04245,,https://huggingface.co/papers/2303.04245,,,,2303.04245,3,1 DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm,"Lisang Ding, Kexin Jin, Bicheng Ying, Kun Yuan, Wotao Yin",,,,,,,,, Conformal Prediction with Missing Values,"Margaux Zaffran, Aymeric Dieuleveut, Julie Josse, Yaniv Romano",http://arxiv.org/abs/2306.02732,,https://huggingface.co/papers/2306.02732,,,,2306.02732,4,0 Sparse Learning of Dynamical Systems in RKHS: An Operator-Theoretic Approach,"Boya Hou, Sina Sanjari, Nathan Dahlin, Subhonmesh Bose, Umesh Vaidya",,,,,,,,, Characterizing Multicalibration via Property Elicitation,"Georgy Noarov, Aaron Roth",,,,,,,,, Cut your Losses with Squentropy,"Like Hui, Misha Belkin, Stephen Wright",http://arxiv.org/abs/2302.03952,,https://huggingface.co/papers/2302.03952,,,,2302.03952,3,1 Multi-Agent Learning from Learners,"MINE M CALISKAN, Francesco Chini, Setareh Maghsudi",,,,,,,,, Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning,"Matthias Gerstgrasser, David Parkes",,,,,,,,, Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees,"Faisal Hamman, Erfaun Noorani, Saumitra Mishra, Daniele Magazzeni, Sanghamitra Dutta",http://arxiv.org/abs/2305.11997,,https://huggingface.co/papers/2305.11997,,,,2305.11997,5,1 Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables,"Rick Wilming, Leo Kieslich, Benedict Clark, Stefan Haufe",http://arxiv.org/abs/2306.01464,,https://huggingface.co/papers/2306.01464,,,,2306.01464,4,1 When do Minimax-fair Learning and Empirical Risk Minimization Coincide?,"Harvineet Singh, Matthäus Kleindessner, Volkan Cevher, Rumi Chunara, Chris Russell",,,,,,,,, Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows ,"Phillip Si, Zeyi Chen, Subham S Sahoo, Yair Schiff, Volodymyr Kuleshov",,,,,,,,, Global optimality of Elman-type RNNs in the mean-field regime,"Andrea Agazzi, Andrea Agazzi, Jianfeng Lu, Sayan Mukherjee",,,,,,,,, A Large-Scale Study of Probabilistic Calibration in Neural Network Regression,"Victor Dheur, Souhaib Ben Taieb",http://arxiv.org/abs/2306.02738,,https://huggingface.co/papers/2306.02738,,,,2306.02738,2,1 Selective Machine Learning of the Average Treatment Effect with an Invalid Instrumental Variable,"Baoluo Sun, Yifan Cui, Eric Tchetgen Tchetgen",http://arxiv.org/abs/1907.11882,,https://huggingface.co/papers/1907.11882,,,,1907.11882,3,0 Partial Optimality in Cubic Correlation Clustering,"David Stein, Silvia Di Gregorio, Bjoern Andres",http://arxiv.org/abs/2302.04694,,https://huggingface.co/papers/2302.04694,,,,2302.04694,3,0 Universal Physics-Informed Neural Networks: Symbolic Differential Operator Discovery with Sparse Data,"Lena Podina, Brydon Eastman, Mohammad Kohandel",,,,,,,,, Open-Vocabulary Universal Image Segmentation with MaskCLIP,"Zheng Ding, Jacky Wang, Zhuowen Tu",http://arxiv.org/abs/2208.08984,,https://huggingface.co/papers/2208.08984,,,,2208.08984,3,0 DRCFS: Doubly Robust Causal Feature Selection,"Francesco Quinzan, Ashkan Soleymani, Patrick Jaillet, Cristian R. Rojas, Stefan Bauer",http://arxiv.org/abs/2306.07024,,https://huggingface.co/papers/2306.07024,,,,2306.07024,5,0 Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression,"Zhuoran Liu, Zhengyu Zhao, Martha Larson",http://arxiv.org/abs/2301.13838,,https://huggingface.co/papers/2301.13838,,,,2301.13838,3,0 Bootstrap in High Dimension with Low Computation,"Henry Lam, Zhenyuan Liu",http://arxiv.org/abs/2210.10974,,https://huggingface.co/papers/2210.10974,,,,2210.10974,2,0 Approximate Causal Effect Identification under Weak Confounding,"Ziwei Jiang, Lai Wei, Murat Kocaoglu",,,,,,,,, Mixing Predictions for Online Metric Algorithms,"Antonios Antoniadis, Christian Coester, Marek Elias, Adam Polak, Bertrand Simon",http://arxiv.org/abs/2304.01781,,https://huggingface.co/papers/2304.01781,,,,2304.01781,5,0 Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search,"Pierre-Alexandre Kamienny, Guillaume Lample, sylvain lamprier, Marco Virgolin",http://arxiv.org/abs/2302.11223,,https://huggingface.co/papers/2302.11223,,,,2302.11223,4,0 Stable and Consistent Prediction of 3D Characteristic Orientation via Invariant Residual Learning,"Seungwook Kim, Chunghyun Park, Yoonwoo Jeong, Jaesik Park, Minsu Cho",,,,,,,,, TabLeak: Tabular Data Leakage in Federated Learning,"Mark Vero, Mislav Balunovic, Dimitar I. Dimitrov, Martin Vechev",,,,,,,,, IRNeXt: Rethinking Convolutional Network Design for Image Restoration,"Yuning Cui, Wenqi Ren, Sining Yang, Xiaochun Cao, Alois Knoll",,,,,,,,, Active Policy Improvement from Multiple Black-box Oracles,"Xuefeng Liu, Takuma Yoneda, Chaoqi Wang, Matthew Walter, Yuxin Chen",,,,,,,,, Learning Temporally AbstractWorld Models without Online Experimentation,"Benjamin Freed, Siddarth Venkatraman, Guillaume Sartoretti, Jeff Schneider, Howie Choset",,,,,,,,, Short-lived High-volume Bandits,"Su Jia, Nishant Oli, Andrew Li, R. Ravi, Paul Duff, Ian Anderson",,,,,,,,, Action Matching: Learning Stochastic Dynamics from Samples,"Kirill Neklyudov, Rob Brekelmans, Daniel Severo, Alireza Makhzani",http://arxiv.org/abs/2210.06662,,https://huggingface.co/papers/2210.06662,,,,2210.06662,4,0 Robust and private stochastic linear bandits,"Vasilis Charisopoulos, Hossein Esfandiari, Vahab Mirrokni",,,,,,,,, Internet Explorer: Targeted Representation Learning on the Open Web,"Alexander Li, Ellis Brown, Alexei Efros, Deepak Pathak",http://arxiv.org/abs/2302.14051,,https://huggingface.co/papers/2302.14051,,,,2302.14051,4,1 The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation,"Philip Amortila, Nan Jiang, Csaba Szepesvari",,,,,,,,, Predicting Rare Events by Shrinking Towards Proportional Odds,"Gregory Faletto, Jacob Bien",http://arxiv.org/abs/2305.18700,,https://huggingface.co/papers/2305.18700,,,,2305.18700,2,0 Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators,"Paavo Parmas, Takuma Seno, Yuma Aoki",,,,,,,,, Slot-VAE: Object-Centric Scene Generation with Slot Attention,"Yanbo Wang, Letao Liu, Justin Dauwels",,,,,,,,, Lottery Tickets in Evolutionary Optimization: On Sparse Backpropagation-Free Trainability,"Robert Lange, Henning Sprekeler",http://arxiv.org/abs/2306.00045,,https://huggingface.co/papers/2306.00045,,,,2306.00045,2,0 Polynomial Preconditioning for Gradient Methods,"Nikita Doikov, Anton Rodomanov",http://arxiv.org/abs/2301.13194,,https://huggingface.co/papers/2301.13194,,,,2301.13194,2,0 MultiAdam: Parameter-wise Scale-invariant Optimizer for Physics-informed Neural Network,"Jiachen Yao, Chang Su, Zhongkai Hao, LIU SONGMING, Hang Su, Jun Zhu",,,,,,,,, Scaling Up Dataset Distillation to ImageNet-1K with Constant Memory,"Justin Cui, Ruochen Wang, Si Si, Cho-Jui Hsieh",http://arxiv.org/abs/2211.10586,,https://huggingface.co/papers/2211.10586,,,,2211.10586,4,0 Learning Belief Representations for Partially Observable Deep RL,"Andrew Wang, Andrew C Li, Rodrigo A Toro Icarte, Toryn Q Klassen, Sheila McIlraith",,,,,,,,, Hierarchical Learning in Hyperbolic Space: Revisit and Beyond,"Menglin Yang, Min Zhou, Rex Ying, yankai Chen, Irwin King",,,,,,,,, Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modularized Learning,"Zhongzhi Yu, Yang Zhang, Kaizhi Qian, Cheng Wan, Yonggan Fu, Yongan Zhang, Yingyan (Celine) Lin",,,,,,,,, Towards Understanding the Role of Attention in Prompt-tuning,"Samet Oymak, Ankit Singh Rawat, Mahdi Soltanolkotabi, Christos Thrampoulidis",,,,,,,,, Simple Disentanglement of Style and Content in Visual Representations,"Lilian Ngweta, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin",http://arxiv.org/abs/2302.09795,,https://huggingface.co/papers/2302.09795,,,,2302.09795,5,4 On Regularization and Inference with Label Constraints,"Kaifu Wang, Hangfeng He, Tin Nguyen, Piyush Kumar, Dan Roth",,,,,,,,, Improved Learning-Augmented Algorithms for the Multi-Option Ski Rental Problem via Best-Possible Competitive Analysis,"Yongho Shin, Changyeol Lee, Gukryeol Lee, Hyung-Chan An",http://arxiv.org/abs/2302.06832,,https://huggingface.co/papers/2302.06832,,,,2302.06832,4,0 Towards Learning Geometric Eigen-Lengths Crucial for Fitting Tasks,"Yijia Weng, Kaichun Mo, Ruoxi Shi, Yanchao Yang, Leonidas Guibas",,,,,,,,, The Hessian perspective into the Nature of Convolutional Neural Networks,"Sidak Pal Singh, Thomas Hofmann, Bernhard Schölkopf",http://arxiv.org/abs/2305.09088,,https://huggingface.co/papers/2305.09088,,,,2305.09088,3,1 MultiresNet: Sequence Modeling with Multiresolution Convolutional Memory,"Jiaxin Shi, Ke Alexander Wang, Emily Fox",,,,,,,,, Unsupervised Out-of-Distribution Detection with Diffusion Inpainting,"Zhenzhen Liu, Jin Zhou, Yufan Wang, Kilian Weinberger",http://arxiv.org/abs/2302.10326,,https://huggingface.co/papers/2302.10326,,,,2302.10326,4,2 Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations,"Yongyi Yang, Jacob Steinhardt, Wei Hu",,,,,,,,, Contextual Reliability: When Different Features Matter in Different Contexts,"Gaurav Ghosal, Amrith Setlur, Daniel S Brown, Anca Dragan, Aditi Raghunathan",,,,,,,,, On Enhancing Expressive Power via Compositions of Single Fixed-Size ReLU Network,"Shijun Zhang, Jianfeng Lu, Hongkai Zhao",http://arxiv.org/abs/2301.12353,,https://huggingface.co/papers/2301.12353,,,,2301.12353,3,0 Towards Sustainable Learning: Coresets for Data-efficient Deep Learning,"Yu Yang, Hao Kang, Baharan Mirzasoleiman",http://arxiv.org/abs/2306.01244,,https://huggingface.co/papers/2306.01244,,,,2306.01244,3,1 Domain Adaptation for Time Series Under Feature and Label Shifts,"Huan He, Owen Queen, Teddy Koker, Consuelo Cuevas, Theodoros Tsiligkaridis, Marinka Zitnik",http://arxiv.org/abs/2302.03133,,https://huggingface.co/papers/2302.03133,,,,2302.03133,6,0 Can Neural Network Memorization Be Localized?,"Pratyush Maini, Michael Mozer, Hanie Sedghi, Zachary Lipton, Zico Kolter, Chiyuan Zhang",,,,,,,,, Differentiable Tree Operations Promote Compositional Generalization,"Paul Soulos, Edward Hu, Kate McCurdy, Yunmo Chen, Roland Fernandez, Paul Smolensky, Jianfeng Gao",http://arxiv.org/abs/2306.00751,,https://huggingface.co/papers/2306.00751,,,,2306.00751,7,0 Autoregressive Diffusion Model for Graph Generation,"Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, Chao Zhang",,,,,,,,, Optimistic Planning by Regularized Dynamic Programming,"Antoine Moulin, Gergely Neu",http://arxiv.org/abs/2302.14004,,https://huggingface.co/papers/2302.14004,,,,2302.14004,2,0 Unconstrained Online Learning with Unbounded Losses,"Andrew Jacobsen, Ashok Cutkosky",http://arxiv.org/abs/2306.04923,,https://huggingface.co/papers/2306.04923,,,,2306.04923,2,0 Block subsampled randomized Hadamard transform for Nystr ̈om approximation on distributed architectures,"Oleg Balabanov, Laura Grigori, Matthias Beaupère, Victor Lederer",,,,,,,,, Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP,"Jiacheng Guo, Zihao Li, Huazheng Wang, Mengdi Wang, Zhuoran Yang, Xuezhou Zhang",,,,,,,,, NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations,"Yonggan Fu, Ye Yuan, Souvik Kundu, Shang Wu, Shunyao Zhang, Yingyan (Celine) Lin",http://arxiv.org/abs/2306.06359,https://github.com/GATECH-EIC/NeRFool,https://huggingface.co/papers/2306.06359,,,,2306.06359,6,0 Disentangled Multi-Fidelity Deep Bayesian Active Learning,"Dongxia Wu, Ruijia Niu, Matteo Chinazzi, Yian Ma, Rose Yu",http://arxiv.org/abs/2305.04392,,https://huggingface.co/papers/2305.04392,,,,2305.04392,5,1 GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models,"Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai",,,,,,,,, Adaptive Estimation of Graphical Models under Total Positivity,"Jiaxi Ying, Jose Vinicius de Miranda Cardoso, Daniel Palomar",http://arxiv.org/abs/2210.15471,,https://huggingface.co/papers/2210.15471,,,,2210.15471,3,0 The Dormant Neuron Phenomenon in Deep Reinforcement Learning,"Ghada Sokar, Rishabh Agarwal, Pablo Samuel Castro, Utku Evci",http://arxiv.org/abs/2302.12902,,https://huggingface.co/papers/2302.12902,,,,2302.12902,4,1 Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning,"Sam Lobel, Akhil Bagaria, George Konidaris",http://arxiv.org/abs/2306.03186,,https://huggingface.co/papers/2306.03186,,,,2306.03186,3,0 Are labels informative in semi-supervised learning? Estimating and leveraging the missing-data mechanism.,"Aude Sportisse, Hugo Schmutz, Olivier HUMBERT, Charles Bouveyron, Pierre-Alexandre Mattei",,,,,,,,, Memory-Based Dual Gaussian Processes for Sequential Learning,"Paul Chang, Prakhar Verma, ST John, Arno Solin, Khan Emtiyaz",http://arxiv.org/abs/2306.03566,,https://huggingface.co/papers/2306.03566,,,,2306.03566,5,0 OCD: Learning to Overfit with Conditional Diffusion Models,"Shahar Lutati, Lior Wolf",http://arxiv.org/abs/2210.00471,https://github.com/ShaharLutatiPersonal/OCD,https://huggingface.co/papers/2210.00471,,,,2210.00471,2,0 Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series,"Abdul Fatir Ansari, Alvin Heng, Andre Lim, Harold Soh",http://arxiv.org/abs/2301.11308,,https://huggingface.co/papers/2301.11308,,,,2301.11308,4,1 Conditions and Assumptions for Constraint-based Causal Structure Learning,"Kayvan Sadeghi, Terry Soo",http://arxiv.org/abs/2103.13521,,https://huggingface.co/papers/2103.13521,,,,2103.13521,2,0 Efficient RL via Disentangled Environment and Agent Representations,"Kevin Gmelin, Shikhar Bahl, Russell Mendonca, Deepak Pathak",,,,,,,,, Returning The Favour: When Regression Benefits From Probabilistic Causal Knowledge,"Shahine Bouabid, Jake Fawkes, Dino Sejdinovic",http://arxiv.org/abs/2301.11214,,https://huggingface.co/papers/2301.11214,,,,2301.11214,3,0 Simple MViT: A Hierarchical Vision Transformer without the Bells-and-Whistles,"Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer",,,,,,,,, SparseGPT: Massive Language Models Can be Accurately Pruned in One-Shot,"Elias Frantar, Dan Alistarh",http://arxiv.org/abs/2301.00774,https://github.com/IST-DASLab/sparsegpt,https://huggingface.co/papers/2301.00774,,,,2301.00774,2,0 "Generalization on the Unseen, Logic Reasoning and Degree Curriculum","Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Kevin Rizk",http://arxiv.org/abs/2301.13105,,https://huggingface.co/papers/2301.13105,,,,2301.13105,4,1 Fourmer: An Efficient Global Modeling Paradigm for Image Restoration,"Man Zhou, Jie Huang, Chunle Guo, Chongyi Li, Chongyi Li",,,,,,,,, Dynamics-inspired Neuromorphic Visual Representation Learning,"Zhengqi Pei, Shuhui Wang",,,,,,,,, Facial Expression Recognition with Adaptive Frame Rate based on Multiple Testing Correction,Andrey Savchenko,,,,,,,,, Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning,"Christopher Choquette-Choo, Hugh B McMahan, J K Rush, Abhradeep Guha Thakurta",http://arxiv.org/abs/2211.06530,,https://huggingface.co/papers/2211.06530,,,,2211.06530,4,1 Fast Private Kernel Density Estimation via Locality Sensitive Quantization,"Tal Wagner, Yonatan Naamad, Nina Mishra",,,,,,,,, Bayesian Design Principles for Frequentist Sequential Learning,"Yunbei Xu, Assaf Zeevi",,,,,,,,, Simplex Random Features,"Isaac Reid, Krzysztof Choromanski, Valerii Likhosherstov, Adrian Weller",http://arxiv.org/abs/2301.13856,,https://huggingface.co/papers/2301.13856,,,,2301.13856,4,0 Tighter Information-Theoretic Generalization Bounds from Supersamples,"Ziqiao Wang, Yongyi Mao",http://arxiv.org/abs/2302.02432,,https://huggingface.co/papers/2302.02432,,,,2302.02432,2,0 How Bad is Top-$K$ Recommendation under Competing Content Creators?,"Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu",,,,,,,,, Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks,"Mohammed Nowaz Rabbani Chowdhury, Shuai Zhang, Meng Wang, Sijia Liu, Pin-Yu Chen",http://arxiv.org/abs/2306.04073,,https://huggingface.co/papers/2306.04073,,,,2306.04073,5,0 Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time,"Zichang Liu, Jue Wang, Tri Dao, Tianyi Zhou, Binhang Yuan, Zhao Song, Anshumali Shrivastava, Ce Zhang, Yuandong Tian, Christopher Re, Beidi Chen",,,,,,,,, Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models,"Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Ding, Zhangyang “Atlas” Wang",,,,,,,,, Learning GFlowNets From Partial Episodes For Improved Convergence And Stability,"Kanika Madan, Jarrid Rector-Brooks, Maksym Korablyov, Emmanuel Bengio, Moksh Jain, Andrei-Cristian Nica, Tom Bosc, Yoshua Bengio, Nikolay Malkin",http://arxiv.org/abs/2209.12782,,https://huggingface.co/papers/2209.12782,,,,2209.12782,9,1 Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression,"Yihao Xue, Siddharth Joshi, Eric Gan, Pin-Yu Chen, Baharan Mirzasoleiman",http://arxiv.org/abs/2305.16536,,https://huggingface.co/papers/2305.16536,,,,2305.16536,5,0 Denoising MCMC for Accelerating Diffusion-Based Generative Models,"Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2209.14593,https://github.com/1202kbs/DMCMC,https://huggingface.co/papers/2209.14593,,,,2209.14593,2,0 A Fully First-Order Method for Stochastic Bilevel Optimization,"Jeongyeol Kwon, Dohyun Kwon, Stephen Wright, Robert Nowak",http://arxiv.org/abs/2301.10945,,https://huggingface.co/papers/2301.10945,,,,2301.10945,4,0 Inferring Relational Potentials in Interacting Systems,"Armand Comas, Yilun Du, Christian Fernandez Lopez, Sandesh Ghimire, Mario Sznaier, Josh Tenenbaum, Octavia Camps",,,,,,,,, Gaussian Process Priors for Systems of Linear Partial Differential Equations with Constant Coefficients,"Marc Harkonen, Markus Lange-Hegermann, Bogdan Raita",http://arxiv.org/abs/2212.14319,,https://huggingface.co/papers/2212.14319,,,,2212.14319,3,1 Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere,"Boris Bonev, Thorsten Kurth, Christian Hundt, Jaideep Pathak, Maximilian Baust, Karthik Kashinath, Anima Anandkumar",http://arxiv.org/abs/2306.03838,,https://huggingface.co/papers/2306.03838,,,,2306.03838,7,1 Meta-learning based Adaptive Stability Certificates in Dynamical Systems,"Amit Jena, Dileep Kalathil, Le Xie",,,,,,,,, Nonparametric Extensions of Randomized Response for Private Confidence Sets,"Ian Waudby-Smith, Steven Wu, Aaditya Ramdas",http://arxiv.org/abs/2202.08728,,https://huggingface.co/papers/2202.08728,,,,2202.08728,3,0 TRAK: Understanding Model Predictions at Scale,"Sung Min (Sam) Park, Kristian Georgiev, Andrew Ilyas, Guillaume Leclerc, Aleksander Madry",,,,,,,,, Data Feedback Loops: Model-driven Amplification of Dataset Biases,"Rohan Taori, Tatsunori Hashimoto",http://arxiv.org/abs/2209.03942,https://github.com/rtaori/data_feedback,https://huggingface.co/papers/2209.03942,,,,2209.03942,2,2 Fast Inference from Transformers via Speculative Decoding,"Yaniv Leviathan, Matan Kalman, Yossi Matias",http://arxiv.org/abs/2211.17192,,https://huggingface.co/papers/2211.17192,,,,2211.17192,3,0 Learning Mixtures of Markov Chains and MDPs,"Chinmaya Kausik, Kevin Tan, Ambuj Tewari",http://arxiv.org/abs/2211.09403,,https://huggingface.co/papers/2211.09403,,,,2211.09403,3,0 A Watermark for Large Language Models,"John Kirchenbauer, Jonas Geiping, Yuxin Wen, Jonathan Katz, Ian Miers, Tom Goldstein",http://arxiv.org/abs/2301.10226,https://github.com/jwkirchenbauer/lm-watermarking,https://huggingface.co/papers/2301.10226,https://huggingface.co/spaces/tomg-group-umd/lm-watermarking,,,2301.10226,6,3 On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness,"Haotian Ye, Xiaoyu Chen, Liwei Wang, Simon Du",http://arxiv.org/abs/2210.10464,,https://huggingface.co/papers/2210.10464,,,,2210.10464,4,1 Diffusion Models are Minimax Optimal Distribution Estimators,"Kazusato Oko, Shunta Akiyama, Taiji Suzuki",http://arxiv.org/abs/2303.01861,,https://huggingface.co/papers/2303.01861,,,,2303.01861,3,0 AdaBoost is not an Optimal Weak to Strong Learner,"Mikael Møller Høgsgaard, Mikael Høgsgaard, Kasper Green Larsen, Martin Ritzert",http://arxiv.org/abs/2301.11571,,https://huggingface.co/papers/2301.11571,,,,2301.11571,3,1 Exponential Smoothing for Off-Policy Learning,"Imad AOUALI, Victor-Emmanuel Brunel, David Rohde, Anna Korba",http://arxiv.org/abs/2305.15877,,https://huggingface.co/papers/2305.15877,,,,2305.15877,4,0 On the Statistical Benefits of Temporal Difference Learning,"David Cheikhi, Daniel Russo",http://arxiv.org/abs/2301.13289,,https://huggingface.co/papers/2301.13289,,,,2301.13289,2,0 Bayes-optimal Learning of Deep Random Networks of Extensive-width,"Hugo Cui, FLORENT KRZAKALA, Lenka Zdeborova",,,,,,,,, Adapting to game trees in zero-sum imperfect information games,"Côme Fiegel, Pierre Menard, Tadashi Kozuno, Remi Munos, Vianney Perchet, Michal Valko",http://arxiv.org/abs/2212.12567,,https://huggingface.co/papers/2212.12567,,,,2212.12567,6,1 Adversarial Policies Beat Superhuman Go AIs,"Tony Wang, Adam Gleave, Tom Tseng, Nora Belrose, Kellin Pelrine, Joseph Miller, Michael Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell",http://arxiv.org/abs/2211.00241,,https://huggingface.co/papers/2211.00241,,,,2211.00241,11,2 Pretraining Language Models with Human Preferences,"Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher Buckley, Jason Phang, Samuel Bowman, Ethan Perez",http://arxiv.org/abs/2302.08582,,https://huggingface.co/papers/2302.08582,,,,2302.08582,8,2 Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples,"Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui XUE, Ruhui Ma, Haibing Guan",http://arxiv.org/abs/2302.04578,https://github.com/mist-project/mist.git,https://huggingface.co/papers/2302.04578,,,,2302.04578,9,0 A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs,"Mikael Henaff, Minqi Jiang, Roberta Raileanu",http://arxiv.org/abs/2306.03236,,https://huggingface.co/papers/2306.03236,,,,2306.03236,3,0 Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies,"Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai",http://arxiv.org/abs/2208.10264,,https://huggingface.co/papers/2208.10264,,,,2208.10264,3,1 Best of Both Worlds Policy Optimization,"Christoph Dann, Chen-Yu Wei, Julian Zimmert",http://arxiv.org/abs/2302.09408,,https://huggingface.co/papers/2302.09408,,,,2302.09408,3,0 Raising the Cost of Malicious AI-Powered Image Editing,"Hadi Salman, Alaa Khaddaj, Guillaume Leclerc, Andrew Ilyas, Aleksander Madry",http://arxiv.org/abs/2302.06588,,https://huggingface.co/papers/2302.06588,,,,2302.06588,5,1 The Price of Differential Privacy under Continual Observation,"Satchit Sivakumar, Sofya Raskhodnikova, Palak Jain, Adam Smith",http://arxiv.org/abs/2112.00828,,https://huggingface.co/papers/2112.00828,,,,2112.00828,4,0 Differentially Private Hierarchical Clustering with Provable Approximation Guarantees,"Jacob Imola, Alessandro Epasto, Mohammad Mahdian, Vincent Cohen-Addad, Vahab Mirrokni",,,,,,,,, Buying Information for Stochastic Optimization,"Mingchen Ma, Christos Tzamos",http://arxiv.org/abs/2306.03607,,https://huggingface.co/papers/2306.03607,,,,2306.03607,2,0 Towards Theoretical Understanding of Inverse Reinforcement Learning,"Alberto Maria Metelli, Filippo Lazzati, Marcello Restelli",http://arxiv.org/abs/2304.12966,,https://huggingface.co/papers/2304.12966,,,,2304.12966,3,0 Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond,"Jaeyoung Cha, Jaewook Lee, Chulhee Yun",http://arxiv.org/abs/2303.07160,,https://huggingface.co/papers/2303.07160,,,,2303.07160,3,0 Delayed Feedback in Kernel Bandits,"Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke",http://arxiv.org/abs/2302.00392,,https://huggingface.co/papers/2302.00392,,,,2302.00392,4,0 Sharper Bounds for $\ell_p$ Sensitivity Sampling,"David Woodruff, Taisuke Yasuda",http://arxiv.org/abs/2306.00732,,https://huggingface.co/papers/2306.00732,,,,2306.00732,2,1 Hyena Hierarchy: Towards Larger Convolutional Language Models,"Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Re",http://arxiv.org/abs/2302.10866,,https://huggingface.co/papers/2302.10866,,,,2302.10866,9,2 Delving into Noisy Label Detection with Clean Data,"Chenglin Yu, Xinsong Ma, Weiwei Liu",,,,,,,,, GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration,"Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",http://arxiv.org/abs/2301.12686,,https://huggingface.co/papers/2301.12686,,,,2301.12686,7,1 Information-Theoretic State Space Model for Multi-View Reinforcement Learning,"HyeongJoo Hwang, Seokin Seo, Youngsoo Jang, Sungyoon Kim, Geon-Hyeong Kim, Seunghoon Hong, Kee-Eung Kim",,,,,,,,, Learning Control-Oriented Dynamical Structure from Data,"Spencer M. Richards, Jean-Jacques Slotine, Navid Azizan, Marco Pavone",http://arxiv.org/abs/2302.02529,,https://huggingface.co/papers/2302.02529,,,,2302.02529,4,0 Multicalibration as Boosting for Regression,"Ira Globus-Harris, Declan Harrison, Michael Kearns, Aaron Roth, Jessica Sorrell",http://arxiv.org/abs/2301.13767,https://github.com/Declancharrison/Level-Set-Boosting,https://huggingface.co/papers/2301.13767,,,,2301.13767,5,0 Towards Reliable Neural Specifications,"Chuqin Geng, Van Nham Le, Xiaojie Xu, Zhaoyue Wang, Arie Gurfinkel, Xujie Si",http://arxiv.org/abs/2210.16114,,https://huggingface.co/papers/2210.16114,,,,2210.16114,6,0 Spherical Inducing Features for Orthogonally-Decoupled Gaussian Processes,"Louis Chi-Chun Tiao, Vincent Dutordoir, Victor Picheny",http://arxiv.org/abs/2304.14034,,https://huggingface.co/papers/2304.14034,,,,2304.14034,3,0 Provably Learning Object-Centric Representations,"Jack Brady, Roland Zimmermann, Yash Sharma, Bernhard Schölkopf, Julius von Kügelgen, Wieland Brendel",http://arxiv.org/abs/2305.14229,,https://huggingface.co/papers/2305.14229,,,,2305.14229,6,0 Equivariant Architectures for Learning in Deep Weight Spaces,"Aviv Navon, Aviv Shamsian, Idan Achituve, Ethan Fetaya, Gal Chechik, Haggai Maron",http://arxiv.org/abs/2301.12780,,https://huggingface.co/papers/2301.12780,,,,2301.12780,6,1 Difference of submodular minimization via DC programming,"Marwa El Halabi, George Orfanides, Tim Hoheisel",http://arxiv.org/abs/2305.11046,,https://huggingface.co/papers/2305.11046,,,,2305.11046,3,1 Practical and Matching Gradient Variance Bounds for Black-Box Variational Bayesian Inference,"Kyurae Kim, Kaiwen Wu, Jisu Oh, Jacob Gardner",http://arxiv.org/abs/2303.10472,,https://huggingface.co/papers/2303.10472,,,,2303.10472,4,1 Pre-training for Speech Translation: CTC Meets Optimal Transport,"Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab",http://arxiv.org/abs/2301.11716,https://github.com/formiel/fairseq,https://huggingface.co/papers/2301.11716,,,,2301.11716,6,1 StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis,"Axel Sauer, Axel Sauer, Tero Karras, Samuli Laine, Andreas Geiger, Timo Aila",,,,,,,,, BEATs: Audio Pre-Training with Acoustic Tokenizers,"Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Daniel Tompkins, Zhuo Chen, Wanxiang Che, Xiangzhan Yu, Furu Wei",http://arxiv.org/abs/2212.09058,,https://huggingface.co/papers/2212.09058,,,,2212.09058,7,0 "Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language","Alexei Baevski, Arun Babu, Wei-Ning Hsu, Michael Auli",http://arxiv.org/abs/2212.07525,,https://huggingface.co/papers/2212.07525,,,,2212.07525,4,1 " Mu$^2$SLAM: Multitask, Multilingual Speech and Language Models","Yong Cheng, Yu Zhang, Melvin Johnson, Wolfgang Macherey, Ankur Bapna",,,,,,,,, ODS: Test-Time Adaptation in the Presence of Open-World Data Shift,"Zhi Zhou, Lan-Zhe Guo, Lin-Han Jia, Dingchu Zhang, Yu-Feng Li",,,,,,,,, "Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch","Xunyi Zhao, Théotime Le Hellard, Lionel Eyraud-Dubois, Julia Gusak, Olivier Beaumont",,,,,,,,, Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization,"Chanyeong Kim, Jongwoong Park, Hyunglip Bae, Woo Chang Kim",,,,,,,,, Resurrecting Recurrent Neural Networks for Long Sequences,"Antonio Orvieto, Samuel Smith, Albert Gu, Anushan Fernando, Caglar Gulchere, Razvan Pascanu, Soham De",http://arxiv.org/abs/2303.06349,,https://huggingface.co/papers/2303.06349,,,,2303.06349,7,0 Tilted Sparse Additive Models,"Yingjie Wang, Hong Chen, Weifeng Liu, Fengxiang He, Tieliang Gong, YouCheng Fu, Dacheng Tao",,,,,,,,, Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape,"Yan Sun, Li Shen, Shixiang Chen, Liang Ding, Dacheng Tao",,,,,,,,, Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Count,"Jonathan Hehir, Jonathan Heier, Daniel Ting, Graham Cormode",,,,,,,,, AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners,"Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo",http://arxiv.org/abs/2302.01877,,https://huggingface.co/papers/2302.01877,,,,2302.01877,6,0 Reinforcement Learning from Passive Data via Latent Intentions,"Dibya Ghosh, Chethan Bhateja, Sergey Levine",http://arxiv.org/abs/2304.04782,,https://huggingface.co/papers/2304.04782,,,,2304.04782,3,0 Hierarchies of Reward Machines,"Daniel Furelos-Blanco, Mark Law, Anders Jonsson, Krysia Broda, Alessandra Russo",http://arxiv.org/abs/2205.15752,,https://huggingface.co/papers/2205.15752,,,,2205.15752,5,1 Calibrating Multimodal Learning,"Huan Ma, qingyang zhang, Changqing Zhang, Bingzhe Wu, Huazhu Fu, Joey Tianyi Zhou, Qinghua Hu",http://arxiv.org/abs/2306.01265,,https://huggingface.co/papers/2306.01265,,,,2306.01265,6,0 Cones: Concept Neurons in Diffusion Models for Customized Generation,"Zhiheng Liu, Ruili Feng, Kai Zhu, Yifei Zhang, Kecheng Zheng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao",http://arxiv.org/abs/2303.05125,,https://huggingface.co/papers/2303.05125,,,,2303.05125,9,2 Continuation Path Learning for Homotopy Optimization,"Xi Lin, Zhiyuan Yang, Xiaoyuan Zhang, Qingfu Zhang",,,,,,,,, "Inflow, Outflow, and Reciprocity in Machine Learning","Mukund Sundararajan, Walid Krichene",,,,,,,,, Transformers Learn In-Context by Gradient Descent,"Johannes Von Oswald, Eyvind Niklasson, Ettore Randazzo, Joao Sacramento, Alexander Mordvintsev, Andrey Zhmoginov, Max Vladymyrov",http://arxiv.org/abs/2212.07677,https://github.com/google-research/self-organising-systems/tree/master/transformers_learn_icl_by_gd,https://huggingface.co/papers/2212.07677,,,,2212.07677,7,1 When Personalization Harms Performance: Reconsidering the Use of Group Attributes in Prediction,"Vinith Suriyakumar, Marzyeh Ghassemi, Berk Ustun",,,,,,,,, Evaluating Self-Supervised Learning via Risk Decomposition,"Yann Dubois, Tatsunori Hashimoto, Percy Liang",http://arxiv.org/abs/2302.03068,https://github.com/YannDubs/SSL-Risk-Decomposition,https://huggingface.co/papers/2302.03068,,,,2302.03068,3,1 Graphically Structured Diffusion Models,"Christian Weilbach, William Harvey, Frank Wood",http://arxiv.org/abs/2210.11633,,https://huggingface.co/papers/2210.11633,,,,2210.11633,3,1 Semi Bandit dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees.,"Ioannis Panageas, EFSTRATIOS PANTELEIMON SKOULAKIS, Luca Viano, Xiao Wang, Volkan Cevher",,,,,,,,, DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature,"Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher Manning, Chelsea Finn",http://arxiv.org/abs/2301.11305,,https://huggingface.co/papers/2301.11305,,,,2301.11305,5,1 RankMe: Assessing the Downstream Performance of Pretrained Self-Supervised Representations by Their Rank,"Quentin Garrido, Randall Balestriero, Laurent Najman, Yann LeCun",http://arxiv.org/abs/2210.02885,,https://huggingface.co/papers/2210.02885,,,,2210.02885,4,1 Self-Interpretable Time Series Prediction with Counterfactual Explanations,"Jingquan Yan, Hao Wang",http://arxiv.org/abs/2306.06024,,https://huggingface.co/papers/2306.06024,,,,2306.06024,2,0 H-Likelihood Approach to Deep Neural Networks with Temporal-Spatial Random Effects for High-Cardinality Categorical Features,"Hangbin Lee, Youngjo Lee",,,,,,,,, Uncertain Evidence in Probabilistic Models and Stochastic Simulators,"Andreas Munk, Alexander Mead, Frank Wood",http://arxiv.org/abs/2210.12236,,https://huggingface.co/papers/2210.12236,,,,2210.12236,3,1 On the Effectiveness of Offline RL for Dialogue Response Generation,"Paloma Sodhi, Felix Wu, Ethan Elenberg, Kilian Weinberger, Ryan Mcdonald",,,,,,,,, Multi-Environment Pretraining Enables Transfer to Action Limited Datasets,"David Venuto, Mengjiao Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum",http://arxiv.org/abs/2211.13337,,https://huggingface.co/papers/2211.13337,,,,2211.13337,6,0 Tractable Control for Auto-regressive Language Generation,"Honghua Zhang, Meihua Dang, Nanyun Peng, Guy Van den Broeck",,,,,,,,, Stratified Adversarial Robustness with Rejection,"Jiefeng Chen, Jayaram Raghuram, Jihye Choi, Xi Wu, Yingyiu Liang, Somesh Jha",http://arxiv.org/abs/2305.01139,,https://huggingface.co/papers/2305.01139,,,,2305.01139,6,0 Concept-based Explanations for Out-of-Distribution Detectors,"Jihye Choi, Jayaram Raghuram, Ryan Feng, Jiefeng Chen, Somesh Jha, Atul Prakash",http://arxiv.org/abs/2203.02586,,https://huggingface.co/papers/2203.02586,,,,2203.02586,6,1 SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge,"Mahdi Nikdan, Tommaso Pegolotti, Eugenia Iofinova, Eldar Kurtic, Dan Alistarh",,,,,,,,, Achieving Linear Speedup in Non-IID Federated Bilevel Learning,"Minhui Huang, Dewei Zhang, Kaiyi Ji",http://arxiv.org/abs/2302.05412,,https://huggingface.co/papers/2302.05412,,,,2302.05412,3,0 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation,"Yiming Cui, Linjie Yang, Haichao Yu",,,,,,,,, Function-Space Regularization in Neural Networks: A Probabilistic Perspective,"Tim G. J. Rudner, Sanyam Kapoor, Shikai Qiu, Andrew Wilson",,,,,,,,, Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models,"Matthew Muckley, Alaaeldin El-Nouby, Karen Ullrich, Herve Jegou, Jakob Verbeek",http://arxiv.org/abs/2301.11189,,https://huggingface.co/papers/2301.11189,,,,2301.11189,5,2 Retrieval-Augmented Multimodal Language Modeling,"Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Richard James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Scott Yih",http://arxiv.org/abs/2211.12561,,https://huggingface.co/papers/2211.12561,,,,2211.12561,9,1 A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback,"Guanyu Nie, Yididiya Nadew, Yanhui Zhu, Vaneet Aggarwal, Christopher J Quinn",http://arxiv.org/abs/2301.13326,,https://huggingface.co/papers/2301.13326,,,,2301.13326,5,1 NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion,"Jiatao Gu, Alex Trevithick, Kai-En Lin, Joshua M Susskind, Christian Theobalt, Lingjie Liu, Ravi Ramamoorthi",http://arxiv.org/abs/2302.10109,,https://huggingface.co/papers/2302.10109,,,,2302.10109,7,1 Improving Bi-level Optimization Based Methods with Inspiration from Humans' Classroom Study Techniques,Pengtao Xie,,,,,,,,, Learning Compiler Pass Orders using Coreset and Normalized Value Prediction,"Youwei Liang, Kevin Stone, Ali Shameli, Chris Cummins, Mostafa Elhoushi, Jiadong Guo, Benoit Steiner, Xiaomeng Yang, Pengtao Xie, Hugh Leather, Yuandong Tian",http://arxiv.org/abs/2301.05104,,https://huggingface.co/papers/2301.05104,,,,2301.05104,11,2 Learning useful representations for shifting tasks and distributions,"Jianyu Zhang, Leon Bottou",http://arxiv.org/abs/2212.07346,,https://huggingface.co/papers/2212.07346,,,,2212.07346,2,1 Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization,"Alexandre Rame, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Leon Bottou, David Lopez-Paz",http://arxiv.org/abs/2212.10445,,https://huggingface.co/papers/2212.10445,,,,2212.10445,6,1 Target-based Surrogates for Stochastic Optimization,"Jonathan Lavington, Sharan Vaswani, Reza Babanezhad, Mark Schmidt, Nicolas Le Roux",http://arxiv.org/abs/2302.02607,,https://huggingface.co/papers/2302.02607,,,,2302.02607,5,0 NNSplitter: An Active Defense Solution for DNN Model via Automated Weight Obfuscation,"Tong Zhou, Yukui Luo, Shaolei Ren, Xiaolin Xu",http://arxiv.org/abs/2305.00097,https://github.com/Tongzhou0101/NNSplitter,https://huggingface.co/papers/2305.00097,,,,2305.00097,4,0 Sketched Ridgeless Linear Regression: The Role of Downsampling,"Xin Chen, Yicheng Zeng, Siyue Yang, Qiang Sun",http://arxiv.org/abs/2302.01088,,https://huggingface.co/papers/2302.01088,,,,2302.01088,4,0 Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation,"Siddharth Nagar Nayak, Kenneth Choi, Wenqi Ding, Sydney Dolan, Karthik Gopalakrishnan, Hamsa Balakrishnan",http://arxiv.org/abs/2211.02127,https://github.com/nsidn98/InforMARL,https://huggingface.co/papers/2211.02127,,,,2211.02127,6,1 POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models,"Korawat Tanwisuth, Shujian Zhang, Huangjie Zheng, Pengcheng He, Mingyuan Zhou",http://arxiv.org/abs/2305.00350,,https://huggingface.co/papers/2305.00350,,,,2305.00350,5,0 Towards Omni-generalizable Neural Methods for Vehicle Routing Problems,"Jianan Zhou, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang",http://arxiv.org/abs/2305.19587,https://github.com/RoyalSkye/Omni-VRP,https://huggingface.co/papers/2305.19587,,,,2305.19587,5,0 Protecting Language Generation Models via Invisible Watermarking,"Xuandong Zhao, Yu-Xiang Wang, Lei Li",http://arxiv.org/abs/2302.03162,,https://huggingface.co/papers/2302.03162,,,,2302.03162,3,1 Global Optimization with Parametric Function Approximation,"Chong Liu, Yu-Xiang Wang",http://arxiv.org/abs/2211.09100,,https://huggingface.co/papers/2211.09100,,,,2211.09100,2,0 Non-stationary Reinforcement Learning under General Function Approximation,"Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin LIANG",http://arxiv.org/abs/2306.00861,,https://huggingface.co/papers/2306.00861,,,,2306.00861,6,1 Demystifying Disagreement-on-the-Line in High Dimensions,"Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed Hassani",http://arxiv.org/abs/2301.13371,,https://huggingface.co/papers/2301.13371,,,,2301.13371,5,0 Multisample Flow Matching: Straightening Flows with Minibatch Couplings,"Aram-Alexandre Pooladian, Heli Ben-Hamu, Carles Domingo i Enrich, Brandon Amos, Yaron Lipman, Ricky T. Q. Chen",http://arxiv.org/abs/2304.14772,,https://huggingface.co/papers/2304.14772,,,,2304.14772,6,1 Competitive Gradient Optimization,"Abhijeet Vyas, Brian Bullins, Kamyar Azizzadenesheli",http://arxiv.org/abs/2205.14232,,https://huggingface.co/papers/2205.14232,,,,2205.14232,2,0 Magneto: A Foundation Transformer,"Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei",,,,,,,,, Robustly Learning a Single Neuron via Sharpness,"Puqian Wang, Nikos Zarifis, Ilias Diakonikolas, Jelena Diakonikolas",http://arxiv.org/abs/2306.07892,,https://huggingface.co/papers/2306.07892,,,,2306.07892,4,0 Offline Meta Reinforcement Learning with In-Distribution Online Adaptation,"Jianhao Wang, Jin Zhang, Haozhe Jiang, Junyu Zhang, Liwei Wang, Chongjie Zhang",http://arxiv.org/abs/2305.19529,,https://huggingface.co/papers/2305.19529,,,,2305.19529,6,0 Strategic Classification with Unknown User Manipulations,"Tosca Lechner, Ruth Urner, Shai Ben-David",,,,,,,,, Pruning via Sparsity-indexed ODE: a Continuous Sparsity Viewpoint,"Zhanfeng Mo, Haosen Shi, Sinno Jialin Pan",,,,,,,,, Neural Network Accelerated Implicit Filtering: Integrating Neural Network Surrogates With Provably Convergent Derivative Free Optimization Methods,"Brian Irwin, Eldad Haber, Raviv Gal, Avi Ziv",,,,,,,,, Bayesian online change point detection with Hilbert space approximate Student-t process,"Jeremy Sellier, Petros Dellaportas",,,,,,,,, Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation,"Ronghao Dang, Lu Chen, Liuyi Wang, Zongtao He, Chengju Liu, Qijun Chen",http://arxiv.org/abs/2302.01520,,https://huggingface.co/papers/2302.01520,,,,2302.01520,6,0 Exploring the Benefits of Training Expert Language Models over Instruction Tuning,"Joel Jang, Seungone Kim, Seonghyeon Ye, Doyoung Kim, Lajanugen Logeswaran, Moontae Lee, Kyungjae Lee, Minjoon Seo",http://arxiv.org/abs/2302.03202,https://github.com/joeljang/ELM,https://huggingface.co/papers/2302.03202,,,,2302.03202,8,1 Feature Expansion for Graph Neural Networks,"Jiaqi Sun, Lin Zhang, Guangyi Chen, Peng XU, Kun Zhang, Yujiu Yang",http://arxiv.org/abs/2305.06142,,https://huggingface.co/papers/2305.06142,,,,2305.06142,6,1 LEVER: Learning to Verify Language-to-Code Generation with Execution,"Ansong Ni, Srinivasan Iyer, Dragomir Radev, Veselin Stoyanov, Scott Yih, Sida Wang, Xi Lin",http://arxiv.org/abs/2302.08468,,https://huggingface.co/papers/2302.08468,,,,2302.08468,7,2 Multi-species multi-task benchmark for learned representations of behavior,"Jennifer J. Sun, Markus Marks, Andrew Ulmer, Dipam Chakraborty, Brian Geuther, Edward Hayes, Heng Jia, Vivek Kumar, Sebastian Oleszko, Zachary Partridge, Milan Peelman, Alice Robie, Catherine Schretter, Keith Sheppard, Chao Sun, Param Uttarwar, Julian Wagner, Erik Werner, Joseph Parker, Pietro Perona, Yisong Yue, Kristin Branson, Ann Kennedy",,,,,,,,, Random Shuffle Transformer for Image Restoration,"Jie Xiao, Xueyang Fu, Man Zhou, Hongjian Liu, Zheng-Jun Zha",,,,,,,,, Active Ranking of Experts Based on their Performances in Many Tasks,"El Mehdi Saad, Nicolas Verzelen, Alexandra Carpentier",http://arxiv.org/abs/2306.02628,,https://huggingface.co/papers/2306.02628,,,,2306.02628,3,0 Quantifying the Variability Collapse of Neural Networks,"Jing Xu, Haoxiong Liu",http://arxiv.org/abs/2306.03440,,https://huggingface.co/papers/2306.03440,,,,2306.03440,2,0 Personalized Subgraph Federated Learning,"Jinheon Baek, Wonyong Jeong, Jiongdao Jin, Jaehong Yoon, Sung Ju Hwang",http://arxiv.org/abs/2206.10206,https://github.com/JinheonBaek/FED-PUB,https://huggingface.co/papers/2206.10206,,,,2206.10206,5,0 Whose Opinions Do Language Models Reflect?,"Shibani Santurkar, Cinoo Lee, Esin Durmus, Faisal Ladhak, Tatsunori Hashimoto, Percy Liang",http://arxiv.org/abs/2303.17548,https://github.com/tatsu-lab/opinions_qa,https://huggingface.co/papers/2303.17548,,,,2303.17548,6,1 Flexible Model Aggregation for Quantile Regression,"Rasool Fakoor, Taesup Kim, Jonas Mueller, Alexander Smola, Ryan Tibshirani",http://arxiv.org/abs/2103.00083,,https://huggingface.co/papers/2103.00083,,,,2103.00083,5,1 Faster Rates of Convergence to Stationary Points in Differentially Private Optimization,"Raman Arora, Raef Bassily, Tomás González, Cristobal Guzman, Michael Menart, Enayat Ullah",http://arxiv.org/abs/2206.00846,,https://huggingface.co/papers/2206.00846,,,,2206.00846,6,0 From Adaptive Query Release to Machine Unlearning,"Enayat Ullah, Raman Arora",,,,,,,,, Private Federated Learning with Autotuned Compression,"Enayat Ullah, Christopher Choquette-Choo, Peter Kairouz, Sewoong Oh",,,,,,,,, Probabilistic Imputation for Time-series Classification with Missing Data,"SeungHyun Kim, Hyunsu Kim, Eunggu Yun, Hwangrae Lee, Jaehun Lee, Juho Lee",,,,,,,,, Certifying Ensembles: A General Certification Theory with S-Lipschitzness,"Aleksandar Petrov, Francisco Eiras, Amartya Sanyal, Phil Torr, Adel Bibi",,,,,,,,, Demystifying Uneven Vulnerability of Link Stealing Attacks against Graph Neural Networks,"He Zhang, Bang Wu, Shuo Wang, Xiangwen Yang, Minhui Xue, Shirui Pan, Xingliang YUAN",,,,,,,,, Bayesian Estimation of Differential Privacy,"Santiago Zanella-Beguelin, Lukas Wutschitz, Shruti Tople, Ahmed Salem, Victor Ruehle, Andrew Paverd, Mohammad Naseri, Boris Köpf, Dan Jones",http://arxiv.org/abs/2206.05199,,https://huggingface.co/papers/2206.05199,,,,2206.05199,9,3 N$\text{A}^\text{2}$Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning,"Zichuan Liu, Yuanyang Zhu, Chunlin Chen",,,,,,,,, Behavior Contrastive Learning for Unsupervised Skill Discovery,"Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li",http://arxiv.org/abs/2305.04477,,https://huggingface.co/papers/2305.04477,,,,2305.04477,8,1 End-to-End Multi-Object Detection with a Regularized Mixture Model,"Jaeyoung Yoo, Hojun Lee, Seunghyeon Seo, Inseop Chung, NOJUN KWAK",http://arxiv.org/abs/2205.08714,,https://huggingface.co/papers/2205.08714,,,,2205.08714,5,0 Nonlinear Causal Discovery with Latent Confounders,"David Kaltenpoth, Jilles Vreeken",,,,,,,,, Global optimality for Euclidean CCCP under Riemannian convexity,"Melanie Weber, Suvrit Sra",,,,,,,,, Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains,"Vishwaraj Doshi, Jie Hu, Do-Young Eun",,,,,,,,, Distribution-dependent McDiarmid-type Inequalities for Functions of Unbounded Interaction,"Shaojie Li, Yong Liu",,,,,,,,, Identifying Interpretable Subspaces in Image Representations,"Neha Mukund Kalibhat, Shweta Bhardwaj, C. Bayan Bruss, Hamed Firooz, Maziar Sanjabi, Soheil Feizi",,,,,,,,, LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian Walder, Spiridon Penev, Richard Nock",http://arxiv.org/abs/2301.11695,,https://huggingface.co/papers/2301.11695,,,,2301.11695,4,1 R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,, High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0 COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,2 Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,9 HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,, Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,1 Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0 Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization,"Ameya Velingker, Maximilian Vötsch, David Woodruff, Samson Zhou",,,,,,,,, Exphormer: Sparse Transformers for Graphs,"Hamed Shirzad, Ameya Velingker, Balaji Venkatachalam, Danica J Sutherland, Ali K Sinop",http://arxiv.org/abs/2303.06147,https://github.com/hamed1375/Exphormer,https://huggingface.co/papers/2303.06147,,,,2303.06147,5,1 Multiplier Bootstrap-based Exploration,"Runzhe Wan, Haoyu Wei, Branislav Kveton, Rui Song",http://arxiv.org/abs/2302.01543,,https://huggingface.co/papers/2302.01543,,,,2302.01543,4,0 Sequential Strategic Screening ,"Lee Cohen, Saeed Sharifi-Malvajerdi, Kevin Stangl, Ali Vakilian, Juba Ziani",,,,,,,,, Robust Subtask Learning for Compositional Generalization,"Kishor Jothimurugan, Steve Hsu, Osbert Bastani, Rajeev Alur",http://arxiv.org/abs/2302.02984,,https://huggingface.co/papers/2302.02984,,,,2302.02984,4,0 Hindsight Learning for MDPs with Exogenous Inputs,"Sean R. Sinclair, Felipe Vieira Frujeri, Ching-An Cheng, Luke Marshall, Hugo Barbalho, Jingling Li, Jennifer Neville, Ishai Menache, Adith Swaminathan",http://arxiv.org/abs/2207.06272,,https://huggingface.co/papers/2207.06272,,,,2207.06272,9,0 Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding,"Kenton Lee, Mandar Joshi, Iulia Turc, Hexiang Hu, Fangyu Liu, Julian M Eisenschlos, Urvashi Khandelwal, Peter Shaw, Ming-Wei Chang, Kristina Toutanova",http://arxiv.org/abs/2210.03347,,https://huggingface.co/papers/2210.03347,,,,2210.03347,10,4 Settling the Reward Hypothesis,"John Martin, Michael Bowling, David Abel, Will Dabney",http://arxiv.org/abs/2212.10420,,https://huggingface.co/papers/2212.10420,,,,2212.10420,4,0 The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation,"Mark Rowland, Yunhao Tang, Clare Lyle, Remi Munos, Marc Bellemare, Will Dabney",http://arxiv.org/abs/2305.18388,,https://huggingface.co/papers/2305.18388,,,,2305.18388,6,0 Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition,"Yash Chandak, Shantanu Thakoor, Zhaohan Guo, Yunhao Tang, Remi Munos, Will Dabney, Diana Borsa",http://arxiv.org/abs/2305.00654,,https://huggingface.co/papers/2305.00654,,,,2305.00654,7,0 Bootstrapped Representations in Reinforcement Learning,"Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc Bellemare, Will Dabney",,,,,,,,, Quantile Credit Assignment,"Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Remi Munos",,,,,,,,, Understanding Self-Predictive Learning for Reinforcement Learning,"Yunhao Tang, Zhaohan Guo, Pierre Richemond, Bernardo Avila Pires, Yash Chandak, Remi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, Andras Gyorgy, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko",http://arxiv.org/abs/2212.03319,,https://huggingface.co/papers/2212.03319,,,,2212.03319,16,1 Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning,"Brett Daley, Martha White, Christopher Amato, Marlos C. Machado",http://arxiv.org/abs/2301.11321,,https://huggingface.co/papers/2301.11321,,,,2301.11321,4,0 "For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal","Yingdong Hu, Renhao Wang, Li Li, Yang Gao",http://arxiv.org/abs/2304.04591,,https://huggingface.co/papers/2304.04591,,,,2304.04591,4,1 Weakly Supervised Regression with Interval Targets,"Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, LEI FENG",,,,,,,,, Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents,"WENHAO XU, Xuefeng Gao, Xuedong He",http://arxiv.org/abs/2301.12601,,https://huggingface.co/papers/2301.12601,,,,2301.12601,3,0 Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity,"Xuxing Chen, Minhui Huang, Shiqian Ma, Krishna Balasubramanian",http://arxiv.org/abs/2210.12839,,https://huggingface.co/papers/2210.12839,,,,2210.12839,4,0 Optimal randomized multilevel Monte Carlo for repeatedly nested expectations,"Yasa Syed, Guanyang Wang",http://arxiv.org/abs/2301.04095,,https://huggingface.co/papers/2301.04095,,,,2301.04095,2,1 "Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat","Shantanu Ghosh, Ke Yu, Forough Arabshahi, Kayhan Batmanghelich",,,,,,,,, CrossSplit: Mitigating Label Noise Memorization through Data Splitting,"Jihye Kim, Aristide Baratin, Yan Zhang, Simon Lacoste-Julien",http://arxiv.org/abs/2212.01674,,https://huggingface.co/papers/2212.01674,,,,2212.01674,4,0 On Investigating the Conservative Property of Score-Based Generative Models,"Chen-Hao Chao, Wei-Fang Sun, Bo-Wun Cheng, Chun-Yi Lee",http://arxiv.org/abs/2209.12753,,https://huggingface.co/papers/2209.12753,,,,2209.12753,4,0 "Convex Geometry of ReLU-layers, Injectivity on the Ball and Local Reconstruction","Daniel Haider, Peter Balazs, Martin Ehler",,,,,,,,, Smart Initial Basis Selection for Linear Programs,"Zhenan Fan, Xinglu Wang, Oleksandr Yakovenko, Abdullah Ali Sivas, Owen Ren, Yong Zhang, Zirui Zhou",,,,,,,,, A Study on Transformer Configuration and Training Objective,"Fuzhao Xue, Fuzhao Xue, Jianghai Chen, Aixin Sun, Xiaozhe Ren, Zangwei Zheng, Xiaoxin He, Yongming Chen, Xin Jiang, Yang You",http://arxiv.org/abs/2205.10505,,https://huggingface.co/papers/2205.10505,,,,2205.10505,9,0 Constrained Monotonic Neural Networks,"Davor Runje, Sharath M Shankaranarayana",http://arxiv.org/abs/2205.11775,,https://huggingface.co/papers/2205.11775,,,,2205.11775,2,1 Sample and Predict Your Latent: Modality-free Sequential Disentanglement via Contrastive Estimation,"Ilan Naiman, Nimrod Berman, Omri Azencot",http://arxiv.org/abs/2305.15924,https://github.com/azencot-group/SPYL,https://huggingface.co/papers/2305.15924,,,,2305.15924,3,0 Data Representations' Study of Latent Image Manifolds,"Ilya Kaufman, Omri Azencot",http://arxiv.org/abs/2305.19730,https://github.com/azencot-group/CRLM,https://huggingface.co/papers/2305.19730,,,,2305.19730,2,0 Momentum Ensures Convergence of SIGNSGD under Weaker Assumptions,"Tao Sun, Qingsong Wang, Dongsheng Li, Bao Wang",,,,,,,,, Fair and Optimal Multi-Class Classification via Post-Processing,"Ruicheng Xian, Lang Yin, Han Zhao",,,,,,,,, High-dimensional Clustering onto Hamiltonian Cycle,"Tianyi Huang, Shenghui Cheng, Stan Z Li, Zhengjun Zhang",http://arxiv.org/abs/2304.14531,,https://huggingface.co/papers/2304.14531,,,,2304.14531,4,0 Revisiting Domain Randomization via Relaxed State-Adversarial Policy Optimization,"Yun-Hsuan Lien, Ping-Chun Hsieh, Yu-Shuen Wang",,,,,,,,, Fine-Tuning Language Models via Epistemic Neural Networks,"Ian Osband, Seyed Mohammad Asghari, Benjamin Van Roy, Nat McAleese, John Aslanides, Geoffrey Irving",http://arxiv.org/abs/2211.01568,,https://huggingface.co/papers/2211.01568,,,,2211.01568,6,0 Dual Focal Loss for Calibration,"Linwei Tao, Minjing Dong, Chang Xu",http://arxiv.org/abs/2305.13665,https://github.com/Linwei94/DualFocalLoss,https://huggingface.co/papers/2305.13665,,,,2305.13665,3,1 "Existence, Stability and Scalability of Orthogonal Convolutional Neural Networks","El Mehdi Achour, Francois Malgouyres, Franck Mamalet",http://arxiv.org/abs/2108.05623,,https://huggingface.co/papers/2108.05623,,,,2108.05623,3,0 Phase Transitions in the Detection of Correlated Databases,"Dor Elimelech, Wasim Huleihel",http://arxiv.org/abs/2302.03380,,https://huggingface.co/papers/2302.03380,,,,2302.03380,2,0 New metrics and search algorithms for weighted causal DAGs,"Davin Choo, Kirankumar Shiragur",http://arxiv.org/abs/2305.04445,,https://huggingface.co/papers/2305.04445,,,,2305.04445,2,0 CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms,"Shengyi Huang, Rousslan Fernand Julien Dossa, Chang Ye, Jeff Braga, Dipam Chakraborty, Kinal Mehta, João Madeira Araujo",http://arxiv.org/abs/2111.08819,https://github.com/vwxyzjn/cleanrl,https://huggingface.co/papers/2111.08819,,,,2111.08819,4,1 Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning,"Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Khan Emtiyaz, Mark Schmidt",http://arxiv.org/abs/2302.09738,https://github.com/yorkerlin/StructuredNGD-DL,https://huggingface.co/papers/2302.09738,,,,2302.09738,6,1 Polarity Is All You Need to Learn and Transfer Faster,"Alice (Qingyang) Wang, Michael Powell, Eric Bridgeford, Ali Geisa, Joshua Vogelstein",http://arxiv.org/abs/2303.17589,,https://huggingface.co/papers/2303.17589,,,,2303.17589,5,1 Scaling Vision Transformers to 22 Billion Parameters,"Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Collier, Alexey Gritsenko, Vighnesh N Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetic, Dustin Tran, Thomas Kipf, Mario Lucic, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby",http://arxiv.org/abs/2302.05442,,https://huggingface.co/papers/2302.05442,,,,2302.05442,42,2 Toward Fair and Robust Estimation of Optimal Treatment Regimes,"Kwangho Kim, Jose Zubizarreta",,,,,,,,, Internally Rewarded Reinforcement Learning,"Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter",http://arxiv.org/abs/2302.00270,,https://huggingface.co/papers/2302.00270,,,,2302.00270,5,0 FedVS: Straggler-Resilient and Privacy-Preserving Vertical Federated Learning for Split Models,"Songze Li, Duanyi YAO, Jin Liu",http://arxiv.org/abs/2304.13407,,https://huggingface.co/papers/2304.13407,,,,2304.13407,3,0 Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting,"Yuchen Liu, Chen Chen, Lingjuan Lyu, Fangzhao Wu, Sai Wu, Gang Chen",http://arxiv.org/abs/2302.06079,https://github.com/YuchenLiu-a/byzantine-gas,https://huggingface.co/papers/2302.06079,,,,2302.06079,6,0 BPipe: Memory-Balanced Pipeline Parallelism for Training Large Language Models,"Taebum Kim, Hyoungjoo Kim, Gyeong-In Yu, Byung-Gon Chun",,,,,,,,, Stable Estimation of Heterogeneous Treatment Effects,"Anpeng Wu, Kun Kuang, Ruoxuan Xiong, Bo Li, Fei Wu",,,,,,,,, Tight Data Access Bounds for Private Top-$k$ Selection,"Hao WU, Olga Ohrimenko, Anthony Wirth",,,,,,,,, I$^2$SB: Image-to-Image Schrödinger Bridge,"Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos Theodorou, Weili Nie, Anima Anandkumar",,,,,,,,, Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming,"Jinuk Kim, Yeonwoo Jeong, Deokjae Lee, Hyun Oh Song",http://arxiv.org/abs/2301.12187,,https://huggingface.co/papers/2301.12187,,,,2301.12187,4,1 Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss,"Pierre Bréchet, Katerina Papagiannouli, Jing An, Guido Montufar",http://arxiv.org/abs/2303.03027,,https://huggingface.co/papers/2303.03027,,,,2303.03027,4,1 Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach,"Yanwei Jia, Xun Yu Zhou",http://arxiv.org/abs/2108.06655,,https://huggingface.co/papers/2108.06655,,,,2108.06655,2,0 VIMA: Robot Manipulation with Multimodal Prompts,"Yunfan Jiang, Agrim Gupta, Zichen Zhang, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei, Anima Anandkumar, Yuke Zhu, Jim Fan",,,,,,,,, StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes,"Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry, Sayan Ranu, N M Anoop Krishnan",http://arxiv.org/abs/2301.12477,,https://huggingface.co/papers/2301.12477,,,,2301.12477,5,1 Multi-agent Online Scheduling: MMS Allocations for Indivisible Items,"Shengwei Zhou, Rufan Bai, Xiaowei Wu",http://arxiv.org/abs/2304.13405,,https://huggingface.co/papers/2304.13405,,,,2304.13405,3,0 Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries,"Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Solja\v{c}i\'{c}, Akash Srivastava",http://arxiv.org/abs/2303.02484,,https://huggingface.co/papers/2303.02484,,,,2303.02484,8,1 NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation,"Jianfeng Wang, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Thomas Lukasiewicz",,,,,,,,, Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching,"Ilgee Hong, Sen Na, Michael Mahoney, Mladen Kolar",http://arxiv.org/abs/2305.18379,,https://huggingface.co/papers/2305.18379,,,,2305.18379,4,1 Fast Algorithms for Distributed k-Clustering with Outliers,"Junyu Huang, Qilong Feng, Ziyun Huang, Jinhui Xu, Jianxin Wang",,,,,,,,, End-to-End Full-Atom Antibody Design,"Xiangzhe Kong, Wenbing Huang, Yang Liu",http://arxiv.org/abs/2302.00203,,https://huggingface.co/papers/2302.00203,,,,2302.00203,3,0 Disentangled Generative Models for Robust Prediction of System Dynamics,"Stathi Fotiadis, Mario Lino, Shunlong Hu, Stef Garasto, Chris Cantwell, Anil Bharath",http://arxiv.org/abs/2108.11684,,https://huggingface.co/papers/2108.11684,,,,2108.11684,6,1 Understanding Plasticity in Neural Networks,"Clare Lyle, Zeyu Zheng, Evgenii Nikishin, Bernardo Avila Pires, Razvan Pascanu, Will Dabney",http://arxiv.org/abs/2303.01486,,https://huggingface.co/papers/2303.01486,,,,2303.01486,6,0 Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits,"Sunrit Chakraborty, Saptarshi Roy, Ambuj Tewari",http://arxiv.org/abs/2211.05964,,https://huggingface.co/papers/2211.05964,,,,2211.05964,3,0 Conditional Tree Matching for Inference-Time Adaptation of Tree Prediction Models,"Harshit Varma, Abhijeet Awasthi, Sunita Sarawagi",,,,,,,,, Interval Bound Interpolation for Few-shot Learning with Few Tasks,"Shounak Datta, Sankha Subhra Mullick, Anish Chakrabarty, Swagatam Das",http://arxiv.org/abs/2204.03511,,https://huggingface.co/papers/2204.03511,,,,2204.03511,4,0 Estimating Causal Effects using a Multi-task Deep Ensemble,"Ziyang Jiang, Zhuoran Hou, Yiling Liu, Yiman Ren, Keyu Li, David Carlson",http://arxiv.org/abs/2301.11351,,https://huggingface.co/papers/2301.11351,,,,2301.11351,6,2 Automated Search for Conjectures on Mathematical Constants using Analysis of Integer Sequences,"Ofir Razon, Yoav Harris, Shahar Gottlieb, Dan Carmon, Ofir David, Ido Kaminer",http://arxiv.org/abs/2212.09470,,https://huggingface.co/papers/2212.09470,,,,2212.09470,6,0 LM-Design: Structure-informed Language Models Are Protein Designers,"Zaixiang Zheng, Yifan Deng, Dongyu Xue, Yi Zhou, Fei YE, Quanquan Gu",,,,,,,,, TIDE: Time Derivative Diffusion for Deep Learning on Graphs,"Maximilian Krahn, Maysam Behmanesh, Maks Ovsjanikov",http://arxiv.org/abs/2212.02483,,https://huggingface.co/papers/2212.02483,,,,2212.02483,3,1 Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning,"Junyi Zhu, Ruicong Yao, Matthew B Blaschko",http://arxiv.org/abs/2306.00127,https://github.com/JunyiZhu-AI/surrogate_model_extension,https://huggingface.co/papers/2306.00127,,,,2306.00127,3,0 "The SSL Interplay: Augmentations, Inductive Bias, and Generalization","Vivien Cabannnes, Bobak T Kiani, Randall Balestriero, Yann LeCun, Alberto Bietti",http://arxiv.org/abs/2302.02774,,https://huggingface.co/papers/2302.02774,,,,2302.02774,5,1 Convergence of first-order methods for nonconvex constrained optimization with dependent data,"Hanbaek Lyu, Ahmet Alacaoglu",,,,,,,,, CogQA: Answering Advanced Questions on Scientific Articles,"Yoonjoo Lee, Kyungjae Lee, Sunghyun Park, Dasol Hwang, Jaehyeon Kim, Hong-in Lee, Moontae Lee",,,,,,,,, Instrumental Variable Estimation of Average Partial Causal Effects,"Yuta Kawakami, manabu kuroki, Jin Tian",,,,,,,,, Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process,"Yidong Ouyang, Liyan Xie, Guang Cheng",,,,,,,,, MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks,"Wenfang Sun, Yingjun Du, Xiantong Zhen, Fan Wang, Ling Wang, Cees Snoek",http://arxiv.org/abs/2305.10309,,https://huggingface.co/papers/2305.10309,,,,2305.10309,6,0 Provable Dynamic Fusion for Low-Quality Multimodal Data,"qingyang zhang, Haitao Wu, Changqing Zhang, Qinghua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng",http://arxiv.org/abs/2306.02050,,https://huggingface.co/papers/2306.02050,,,,2306.02050,7,1 Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering,"Erlin Pan, zhao kang",http://arxiv.org/abs/2305.02931,,https://huggingface.co/papers/2305.02931,,,,2305.02931,2,1 SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation,"Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li",http://arxiv.org/abs/2211.14813,https://github.com/ArrowLuo/SegCLIP,https://huggingface.co/papers/2211.14813,,,,2211.14813,5,0 Explainability as statistical inference,"Hugo Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei",http://arxiv.org/abs/2212.03131,,https://huggingface.co/papers/2212.03131,,,,2212.03131,4,1 Learning Prescriptive ReLU Networks,"Wei Sun, Asterios Tsiourvas",http://arxiv.org/abs/2306.00651,,https://huggingface.co/papers/2306.00651,,,,2306.00651,2,1 Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions,"Lin-Han Jia, Lan-Zhe Guo, Zhi Zhou, Jie-Jing Shao, Yuke Xiang, Yu-Feng Li",,,,,,,,, Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels,"Simone Bombari, Shayan Kiyani, Marco Mondelli",http://arxiv.org/abs/2302.01629,,https://huggingface.co/papers/2302.01629,,,,2302.01629,3,0 Human-Timescale Adaptation in an Open-Ended Task Space,"Jakob Bauer, Kate Baumli, Feryal Behbahani, Avishkar Bhoopchand, Natalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Satinder Singh, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang",http://arxiv.org/abs/2301.07608,,https://huggingface.co/papers/2301.07608,,,,2301.07608,28,1 Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Linear Speedup and Partial Participation,"Xiaoyun Li, Ping Li",,,,,,,,, ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang",http://arxiv.org/abs/2301.12040,,https://huggingface.co/papers/2301.12040,,,,2301.12040,4,0 Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1 Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wang, Sen Lin, Junshan Zhang",,,,,,,,, Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0 Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,1 Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0 Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,2 Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1 Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice,"Yishay Mansour, Richard Nock, Robert C. Williamson",,,,,,,,, "Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods","Aleksandr Shevchenko, Kevin Kögler, Hamed Hassani, Marco Mondelli",http://arxiv.org/abs/2212.13468,,https://huggingface.co/papers/2212.13468,,,,2212.13468,4,1 Mimetic Initialization of Self-Attention Layers,"Asher Trockman, Zico Kolter",http://arxiv.org/abs/2305.09828,,https://huggingface.co/papers/2305.09828,,,,2305.09828,2,0 Subsample Ridge Ensembles: Equivalences and Generalized Cross-Validation,"Jin-Hong Du, Pratik Patil, Arun Kuchibhotla",http://arxiv.org/abs/2304.13016,,https://huggingface.co/papers/2304.13016,,,,2304.13016,3,1 Second-Order Optimization with Lazy Hessians,"Nikita Doikov, El Mahdi Chayti, Martin Jaggi",http://arxiv.org/abs/2212.00781,,https://huggingface.co/papers/2212.00781,,,,2212.00781,3,2 Generalized Teacher Forcing for Learning Chaotic Dynamics,"Florian Hess, Zahra Monfared, Manuel Brenner, Daniel Durstewitz",http://arxiv.org/abs/2306.04406,,https://huggingface.co/papers/2306.04406,,,,2306.04406,4,0 HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption,"Seewoo Lee, Garam Lee, Jung Woo Kim, Junbum Shin, Mun-Kyu Lee",,,,,,,,, Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations,David Wipf,,,,,,,,, Direct Parameterization of Lipschitz-Bounded Deep Networks,"Ruigang Wang, Ian Manchester",http://arxiv.org/abs/2301.11526,https://github.com/acfr/LBDN,https://huggingface.co/papers/2301.11526,,,,2301.11526,2,1 XAI Beyond Classification: Interpretable Neural Clustering,"Xi Peng, Yunfan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou",http://arxiv.org/abs/1808.07292,,https://huggingface.co/papers/1808.07292,,,,1808.07292,6,1 Exploiting locality in high-dimensional Factorial hidden Markov models,"Lorenzo Rimella, Nick Whiteley",http://arxiv.org/abs/1902.01639,,https://huggingface.co/papers/1902.01639,,,,1902.01639,2,0 Mitigating the Effects of Non-Identifiability on Inference for Bayesian Neural Networks with Latent Variables,"Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez",http://arxiv.org/abs/1911.00569,,https://huggingface.co/papers/1911.00569,,,,1911.00569,3,0 Project and Forget: Solving Large-Scale Metric Constrained Problems,"Rishi Sonthalia, Anna C. Gilbert",http://arxiv.org/abs/2005.03853,,https://huggingface.co/papers/2005.03853,,,,2005.03853,2,0 "Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence","Julie Nutini, Issam Laradji, Mark Schmidt",http://arxiv.org/abs/1712.08859,,https://huggingface.co/papers/1712.08859,,,,1712.08859,3,1 Cluster-Specific Predictions with Multi-Task Gaussian Processes,"Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey",http://arxiv.org/abs/2011.07866,,https://huggingface.co/papers/2011.07866,,,,2011.07866,4,1 Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials,"Daiqi Gao, Yufeng Liu, Donglin Zeng",,,,,,,,, Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks,"Aleksandr Shevchenko, Vyacheslav Kungurtsev, Marco Mondelli",http://arxiv.org/abs/2111.02278,,https://huggingface.co/papers/2111.02278,,,,2111.02278,3,1 "Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism","Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos",http://arxiv.org/abs/2012.11579,,https://huggingface.co/papers/2012.11579,,,,2012.11579,4,0 "Distributed Stochastic Gradient Descent: Nonconvexity, Nonsmoothness, and Convergence to Local Minima","Brian Swenson, Ryan Murray, H. Vincent Poor, Soummya Kar",http://arxiv.org/abs/2003.02818,,https://huggingface.co/papers/2003.02818,,,,2003.02818,4,0 Towards Learning to Imitate from a Single Video Demonstration,"Glen Berseth, Florian Golemo, Christopher Pal",http://arxiv.org/abs/1901.07186,,https://huggingface.co/papers/1901.07186,,,,1901.07186,3,0 Faith-Shap: The Faithful Shapley Interaction Index,"Che-Ping Tsai, Chih-Kuan Yeh, Pradeep Ravikumar",,,,,,,,, Knowledge Hypergraph Embedding Meets Relational Algebra,"Bahare Fatemi, Perouz Taslakian, David Vazquez, David Poole",http://arxiv.org/abs/2102.09557,,https://huggingface.co/papers/2102.09557,,,,2102.09557,4,0 Deep linear networks can benignly overfit when shallow ones do,"Niladri S. Chatterji, Phil Long",http://arxiv.org/abs/2209.09315,,https://huggingface.co/papers/2209.09315,,,,2209.09315,2,0 Taming graph kernels with random features,Krzysztof Choromanski,http://arxiv.org/abs/2305.00156,,https://huggingface.co/papers/2305.00156,,,,2305.00156,1,0 On Uni-Modal Feature Learning in Supervised Multi-Modal Learning,"Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, Hang Zhao",http://arxiv.org/abs/2305.01233,,https://huggingface.co/papers/2305.01233,,,,2305.01233,8,0 CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations,"Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon",http://arxiv.org/abs/2305.01118,,https://huggingface.co/papers/2305.01118,,,,2305.01118,5,2 CLIPood: Generalizing CLIP to Out-of-Distributions,"Yang Shu, Xingzhuo Guo, Jialong Wu, Ximei Wang, Jianmin Wang, Mingsheng Long",http://arxiv.org/abs/2302.00864,,https://huggingface.co/papers/2302.00864,,,,2302.00864,6,0 Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning,"Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, Tarek Abdelzaher, Jiawei Han",http://arxiv.org/abs/2211.03044,,https://huggingface.co/papers/2211.03044,,,,2211.03044,6,1 Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization,"Jiwoo Son, Minsu Kim, Hyeonah Kim, Jinkyoo Park",,,,,,,,, ChiPFormer: Transferable Chip Placement via Offline Decision Transformer,"Yao LAI, Jinxin Liu, Zhentao Tang, Bin Wang, Jianye Hao, Ping Luo",,,,,,,,, OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Model,"Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang",,,,,,,,, Effectively Using Public Data in Privacy Preserving Machine Learning,"Milad Nasresfahani, Saeed Mahloujifar, Xinyu Tang, Prateek Mittal, Amir Houmansadr",,,,,,,,, Feed Two Birds with One Scone: Exploiting Wild Data for Both Out-of-Distribution Generalization and Detection,"Haoyue Bai, Gregory Canal, Xuefeng Du, Jeongyeol Kwon, Robert Nowak, Yixuan Li",http://arxiv.org/abs/2306.09158,https://github.com/deeplearning-wisc/scone,https://huggingface.co/papers/2306.09158,,,,2306.09158,6,0 Discovering Object-Centric Generalized Value Functions From Pixels,"Somjit Nath, Gopeshh Subbaraj, Khimya Khetarpal, Samira Ebrahimi Kahou",http://arxiv.org/abs/2304.13892,,https://huggingface.co/papers/2304.13892,,,,2304.13892,4,1 Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning,"Thomas Carta, Clément Romac, Thomas Wolf, sylvain lamprier, Olivier Sigaud, Pierre-Yves Oudeyer",http://arxiv.org/abs/2302.02662,,https://huggingface.co/papers/2302.02662,,,,2302.02662,6,3 FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction,"Yongxin Guo, Xiaoying Tang, Tao Lin",http://arxiv.org/abs/2205.13462,https://github.com/lins-lab/fedbr,https://huggingface.co/papers/2205.13462,,,,2205.13462,3,0 Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation,"Matthew Raffel, Drew Penney, Lizhong Chen",,,,,,,,, Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning,"Yu Yang, Besmira Nushi, Hamid Palangi, Baharan Mirzasoleiman",http://arxiv.org/abs/2304.03916,,https://huggingface.co/papers/2304.03916,,,,2304.03916,4,1 Great Models Think Alike: Improving Model Reliability via Inter-Model Latent Agreement,"Ailin Deng, Miao Xiong, Bryan Hooi",http://arxiv.org/abs/2305.01481,,https://huggingface.co/papers/2305.01481,,,,2305.01481,3,0 Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection,"XiaoHui Zhang, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Chu Yuan Zhang",,,,,,,,, FAIRER: Fairness as Decision Rationale Alignment,"Li Tianlin, Qing Guo, Aishan Liu, Mengnan Du, Zhiming Li, Yang Liu",,,,,,,,, Deep Graph Representation Learning and Optimization for Influence Maximization,"Chen Ling, Junji Jiang, Junxiang Wang, My T. Thai, Renhao Xue, James Song, Meikang Qiu, Liang Zhao",http://arxiv.org/abs/2305.02200,https://github.com/triplej0079/DeepIM,https://huggingface.co/papers/2305.02200,,,,2305.02200,8,1 Improving Fair Training under Correlation Shifts,"Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh",http://arxiv.org/abs/2302.02323,,https://huggingface.co/papers/2302.02323,,,,2302.02323,4,0 SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning,"Junran Wu, Xueyuan Chen, Bowen Shi, Shangzhe Li, Ke Xu",http://arxiv.org/abs/2305.04501,,https://huggingface.co/papers/2305.04501,,,,2305.04501,5,0 ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction,"Wang Zhang, Lily Weng, Subhro Das, Alexandre Megretsky, Luca Daniel, Lam Nguyen",http://arxiv.org/abs/2302.05783,,https://huggingface.co/papers/2302.05783,,,,2302.05783,6,2 Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting,"Shayan Shirahmad Gale Bagi, Zahra Gharaee, Oliver Schulte, Mark Crowley",http://arxiv.org/abs/2302.08635,https://github.com/sshirahmad/GCRL,https://huggingface.co/papers/2302.08635,,,,2302.08635,4,1 Meta Optimal Transport,"Brandon Amos, Giulia Luise, samuel cohen, Ievgen Redko",http://arxiv.org/abs/2206.05262,,https://huggingface.co/papers/2206.05262,,,,2206.05262,4,1 InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models,"Yingheng Wang, Yair Schiff, Aaron Gokaslan, Weishen Pan, Fei Wang, Chris De Sa, Volodymyr Kuleshov",http://arxiv.org/abs/2306.08757,,https://huggingface.co/papers/2306.08757,,,,2306.08757,7,0 CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks,"Jue Wang, Yucheng Lu, Binhang Yuan, Beidi Chen, Percy Liang, Chris De Sa, Christopher Re, Ce Zhang",,,,,,,,, Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning,"Cheng Lu, Huayu Chen, Jianfei Chen, Hang Su, Chongxuan Li, Jun Zhu",http://arxiv.org/abs/2304.12824,,https://huggingface.co/papers/2304.12824,,,,2304.12824,6,0 BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models,"Junnan Li, DONGXU LI, Silvio Savarese, Steven Hoi",https://arxiv.org/abs/2301.12597,https://github.com/salesforce/LAVIS/tree/main/projects/blip2,https://huggingface.co/papers/2301.12597,https://huggingface.co/spaces/Salesforce/BLIP2,https://huggingface.co/Salesforce/blip2-flan-t5-xxl,,2301.12597,4,1 The Benefits of Mixup for Feature Learning,"Difan Zou, Yuan Cao, Yuanzhi Li, Quanquan Gu",http://arxiv.org/abs/2303.08433,,https://huggingface.co/papers/2303.08433,,,,2303.08433,4,1 GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks,"Salah GHAMIZI, Jingfeng ZHANG, Maxime Cordy, Mike Papadakis, Masashi Sugiyama, YVES LE TRAON",http://arxiv.org/abs/2302.02907,,https://huggingface.co/papers/2302.02907,,,,2302.02907,6,0 Test-time Adaptation with Slot-Centric Models,"Mihir Prabhudesai, Anirudh Goyal, Sujoy Paul, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Gaurav Aggarwal, Thomas Kipf, Deepak Pathak, Katerina Fragkiadaki",http://arxiv.org/abs/2203.11194,,https://huggingface.co/papers/2203.11194,,,,2203.11194,9,2 Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification,"Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan",,,,,,,,, Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least,"Siddharth Joshi, Baharan Mirzasoleiman",http://arxiv.org/abs/2302.09195,,https://huggingface.co/papers/2302.09195,,,,2302.09195,2,1 Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs,"Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-yi Lee, Shao-Hua Sun",http://arxiv.org/abs/2301.12950,,https://huggingface.co/papers/2301.12950,,,,2301.12950,5,0 Cooperative Open-ended Learning Framework for Zero-Shot Coordination,"Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan",http://arxiv.org/abs/2302.04831,,https://huggingface.co/papers/2302.04831,,,,2302.04831,7,1 CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design,"Desi Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster",,,,,,,,, On the Identifiability and Estimation of Causal Location-Scale Noise Models,"Alexander Immer, Christoph Schultheiss, Julia Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx",http://arxiv.org/abs/2210.09054,,https://huggingface.co/papers/2210.09054,,,,2210.09054,6,1 From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders,"Raanan Yehezkel Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik",http://arxiv.org/abs/2306.00624,,https://huggingface.co/papers/2306.00624,,,,2306.00624,4,0 Reinforcement Learning in Low-rank MDPs with Density Features,"Audrey Huang, Jinglin Chen, Nan Jiang",http://arxiv.org/abs/2302.02252,,https://huggingface.co/papers/2302.02252,,,,2302.02252,3,1 Improving Adversarial Robustness of Deep Equilibrium Models with Explicit Regulations Along the Neural Dynamics,"Zonghan Yang, Peng Li, Tianyu Pang, Yang Liu",,,,,,,,, When and How Does Known Class Help Discover Unknown Ones? Provable Understanding Through Spectral Analysis,"Yiyou Sun, Zhenmei Shi, Yingyiu Liang, Yixuan Li",,,,,,,,, Reachability-Aware Laplacian Representation in Reinforcement Learning,"Kaixin Wang, Kuangqi Zhou, Jiashi Feng, Bryan Hooi, Xinchao Wang",http://arxiv.org/abs/2210.13153,,https://huggingface.co/papers/2210.13153,,,,2210.13153,5,0 A New PHO-rmula for Improved Performance of Semi-Structured Networks,David Rügamer,http://arxiv.org/abs/2306.00522,,https://huggingface.co/papers/2306.00522,,,,2306.00522,1,0 Provably Convergent Schrödinger Bridge with Applications to Probabilistic Time Series Imputation,"Yu Chen, Wei Deng, Shikai Fang, Fengpei Li, Tianjiao N Yang, Yikai Zhang, Kashif Rasul, Shandian Zhe, Anderson Schneider, Yuriy Nevmyvaka",,,,,,,,, Conformal Prediction for Federated Uncertainty Quantification Under Label Shift,"Mehdi Makni, Vincent Plassier, Aleksandr Rubashevskii, Eric Moulines, Maxim Panov",http://arxiv.org/abs/2306.05131,,https://huggingface.co/papers/2306.05131,,,,2306.05131,5,1 Revisiting the Linear-Programming Framework for Offline RL with General Function Approximation,"Asuman Ozdaglar, Sarath Pattathil, Jiawei Zhang, Kaiqing Zhang",,,,,,,,, Non-autoregressive Conditional Diffusion Models for Time Series Prediction,"Lifeng Shen, James Kwok",http://arxiv.org/abs/2306.05043,,https://huggingface.co/papers/2306.05043,,,,2306.05043,2,0 Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions,"Leo Klarner, Tim G. J. Rudner, Michael Reutlinger, Torsten Schindler, Garrett Morris, Charlotte Deane, Yee-Whye Teh",,,,,,,,, SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series,"Iris Huijben, Arthur A. Nijdam, Sebastiaan Overeem, Merel Van Gilst, Ruud J. G. van Sloun",,,,,,,,, Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction,"Minghao Guo, Veronika Thost, Samuel Song, Adithya Balachandran, Payel Das, Jie Chen, Wojciech Matusik",,,,,,,,, A Closer Look at the Intervention Procedure of Concept Bottleneck Models,"Sungbin Shin, Yohan Jo, Sungsoo Ahn, Namhoon Lee",http://arxiv.org/abs/2302.14260,,https://huggingface.co/papers/2302.14260,,,,2302.14260,4,1 Simple Hardware-Efficient Long Convolutions for Sequence Modeling,"Daniel Y Fu, Elliot L Epstein, Eric Nguyen, Michael Zhang, Tri Dao, Atri Rudra, Christopher Re",http://arxiv.org/abs/2302.06646,,https://huggingface.co/papers/2302.06646,,,,2302.06646,8,2 Towards Controlled Data Augmentations for Active Learning,"Jianan Yang, Jianan Yang, Haobo Wang, Sai Wu, Gang Chen, Junbo Zhao",,,,,,,,, "Bigger, Better, Faster: Human-level Atari with human-level efficiency","Max Schwarzer, Johan Obando Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro",http://arxiv.org/abs/2305.19452,https://github.com/google-research/google-research/tree/master/bigger_better_faster,https://huggingface.co/papers/2305.19452,,,,2305.19452,6,3 A Law of Robustness beyond Isoperimetry,"Yihan Wu, Heng Huang, Hongyang Zhang",http://arxiv.org/abs/2202.11592,,https://huggingface.co/papers/2202.11592,,,,2202.11592,3,0 Federated Conformal Predictors for Distributed Uncertainty Quantification,"Charles Lu, Yaodong Yu, Sai Karimireddy, Michael Jordan, Ramesh Raskar",http://arxiv.org/abs/2305.17564,https://github.com/clu5/federated-conformal,https://huggingface.co/papers/2305.17564,,,,2305.17564,5,1 Mirror Sinkhorn: Fast Online Optimization on Transport Polytopes,"Marin Ballu, Quentin Berthet",http://arxiv.org/abs/2211.10420,,https://huggingface.co/papers/2211.10420,,,,2211.10420,2,0 From Noisy Fixed-Point Iterations to Private ADMM for Centralized and Federated Learning,"Edwige Cyffers, Aurélien Bellet, Debabrota Basu",http://arxiv.org/abs/2302.12559,,https://huggingface.co/papers/2302.12559,,,,2302.12559,3,0 Pareto Regret Analyses in Multi-objective Multi-armed Bandit,"Mengfan Xu, Diego Klabjan",http://arxiv.org/abs/2212.00884,,https://huggingface.co/papers/2212.00884,,,,2212.00884,2,0 Online Nonstochastic Control with Adversarial and Static Constraints,"Xin Liu, Zixian Yang, Lei Ying",http://arxiv.org/abs/2302.02426,,https://huggingface.co/papers/2302.02426,,,,2302.02426,3,0 Constrained Efficient Global Optimization of Expensive Black-box Functions,"Wenjie Xu, Yuning Jiang, Bratislav Svetozarevic, Colin Jones",http://arxiv.org/abs/2211.00162,,https://huggingface.co/papers/2211.00162,,,,2211.00162,4,0 Generalized Polyak Step Size for First Order Optimization with Momentum,"Xiaoyu Wang, Mikael Johansson, Tong Zhang",http://arxiv.org/abs/2305.12939,,https://huggingface.co/papers/2305.12939,,,,2305.12939,3,0 Complexity of block coordinate descent with proximal regularization and applications to Wasserstein CP-dictionary learning,"Dohyun Kwon, Hanbaek Lyu",http://arxiv.org/abs/2306.02420,,https://huggingface.co/papers/2306.02420,,,,2306.02420,2,0 FlexGen: High-throughput Generative Inference of Large Language Models with a Single GPU,"Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Beidi Chen, Percy Liang, Christopher Re, Ion Stoica, Ce Zhang",http://arxiv.org/abs/2303.06865,https://github.com/FMInference/FlexGen,https://huggingface.co/papers/2303.06865,,,,2303.06865,14,3 Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories,"Qinqing Zheng, Mikael Henaff, Brandon Amos, Aditya Grover",http://arxiv.org/abs/2210.06518,,https://huggingface.co/papers/2210.06518,,,,2210.06518,4,1 Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy,"Blake Woodworth, Konstantin Mishchenko, Francis Bach",http://arxiv.org/abs/2302.03542,,https://huggingface.co/papers/2302.03542,,,,2302.03542,3,1 Conditionally Strongly Log-Concave Generative Models,"Florentin Guth, Etienne Lempereur, Joan Bruna, Stéphane Mallat",http://arxiv.org/abs/2306.00181,,https://huggingface.co/papers/2306.00181,,,,2306.00181,4,0 Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations,"jaehoon cha, Jeyan Thiyagalingam",,,,,,,,, Rotation and Translation Invariant Representation Learning with Implicit Neural Representations,"Sehyun Kwon, Joo Young Choi, Ernest Ryu",http://arxiv.org/abs/2304.13995,,https://huggingface.co/papers/2304.13995,,,,2304.13995,3,0 Self-supervised learning of Split Invariant Equivariant representations,"Quentin Garrido, Laurent Najman, Yann LeCun",http://arxiv.org/abs/2302.10283,,https://huggingface.co/papers/2302.10283,,,,2302.10283,3,1 Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection,"Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas THOME",http://arxiv.org/abs/2305.16966,https://github.com/MarcLafon/heatood,https://huggingface.co/papers/2305.16966,,,,2305.16966,4,0 Hyperbolic Image-text Representations,"Karan Desai, Maximilian Nickel, Tanmay Rajpurohit, Justin Johnson, Ramakrishna Vedantam",http://arxiv.org/abs/2304.09172,,https://huggingface.co/papers/2304.09172,,,,2304.09172,5,1 Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano,"Chuan Guo, Alexandre Sablayrolles, Maziar Sanjabi",http://arxiv.org/abs/2210.13662,,https://huggingface.co/papers/2210.13662,,,,2210.13662,3,0 Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames,"Ondrej Biza, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Gamaleldin Elsayed, Aravindh Mahendran, Thomas Kipf",http://arxiv.org/abs/2302.04973,,https://huggingface.co/papers/2302.04973,,,,2302.04973,6,1 The photo-sketch correspondence problem: a new benchmark and a self-supervised approach,"Xuanchen Lu, Xiaolong Wang, Judith E. Fan",,,,,,,,, Distilling Internet-Scale Vision-Language Models into Embodied Agents,"Theodore R Sumers, Kenneth Marino, Arun Ahuja, Rob Fergus, Ishita Dasgupta",http://arxiv.org/abs/2301.12507,,https://huggingface.co/papers/2301.12507,,,,2301.12507,5,0 MyoDex: A Generalizable Prior for Dexterous Manipulation,"Vittorio Caggiano, Sudeep Dasari, Vikash Kumar",,,,,,,,, Jump-Start Reinforcement Learning,"Ikechukwu Uchendu, Ted Xiao, Yao Lu, Banghua Zhu, Mengyuan Yan, Joséphine Simon, Matthew Bennice, Chuyuan Fu, Cong Ma, Jiantao Jiao, Sergey Levine, Karol Hausman",http://arxiv.org/abs/2204.02372,,https://huggingface.co/papers/2204.02372,,,,2204.02372,12,1 Adaptive Coordination in Social Embodied Rearrangement,"Andrew Szot, Unnat Jain, Dhruv Batra, Zsolt Kira, Ruta Desai, Akshara Rai",http://arxiv.org/abs/2306.00087,,https://huggingface.co/papers/2306.00087,,,,2306.00087,6,0 ContraBAR: Contrastive Bayes-Adaptive Deep RL,"Era Choshen, Aviv Tamar",http://arxiv.org/abs/2306.02418,,https://huggingface.co/papers/2306.02418,,,,2306.02418,2,0 Guiding Pretraining in Reinforcement Learning with Large Language Models,"Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas",http://arxiv.org/abs/2302.06692,,https://huggingface.co/papers/2302.06692,,,,2302.06692,8,0 PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient,"Kaixin Wang, Zhou Daquan, Jiashi Feng, Shie Mannor",,,,,,,,, Differentially Private Sharpness-Aware Training,"Jinseong Park, Hoki Kim, Yujin Choi, Jaewook Lee",http://arxiv.org/abs/2306.05651,https://github.com/jinseongP/DPSAT,https://huggingface.co/papers/2306.05651,,,,2306.05651,4,0 Provably and Practically Efficient Neural Contextual Bandits,Sudeep Salgia,http://arxiv.org/abs/2206.00099,,https://huggingface.co/papers/2206.00099,,,,2206.00099,3,1 How Does Information Bottleneck Help Deep Learning?,"Kenji Kawaguchi, Zhun Deng, Xu Ji, Jiaoyang Huang",http://arxiv.org/abs/2305.18887,https://github.com/xu-ji/information-bottleneck,https://huggingface.co/papers/2305.18887,,,,2305.18887,4,0 Why Is Public Pretraining Necessary for Private Model Training?,"Arun Ganesh, Mahdi Haghifam, Milad Nasresfahani, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Guha Thakurta, Lun Wang",http://arxiv.org/abs/2302.09483,,https://huggingface.co/papers/2302.09483,,,,2302.09483,8,0 Learning Instance-Specific Augmentations by Capturing Local Invariances,"Ning Miao, Tom Rainforth, Emile Mathieu, Yann Dubois, Yee-Whye Teh, Adam Foster, Hyunjik Kim",http://arxiv.org/abs/2206.00051,,https://huggingface.co/papers/2206.00051,,,,2206.00051,7,0 On Balancing Bias and Variance in Unsupervised Multi-Source-Free Domain Adaptation,"Maohao Shen, Yuheng Bu, Gregory Wornell",http://arxiv.org/abs/2202.00796,,https://huggingface.co/papers/2202.00796,,,,2202.00796,3,1 NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning,"Tianxin Wei, Zeming Guo, Yifan Chen, Jingrui He",,,,,,,,, FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning,"Songtao Liu, Zhengkai Tu, Minkai Xu, Zuobai Zhang, Lu Lin, Rex Ying, Jian Tang, Peilin Zhao, Dinghao Wu",http://arxiv.org/abs/2209.15315,https://github.com/SongtaoLiu0823/FusionRetro,https://huggingface.co/papers/2209.15315,,,,2209.15315,9,2 Is Overfitting Necessary for Implicit Video Representation? ,"HEE MIN CHOI, Hyoa Kang, Dokwan Oh",,,,,,,,, Neural Prediction Errors enable Analogical Visual Reasoning in Human Standard Intelligence Tests,"Lingxiao YANG, Hongzhi You, Zonglei Zhen, Dahui Wang, Xiaohong Wan, Xiaohua Xie, Ru-Yuan Zhang",,,,,,,,, AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation,"Yi-Fan Zhang, xue wang, Kexin Jin, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan",http://arxiv.org/abs/2304.12566,https://github.com/yfzhang114/AdaNPC,https://huggingface.co/papers/2304.12566,,,,2304.12566,8,1 The Wisdom of Hindsight Makes Language Models Better Instruction Followers,"Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E Gonzalez",http://arxiv.org/abs/2302.05206,,https://huggingface.co/papers/2302.05206,,,,2302.05206,5,1 Adversarial Collaborative Learning on Non-IID Features,"Qinbin Li, Bingsheng He, Dawn Song",,,,,,,,, LSDS++ : Dual Sampling for Accelerated k-means++,"Chenglin Fan, Ping Li, Xiaoyun Li",,,,,,,,, Gradient-based Wang--Landau Algorithm: A Novel Sampler for Output Distribution of Neural Networks over the Input Space,"Weitang Liu, Yi-Zhuang You, Ying Wai Li, Jingbo Shang",,,,,,,,, On Sampling with Approximate Transport Maps,"Louis Grenioux, Alain Oliviero Durmus, Eric Moulines, Marylou Gabrié",http://arxiv.org/abs/2302.04763,,https://huggingface.co/papers/2302.04763,,,,2302.04763,4,0 A Mathematical Model for Curriculum Learning for Parities,"Elisabetta Cornacchia, Elchanan Mossel",,,,,,,,, Differentiable Simulations for Enhanced Sampling of Rare Events,"Martin Šípka, Johannes Dietschreit, Lukáš Grajciar, Rafael Gomez-Bombarelli",http://arxiv.org/abs/2301.03480,,https://huggingface.co/papers/2301.03480,,,,2301.03480,4,0 Gaussian processes at the Helm(holtz): A more fluid model for ocean currents,"Renato Berlinghieri, Brian Trippe, David Burt, Ryan Giordano, Kaushik Srinivasan, Tamay Özgökmen, Junfei Xia, Tamara Broderick",http://arxiv.org/abs/2302.10364,,https://huggingface.co/papers/2302.10364,,,,2302.10364,8,1 Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons,"Rasmus Kjær Høier, D. Staudt, Christopher Zach",http://arxiv.org/abs/2302.01228,,https://huggingface.co/papers/2302.01228,,,,2302.01228,3,0 Quantized Distributed Training of Large Models with Convergence Guarantees,"Ilia Markov, Adrian Vladu, Qi Guo, Dan Alistarh",http://arxiv.org/abs/2302.02390,,https://huggingface.co/papers/2302.02390,,,,2302.02390,4,0 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models,"Guangxuan Xiao, Ji Lin, Mickael Seznec, Hao Wu, Julien Demouth, Song Han",http://arxiv.org/abs/2211.10438,https://github.com/mit-han-lab/smoothquant,https://huggingface.co/papers/2211.10438,,,,2211.10438,6,3 Efficiently predicting high resolution mass spectra with graph neural networks,"Michael Murphy, Stefanie Jegelka, Ernest Fraenkel, Tobias Kind, David Healey, Thomas Butler",http://arxiv.org/abs/2301.11419,,https://huggingface.co/papers/2301.11419,,,,2301.11419,6,1 Learning to Design Analog Circuits to Meet Threshold Specifications,"Dmitrii Krylov, Pooya Khajeh, Junhan Ouyang, Thomas Reeves, Tongkai Liu, Hiba Ajmal, Hamidreza Aghasi, Roy Fox",,,,,,,,, Can Large Language Models Reason about Program Behavior?,"Kexin Pei, David Bieber, Kensen Shi, Charles Sutton, Pengcheng Yin",,,,,,,,, Overcoming Simplicity Bias in Deep Networks using a Feature Sieve,"Rishabh Tiwari, Pradeep Shenoy",http://arxiv.org/abs/2301.13293,,https://huggingface.co/papers/2301.13293,,,,2301.13293,2,0 Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single,"Paul Vicol, Zico Kolter, Kevin Swersky",http://arxiv.org/abs/2304.11153,,https://huggingface.co/papers/2304.11153,,,,2304.11153,3,0 Robust One-Class Classification with Signed Distance Function using 1-Lipschitz Neural Networks,"Louis Bethune, Paul Novello, Thibaut Boissin, Guillaume Coiffier, Mathieu Serrurier, Quentin VINCENOT, Andres Troya-Galvis",,,,,,,,, The Role of Entropy and Reconstruction for Multi-View Self-Supervised Learning,"Borja Rodríguez Gálvez, Arno Blaas, Pau Rodriguez, Adam Golinski, Xavi Suau, Jason Ramapuram, Dan Busbridge, Luca Zappella",,,,,,,,, Gradient Descent in Neural Networks as Sequential Learning in Reproducing Kernel Banach Space,"Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh",,,,,,,,, Width and Depth Limits Commute in Residual Networks,"Soufiane Hayou, Greg Yang",http://arxiv.org/abs/2302.00453,,https://huggingface.co/papers/2302.00453,,,,2302.00453,2,0 How much does Initialization Affect Generalization?,"Sameera Ramasinghe, Lachlan E. MacDonald, Moshiur Farazi, Hemanth Saratchandran, Simon Lucey",,,,,,,,, How Powerful are Shallow Neural Networks with Bandlimited Random Weights?,"Ming Li, Sho Sonoda, Feilong Cao, Yu Guang Wang, Jiye Liang",http://arxiv.org/abs/2008.08427,,https://huggingface.co/papers/2008.08427,,,,2008.08427,5,0 On the Convergence of SARSA with Linear Function Approximation,"Shangtong Zhang, Remi Tachet des Combes, Romain Laroche",http://arxiv.org/abs/2202.06828,,https://huggingface.co/papers/2202.06828,,,,2202.06828,3,0 Group Equivariant Fourier Neural Operators for Partial Differential Equations,"Jacob Helwig, Xuan Zhang, Cong Fu, Jerry Kurtin, Stephan Wojtowytsch, Shuiwang Ji",http://arxiv.org/abs/2306.05697,https://github.com/divelab/AIRS,https://huggingface.co/papers/2306.05697,,,,2306.05697,6,0 SE(3) diffusion model with application to protein backbone generation,"Jason Yim, Brian Trippe, Valentin De Bortoli, Emile Mathieu, Arnaud Doucet, Regina Barzilay, Tommi Jaakkola",http://arxiv.org/abs/2302.02277,,https://huggingface.co/papers/2302.02277,,,,2302.02277,7,0 LongCoder: A Long-Range Pre-trained Language Model for Code Completion,"Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian McAuley",,,,,,,,, Deep Temporal Sets with Evidential Reinforced Attentions for Unique Behavioral Pattern Discovery,"Dingrong Wang, Deep Pandey, Krishna Neupane, Zhiwei Yu, Ervine Zheng, Zhi Zheng, Qi Yu",,,,,,,,, Bayesian Progressive Deep Topic Model with Knowledge Informed Textual Data Coarsening Process,"Zhibin Duan, Xinyang Liu, Yudi Su, Yishi Xu, Bo Chen, Mingyuan Zhou",,,,,,,,, Distortion and Uncertainty Aware Loss for Panoramic Depth Completion,"Zhiqiang Yan, Xiang Li, Kun Wang, Shuo Chen, Jun Li, Jian Yang",,,,,,,,, Universal Morphology Control via Contextual Modulation,"Zheng Xiong, Jacob Beck, Shimon Whiteson",http://arxiv.org/abs/2302.11070,,https://huggingface.co/papers/2302.11070,,,,2302.11070,3,0 SlotGAT: Slot-based Message Passing for Heterogeneous Graphs,"Ziang Zhou, Jieming Shi, Renchi Yang, Yuanhang Zou, Qing Li",,,,,,,,, A Group Symmetric Stochastic Differential Equation Model for Molecule Multi-modal Pretraining,"Shengchao Liu, weitao du, Zhiming Ma, Hongyu Guo, Jian Tang",http://arxiv.org/abs/2305.18407,,https://huggingface.co/papers/2305.18407,,,,2305.18407,5,0 LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation,"Rui Xue, Haoyu Han, MohamadAli Torkamani, Jian Pei, Xiaorui Liu",http://arxiv.org/abs/2302.01503,https://github.com/RXPHD/Lazy_GNN,https://huggingface.co/papers/2302.01503,,,,2302.01503,5,0 GNOT: A General Neural Operator Transformer for Operator Learning,"Zhongkai Hao, Zhengyi Wang, Hang Su, Chengyang Ying, Yinpeng Dong, LIU SONGMING, Ze Cheng, Jian Song, Jun Zhu",http://arxiv.org/abs/2302.14376,https://github.com/thu-ml/GNOT,https://huggingface.co/papers/2302.14376,,,,2302.14376,9,1 Graph Positional Encoding via Random Feature Propagation,"Moshe Eliasof, Fabrizio Frasca, Beatrice Bevilacqua, Eran Treister, Gal Chechik, Haggai Maron",http://arxiv.org/abs/2303.02918,,https://huggingface.co/papers/2303.02918,,,,2303.02918,6,0 Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering,"Mingqi Yang, Wenjie Feng, Yanming Shen, Bryan Hooi",http://arxiv.org/abs/2305.06102,https://github.com/qslim/PDF,https://huggingface.co/papers/2305.06102,,,,2305.06102,4,0 Local Vertex Colouring Graph Neural Networks,"Shouheng Li, Dongwoo Kim, Qing Wang",,,,,,,,, Composer: Creative and Controllable Image Synthesis with Composable Conditions,"Lianghua Huang, Di Chen, Yu Liu, Yujun Shen, Deli Zhao, Jingren Zhou",http://arxiv.org/abs/2302.09778,,https://huggingface.co/papers/2302.09778,,,,2302.09778,6,0 Transformers Meet Directed Graphs,"Simon Markus Geisler, Yujia Li, Daniel Mankowitz, Taylan Cemgil, Stephan Günnemann, Cosmin Paduraru",http://arxiv.org/abs/2302.00049,,https://huggingface.co/papers/2302.00049,,,,2302.00049,6,1 Robust Camera Pose Refinement for Multi-Resolution Hash Encoding,"Hwan Heo, Taekyung Kim, Jiyoung Lee, Jaewon Lee, Soohyun Kim, Hyunwoo Kim, Jin-Hwa Kim",http://arxiv.org/abs/2302.01571,,https://huggingface.co/papers/2302.01571,,,,2302.01571,7,0 Enforcing Hard Constraints with Soft Barriers: Safe-driven Reinforcement Learning in Unknown Stochastic Environments,"Yixuan Wang, Simon Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu",,,,,,,,, Causal Structure Learning for Latent Intervened Non-stationary Data,"Chenxi Liu, Kun Kuang",,,,,,,,, Constrained Decision Transformer for Offline Safe Reinforcement Learning,"Zuxin Liu, Zijian Guo, Yihang Yao, Zhepeng Cen, Wenhao Yu, Tingnan Zhang, Ding Zhao",http://arxiv.org/abs/2302.07351,,https://huggingface.co/papers/2302.07351,,,,2302.07351,7,1 Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning,"Dingyang Chen, Qi Zhang",http://arxiv.org/abs/2306.01920,,https://huggingface.co/papers/2306.01920,,,,2306.01920,2,0 Graph Mixup with Soft Alignments,"Hongyi Ling, Zhimeng Jiang, Meng Liu, Shuiwang Ji, Na Zou",http://arxiv.org/abs/2306.06788,,https://huggingface.co/papers/2306.06788,,,,2306.06788,5,0 Regularizing Towards Soft Equivariance Under Mixed Symmetries,"Hyunsu Kim, Hyungi Lee, Hongseok Yang, Juho Lee",http://arxiv.org/abs/2306.00356,,https://huggingface.co/papers/2306.00356,,,,2306.00356,4,0 Featured Graph Coarsening with Similarity Guarantees,"MANOJ KUMAR, Anurag Sharma, Shashwat Saxena, Sandeep Kumar",,,,,,,,, Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability,"Jianing Zhu, Hengzhuang Li, Jiangchao Yao, Tongliang Liu, Jianliang Xu, Bo Han",http://arxiv.org/abs/2306.03715,https://github.com/tmlr-group/Unleashing-Mask,https://huggingface.co/papers/2306.03715,,,,2306.03715,6,0 Conditional Graph Information Bottleneck for Molecular Relational Learning,"Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park",http://arxiv.org/abs/2305.01520,https://github.com/Namkyeong/CGIB,https://huggingface.co/papers/2305.01520,,,,2305.01520,6,0 Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang",http://arxiv.org/abs/2305.14876,https://github.com/bboylyg/RNP,https://huggingface.co/papers/2305.14876,,,,2305.14876,7,0 Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1 Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0 CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1 Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,1 Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL,"Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez",,,,,,,,, A Statistical Perspective on Retrieval-Based Models,"Soumya Basu, Ankit Singh Rawat, Manzil Zaheer",,,,,,,,, PFNs4BO: Meta-Learning the surrogate model for Bayesian optimization from scratch using Transformers,"Samuel Gabriel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter",,,,,,,,, "Synthetic Data, Real Errors: How (Not) to Publish and Use Synthetic Data","Boris van Breugel, Zhaozhi Qian, Mihaela van der Schaar",http://arxiv.org/abs/2305.09235,,https://huggingface.co/papers/2305.09235,,,,2305.09235,3,1 Controlled Differential Equations on Long Sequences via Non-standard Wavelets,"Sourav Pal, Zhanpeng Zeng, Sathya Ravi, Vikas Singh",,,,,,,,, Towards a better understanding of representation dynamics under TD-learning,"Yunhao Tang, Remi Munos",http://arxiv.org/abs/2305.18491,,https://huggingface.co/papers/2305.18491,,,,2305.18491,2,0 Reinforcement Learning with History Dependent Dynamic Contexts,"Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier",,,,,,,,, Leveraging Demonstrations to Improve Online Learning: Quality Matters,"Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen",http://arxiv.org/abs/2302.03319,,https://huggingface.co/papers/2302.03319,,,,2302.03319,5,0 A theory of representation learning gives a deep generalisation of kernel methods,"Adam Yang, Maxime Robeyns, Edward Milsom, Ben Anson, Nandi Schoots, Laurence Aitchison",http://arxiv.org/abs/2108.13097,,https://huggingface.co/papers/2108.13097,,,,2108.13097,6,0 A Kernelized Stein Discrepancy for Biological Sequences,"Alan Amin, Eli Weinstein, Debora Marks",,,,,,,,, Learning to Maximize Mutual Information for Dynamic Feature Selection,"Ian Covert, Wei Qiu, MingYu Lu, Na Yoon Kim, Nathan White, Su-In Lee",http://arxiv.org/abs/2301.00557,,https://huggingface.co/papers/2301.00557,,,,2301.00557,6,1 Fast Combinatorial Algorithms for Min Max Correlation Clustering,"Sami Davies, Benjamin Moseley, Heather Newman",http://arxiv.org/abs/2301.13079,,https://huggingface.co/papers/2301.13079,,,,2301.13079,3,0 Polynomial Time and Private Learning of Unbounded Gaussian Mixture Models,"Jamil Arbas, Hassan Ashtiani, Christopher Liaw",http://arxiv.org/abs/2303.04288,,https://huggingface.co/papers/2303.04288,,,,2303.04288,3,0 The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond,"Jiin Woo, Gauri Joshi, Yuejie Chi",,,,,,,,, Robust Non-Linear Feedback Coding via Power-Constrained Deep Learning,"Junghoon Kim, Taejoon Kim, David Love, Christopher G. Brinton",http://arxiv.org/abs/2304.13178,,https://huggingface.co/papers/2304.13178,,,,2304.13178,4,0 Feature Directions Matter: Long-Tailed Learning via Rotated Balanced Representation,"Peifeng Gao, Qianqian Xu, Peisong Wen, Zhiyong Yang, Huiyang Shao, Qingming Huang",,,,,,,,, Learning to Boost Training by Periodic Nowcasting Near Future Weights,"Jinhyeok Jang, Woo-han Yun, Won Hwa Kim, Youngwoo Yoon, Jaehong Kim, Jaeyeon Lee, ByungOk Han",,,,,,,,, Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks,"Shikuang Deng, Hao Lin, Yuhang Li, Shi Gu",,,,,,,,, "mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video","Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, yuanhong xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou",,,,,,,,, Which Invariance Should We Transfer? A Causal Minimax Learning Approach,"Mingzhou Liu, Xiangyu Zheng, Xinwei Sun, Fang Fang, Yizhou Wang",http://arxiv.org/abs/2107.01876,,https://huggingface.co/papers/2107.01876,,,,2107.01876,5,0 Which is Better for Learning with Noisy Labels: The Semi-supervised Method or Modeling Label Noise?,"Yu Yao, Mingming Gong, Yuxuan Du, Jun Yu, Bo Han, Kun Zhang, Tongliang Liu",,,,,,,,, dugMatting: Decomposed-Uncertainty-Guided Matting,"Jiawei Wu, Changqing Zhang, Zuoyong Li, Huazhu Fu, Xi Peng, Joey Tianyi Zhou",http://arxiv.org/abs/2306.01452,,https://huggingface.co/papers/2306.01452,,,,2306.01452,6,2 Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum,"Jigang Kim, Daesol Cho, H. Kim",http://arxiv.org/abs/2305.09943,,https://huggingface.co/papers/2305.09943,,,,2305.09943,3,0 Prompting Large Language Model for Machine Translation: A Case Study,"Biao Zhang, Barry Haddow, Alexandra Birch",http://arxiv.org/abs/2301.07069,,https://huggingface.co/papers/2301.07069,,,,2301.07069,3,0 Expertise Trees Resolve Knowledge Limitations in Collective Decision-Making,"Axel Abels, Tom Lenaerts, Vito Trianni, Ann Nowe",http://arxiv.org/abs/2305.01063,,https://huggingface.co/papers/2305.01063,,,,2305.01063,4,1 Superhuman Fairness,"Omid Memarrast, Linh Vu, Brian Ziebart",http://arxiv.org/abs/2301.13420,,https://huggingface.co/papers/2301.13420,,,,2301.13420,3,1 PWSHAP: A Path-Wise Explanation Model for Targeted Variables,"Lucile Ter-Minassian, Oscar Clivio, Karla DiazOrdaz, Robin Evans, Christopher Holmes",,,,,,,,, "DP-Fast MH: Private, Fast, and Accurate Metropolis-Hastings for Large-Scale Bayesian Inference","Wanrong Zhang, Ruqi Zhang",http://arxiv.org/abs/2303.06171,,https://huggingface.co/papers/2303.06171,,,,2303.06171,2,0 Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits,"Jongyeong Lee, Junya Honda, Chao-Kai Chiang, Masashi Sugiyama",http://arxiv.org/abs/2302.01544,,https://huggingface.co/papers/2302.01544,,,,2302.01544,4,0 Multiply Robust Off-policy Evaluation and Learning under Truncation by Death,"Jianing Chu, Shu Yang, Wenbin Lu",,,,,,,,, Run-off Election: Improved Provable Defense against Data Poisoning Attacks,"Keivan Rezaei, Kiarash Banihashem, Atoosa Chegini, Soheil Feizi",http://arxiv.org/abs/2302.02300,,https://huggingface.co/papers/2302.02300,,,,2302.02300,4,2 The Ideal Continual Learner: An Agent That Never Forgets,"Liangzu Peng, Paris Giampouras, Rene Vidal",http://arxiv.org/abs/2305.00316,,https://huggingface.co/papers/2305.00316,,,,2305.00316,3,0 Consistency of Multiple Kernel Clustering,"Weixuan Liang, Xinwang Liu, Yong Liu, Chuan Ma, Yunping Zhao, Zhe Liu, En Zhu",,,,,,,,, Oscillation-free Quantization for Low-bit Vision Transformers,"Shih-Yang liu, Zechun Liu, Kwang-Ting Cheng",http://arxiv.org/abs/2302.02210,https://github.com/nbasyl/OFQ,https://huggingface.co/papers/2302.02210,,,,2302.02210,3,1 On the Connection Between MPNN and Graph Transformer,"Chen Cai, Truong Son Hy, Rose Yu, Yusu Wang",http://arxiv.org/abs/2301.11956,,https://huggingface.co/papers/2301.11956,,,,2301.11956,4,0 Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR,"Kaiwen Wang, Nathan Kallus, Wen Sun",http://arxiv.org/abs/2302.03201,,https://huggingface.co/papers/2302.03201,,,,2302.03201,3,0 Blockwise Stochastic Variance-Reduced Methods with Parallel Speedup for Multi-Block Bilevel Optimization,"Quanqi Hu, Zi-Hao Qiu, Zhishuai Guo, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.18730,,https://huggingface.co/papers/2305.18730,,,,2305.18730,5,0 Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games,"Batuhan Yardim, Semih Cayci, Matthieu Geist, Niao He",http://arxiv.org/abs/2212.14449,,https://huggingface.co/papers/2212.14449,,,,2212.14449,4,0 Delayed Bandits: When Do Intermediate Observations Help?,"Emmanuel Esposito, Saeed Masoudian, Hao Qiu, Dirk van der Hoeven, Nicolò Cesa-Bianchi, Yevgeny Seldin",http://arxiv.org/abs/2305.19036,,https://huggingface.co/papers/2305.19036,,,,2305.19036,6,0 Omnipredictors for Constrained Optimization,"Lunjia Hu, Inbal Livni Navon, Omer Reingold, Chutong Yang",http://arxiv.org/abs/2209.07463,,https://huggingface.co/papers/2209.07463,,,,2209.07463,4,0 Bandit Online Linear Optimization with Hints and Queries,"Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit",,,,,,,,, Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective,"Tanya Marwah, Zachary Lipton, Jianfeng Lu, Andrej Risteski",http://arxiv.org/abs/2210.12101,,https://huggingface.co/papers/2210.12101,,,,2210.12101,4,0 Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise,"Shiwei Zeng, Jie Shen",http://arxiv.org/abs/2306.00673,,https://huggingface.co/papers/2306.00673,,,,2306.00673,2,0 Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes,"seyed amir saberi, Amir Najafi, Abolfazl Motahari, Babak Khalaj",http://arxiv.org/abs/2209.05953,,https://huggingface.co/papers/2209.05953,,,,2209.05953,4,1 "Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps","Marco Cuturi, Michal Klein, Pierre Ablin",http://arxiv.org/abs/2302.04065,,https://huggingface.co/papers/2302.04065,,,,2302.04065,3,0 Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance,"Zhao Song, Xin Yang, Yuanyuan Yang, Lichen Zhang",http://arxiv.org/abs/2210.11542,,https://huggingface.co/papers/2210.11542,,,,2210.11542,4,0 Combinatorial Neural Bandits,"Taehyun Hwang, Kyuwook Chai, Min-hwan Oh",http://arxiv.org/abs/2306.00242,,https://huggingface.co/papers/2306.00242,,,,2306.00242,3,0 Reward-Mixing MDPs with Few Contexts are Learnable,"Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor",,,,,,,,, Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling,"Adam Bouland, Yosheb Getachew, Yujia Jin, Aaron Sidford, Kevin Tian",http://arxiv.org/abs/2301.03763,,https://huggingface.co/papers/2301.03763,,,,2301.03763,5,1 Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits,Chen Wang,http://arxiv.org/abs/2306.02208,,https://huggingface.co/papers/2306.02208,,,,2306.02208,1,0 Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation,"Li'ang Li, Yifei duan, Guanghua Ji, Yongqiang Cai",http://arxiv.org/abs/2305.18460,,https://huggingface.co/papers/2305.18460,,,,2305.18460,4,0 Dynamical Linear Bandits,"Marco Mussi, Alberto Maria Metelli, Marcello Restelli",http://arxiv.org/abs/2211.08997,,https://huggingface.co/papers/2211.08997,,,,2211.08997,3,0 Stochastic Gradient Succeeds for Bandits,"Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvari, Dale Schuurmans",,,,,,,,, Minimax estimation of discontinuous optimal transport maps: The semi-discrete case,"Aram-Alexandre Pooladian, Vincent Divol, Jonathan Niles-Weed",http://arxiv.org/abs/2301.11302,,https://huggingface.co/papers/2301.11302,,,,2301.11302,3,0 A unified optimization framework of ANN-SNN Conversion: towards optimal mapping from activation values to firing rates ,"Haiyan Jiang, srinivas anumasa, Giulia De Masi, Huan Xiong, Bin Gu",,,,,,,,, Lowering the Pre-training Tax for Gradient-based Subset Training: A Lightweight Distributed Pre-Training Toolkit,"Yeonju Ro, Zhangyang “Atlas” Wang, Vijay Chidambaram, Aditya Akella",,,,,,,,, Exploring Chemical Space with Score-based Out-of-distribution Generation,"Seul Lee, Jaehyeong Jo, Sung Ju Hwang",http://arxiv.org/abs/2206.07632,https://github.com/SeulLee05/MOOD,https://huggingface.co/papers/2206.07632,,,,2206.07632,3,0 Adaptive Computation with Elastic Input Sequence,"Fuzhao Xue, Fuzhao Xue, Valerii Likhosherstov, Anurag Arnab, Neil Houlsby, Mostafa Dehghani, Yang You",http://arxiv.org/abs/2301.13195,https://github.com/google-research/scenic,https://huggingface.co/papers/2301.13195,,,,2301.13195,6,0 Hierarchical Imitation Learning with Vector Quantized Models,"Kalle Kujanpää, Joni Pajarinen, Alexander Ilin",http://arxiv.org/abs/2301.12962,,https://huggingface.co/papers/2301.12962,,,,2301.12962,3,1 Thompson Sampling with Diffusion Generative Prior,"Yu-Guan Hsieh, Shiva Kasiviswanathan, Branislav Kveton, Patrick Bloebaum",http://arxiv.org/abs/2301.05182,,https://huggingface.co/papers/2301.05182,,,,2301.05182,4,1 SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation,"Shikun Sun, Longhui Wei, Junliang Xing, Jia Jia, Qi Tian",,,,,,,,, Regularization-free Diffeomorphic Temporal Alignment Nets,"Ron Shapira Weber, Oren Freifeld",,,,,,,,, Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes,"Nico Stucki, Johannes C. Paetzold, Suprosanna Shit, bjoern menze, Ulrich Bauer",http://arxiv.org/abs/2211.15272,https://github.com/nstucki/Betti-matching,https://huggingface.co/papers/2211.15272,,,,2211.15272,5,0 FedDisco: Federated Learning with Discrepancy-Aware Collaboration,"Rui Ye, Mingkai Xu, Jianyu Wang, Chenxin Xu, Siheng Chen, Yan-Feng Wang",http://arxiv.org/abs/2305.19229,https://github.com/MediaBrain-SJTU/FedDisco,https://huggingface.co/papers/2305.19229,,,,2305.19229,6,0 Personalized Federated Learning with Inferred Collaboration Graphs,"Rui Ye, Zhenyang Ni, Fangzhao Wu, Siheng Chen, Yan-Feng Wang",,,,,,,,, ModelDiff: A Framework for Comparing Learning Algorithms,"Harshay Shah, Sung Min (Sam) Park, Andrew Ilyas, Aleksander Madry",http://arxiv.org/abs/2211.12491,https://github.com/MadryLab/modeldiff,https://huggingface.co/papers/2211.12491,,,,2211.12491,4,1 Half-Hop: A graph upsampling approach for slowing down message passing,"Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Veličković, Eva Dyer",,,,,,,,, Structural Re-weighting Improves Graph Domain Adaptation,"Shikun Liu, Tianchun Li, Yongbin Feng, Nhan Tran, Han Zhao, Qiang Qiu, Pan Li, Pan Li",http://arxiv.org/abs/2306.03221,,https://huggingface.co/papers/2306.03221,,,,2306.03221,7,0 InfoOT: Information Maximizing Optimal Transport,"Ching-Yao Chuang, Stefanie Jegelka, David Alvarez-Melis",http://arxiv.org/abs/2210.03164,,https://huggingface.co/papers/2210.03164,,,,2210.03164,3,0 Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks,"Minyoung Huh, Brian Cheung, Pulkit Agrawal, Phillip Isola",http://arxiv.org/abs/2305.08842,,https://huggingface.co/papers/2305.08842,,,,2305.08842,4,0 Stabilizing Transformer Training by Preventing Attention Entropy Collapse,"Shuangfei Zhai, Tatiana Likhomanenko, Etai Littwin, Dan Busbridge, Jason Ramapuram, Yizhe Zhang, Jiatao Gu, Joshua M Susskind",http://arxiv.org/abs/2303.06296,,https://huggingface.co/papers/2303.06296,,,,2303.06296,8,0 Dataset Distillation with Convexified Implicit Gradients,"Noel Loo, Ramin Hasani, Mathias Lechner, Daniela Rus",http://arxiv.org/abs/2302.06755,,https://huggingface.co/papers/2302.06755,,,,2302.06755,4,0 Regression with Label Permutation in Generalized Linear Model,"Guanhua Fang, Ping Li",http://arxiv.org/abs/2206.11775,,https://huggingface.co/papers/2206.11775,,,,2206.11775,2,0 Learning Unnormalized Statistical Models via Compositional Optimization,"Wei Jiang, Jiayu Qin, Lingyu Wu, Changyou Chen, Tianbao Yang, Lijun Zhang",http://arxiv.org/abs/2306.07485,,https://huggingface.co/papers/2306.07485,,,,2306.07485,6,1 Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction,"Khai Nguyen, Dang Nguyen, Nhat Ho",http://arxiv.org/abs/2301.04791,,https://huggingface.co/papers/2301.04791,,,,2301.04791,3,1 Decentralized SGD and Average-direction SAM are Asymptotically Equivalent,"Tongtian Zhu, Fengxiang He, Kaixuan Chen, Mingli Song, Dacheng Tao",http://arxiv.org/abs/2306.02913,,https://huggingface.co/papers/2306.02913,,,,2306.02913,5,0 Complementary Attention for Multi-Agent Reinforcement Learning,"Jianzhun Shao, Hongchang Zhang, Yun Qu, Chang Liu, Shuncheng He, Yuhang Jiang, Xiangyang Ji",,,,,,,,, MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL,"Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang",http://arxiv.org/abs/2305.19923,,https://huggingface.co/papers/2305.19923,,,,2305.19923,7,0 Boosting Offline Reinforcement Learning with Action Preference Query,"Qisen Yang, Shenzhi Wang, Matthieu Lin, Shiji Song, Gao Huang",http://arxiv.org/abs/2306.03362,,https://huggingface.co/papers/2306.03362,,,,2306.03362,5,1 Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization,"Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang",,,,,,,,, Change is Hard: A Closer Look at Subpopulation Shift,"Yuzhe Yang, Haoran Zhang, Dina Katabi, Marzyeh Ghassemi",http://arxiv.org/abs/2302.12254,https://github.com/YyzHarry/SubpopBench,https://huggingface.co/papers/2302.12254,,,,2302.12254,4,0 X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion,"Hanqing Zhao, Dianmo Sheng, Jianmin Bao, Dongdong Chen, Dong Chen, Fang Wen, Lu Yuan, Ce Liu, Wenbo Zhou, Qi Chu, Weiming Zhang, Nenghai Yu",,,,,,,,, Never mind the metrics---what about the uncertainty? Visualising confusion matrix metric distributions,"David Lovell, Dimity Miller, Jaiden Capra, Andrew Bradley",,,,,,,,, Uncertainty Estimation for Molecules: Desiderata and Methods,"Tom Wollschläger, Nicholas Gao, Bertrand Charpentier, Mohamed Amine Ketata, Stephan Günnemann",,,,,,,,, Reliable Measures of Spread in High Dimensional Latent Spaces,"Anna Marbut, Travis Wheeler, Katy McKinney-Bock",http://arxiv.org/abs/2212.08172,,https://huggingface.co/papers/2212.08172,,,,2212.08172,3,2 Attributing Image Generative Models using Latent Fingerprints,"Guangyu Nie, Changhoon Kim, 'YZ' Yezhou Yang, Yi Ren",http://arxiv.org/abs/2304.09752,,https://huggingface.co/papers/2304.09752,,,,2304.09752,4,1 Towards Explaining Distribution Shifts,"Sean Kulinski, David I. Inouye",http://arxiv.org/abs/2210.10275,,https://huggingface.co/papers/2210.10275,,,,2210.10275,2,1 SurProGenes: Survival Risk-Ordered Representation of Cancer Patients and Genes for the Identification of Prognostic Genes,"Junetae Kim, Kyoungsuk Park, Hanseok Jeong, Youngwook Kim, Jeongseon Kim, Sun-Young Kim",,,,,,,,, Topological Point Cloud Clustering,"Vincent Grande, Michael Schaub",http://arxiv.org/abs/2303.16716,,https://huggingface.co/papers/2303.16716,,,,2303.16716,2,0 On the Forward Invariance of Neural ODEs,"Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Mathias Lechner, Yutong Ban, Chuang Gan, Daniela Rus",http://arxiv.org/abs/2210.04763,,https://huggingface.co/papers/2210.04763,,,,2210.04763,7,0 Harmonic Neural Networks,"Atiyo Ghosh, Antonio Gentile, Mario Dagrada, Chul Lee, Seong-hyok Sean Kim, Hyukgeun Cha, Yunjun Choi, Dongho Kim, JEONG-IL KYE, JEONG-IL KYE, Vincent E Elfving",,,,,,,,, Graph Reinforcement Learning for Network Control via Bi-Level Optimization,"Daniele Gammelli, James Harrison, Kaidi Yang, Marco Pavone, Filipe Rodrigues, Francisco Pereira",http://arxiv.org/abs/2305.09129,,https://huggingface.co/papers/2305.09129,,,,2305.09129,6,0 Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning,"Boyin Liu, Zhiqiang Pu, Yi Pan, Jianqiang Yi, Yanyan Liang, D. Zhang",,,,,,,,, Image Restoration with Mean-Reverting Stochastic Differential Equations,"Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas Schön",http://arxiv.org/abs/2301.11699,https://github.com/Algolzw/image-restoration-sde,https://huggingface.co/papers/2301.11699,,,,2301.11699,5,1 What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?,"Rui Yang, Yong LIN, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang",http://arxiv.org/abs/2305.18882,,https://huggingface.co/papers/2305.18882,,,,2305.18882,6,0 Variance Control for Distributional Reinforcement Learning,"Qi Kuang, Zhoufan Zhu, Liwen Zhang, Fan Zhou",,,,,,,,, "Label Distributionally Robust Losses for Multi-class Classification: Consistency, Robustness and Adaptivity","Dixian Zhu, Yiming Ying, Tianbao Yang",http://arxiv.org/abs/2112.14869,https://github.com/Optimization-AI/ICML2023_LDR,https://huggingface.co/papers/2112.14869,,,,2112.14869,3,0 OpenFE: Automated Feature Generation with Expert-level Performance,"Tianping Zhang, Zheyu Zhang, Zhiyuan Fan, Haoyan Luo, Fengyuan Liu, Qian Liu, Wei Cao, Li Jian",http://arxiv.org/abs/2211.12507,https://github.com/ZhangTP1996/OpenFE,https://huggingface.co/papers/2211.12507,,,,2211.12507,8,1 Weighted Sampling without Replacement for Deep Top-$k$ Classification,"Dieqiao Feng, Yuanqi Du, Carla Gomes, Bart Selman",,,,,,,,, A Flexible Diffusion Model,"weitao du, He Zhang, Tao Yang, Yuanqi Du",http://arxiv.org/abs/2206.10365,,https://huggingface.co/papers/2206.10365,,,,2206.10365,4,0 Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks,"Dominik Schnaus, Jongseok Lee, Daniel Cremers, Rudolph Triebel",,,,,,,,, Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs,"Michael Kirchhof, Enkelejda Kasneci, Seong Joon Oh",http://arxiv.org/abs/2302.02865,https://github.com/mkirchhof/Probabilistic_Contrastive_Learning,https://huggingface.co/papers/2302.02865,,,,2302.02865,3,1 Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains,"Matthew Dowling, Yuan Zhao, Memming Park",http://arxiv.org/abs/2306.01802,,https://huggingface.co/papers/2306.01802,,,,2306.01802,3,0 Learning Control by Iterative Inversion,"Gal Leibovich, Guy Jacob, Or Avner, Gal Novik, Aviv Tamar",http://arxiv.org/abs/2211.01724,,https://huggingface.co/papers/2211.01724,,,,2211.01724,5,2 Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning,"Tongzhou Wang, Antonio Torralba, Phillip Isola, Amy Zhang",http://arxiv.org/abs/2304.01203,,https://huggingface.co/papers/2304.01203,,,,2304.01203,4,1 DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm,"Yunhao Tang, Tadashi Kozuno, Mark Rowland, Anna Harutyunyan, Remi Munos, Bernardo Avila Pires, Michal Valko",,,,,,,,, On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures,"Xian Yu, Lei Ying",http://arxiv.org/abs/2301.10932,,https://huggingface.co/papers/2301.10932,,,,2301.10932,2,0 Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling,"Yuta Saito, Qingyang Ren, Thorsten Joachims",http://arxiv.org/abs/2305.08062,,https://huggingface.co/papers/2305.08062,,,,2305.08062,3,0 Statistical Inference on Multi-armed Bandits with Delayed Feedback,"Lei Shi, Jingshen Wang, Tianhao Wu",,,,,,,,, Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,1 Additive Causal Bandits with Unknown Graph,"Alan Malek, Virginia Aglietti, Silvia Chiappa",http://arxiv.org/abs/2306.07858,,https://huggingface.co/papers/2306.07858,,,,2306.07858,3,1 Minimizing Trajectory Curvature of ODE-based Generative Models,"Sangyun Lee, Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2301.12003,https://github.com/sangyun884/fast-ode,https://huggingface.co/papers/2301.12003,,,,2301.12003,3,0 All in a Row: Compressed Convolution Networks for Graphs,"Junshu Sun, Shuhui Wang, XINZHE HAN, Zhe Xue, Qingming Huang",,,,,,,,, Fast Sampling of Diffusion Models via Operator Learning,"Hongkai Zheng, Weili Nie, Arash Vahdat, Kamyar Azizzadenesheli, Anima Anandkumar",http://arxiv.org/abs/2211.13449,,https://huggingface.co/papers/2211.13449,,,,2211.13449,5,0 Model-agnostic Measure of Generalization Difficulty,"Akhilan Boopathy, Kevin Liu, Jaedong Hwang, Shu Ge, Asaad Mohammedsaleh, Ila R. Fiete",http://arxiv.org/abs/2305.01034,,https://huggingface.co/papers/2305.01034,,,,2305.01034,6,1 Quantifying Human Priors over Social and Navigation Networks,Gecia Bravo-Hermsdorff,,,,,,,,, Solving High-Dimensional PDEs with Latent Spectral Models,"Haixu Wu, Tengge Hu, huakun luo, Jianmin Wang, Mingsheng Long",http://arxiv.org/abs/2301.12664,https://github.com/thuml/Latent-Spectral-Models,https://huggingface.co/papers/2301.12664,,,,2301.12664,5,0 Analyzing Diffusion as Serial Reproduction,"Raja Marjieh, Ilia Sucholutsky, Thomas Langlois, Nori Jacoby, Thomas Griffiths",http://arxiv.org/abs/2209.14821,,https://huggingface.co/papers/2209.14821,,,,2209.14821,5,1 PAC Prediction Sets for Large Language Models of Code,"Adam Khakhar, Stephen Mell, Osbert Bastani",http://arxiv.org/abs/2302.08703,,https://huggingface.co/papers/2302.08703,,,,2302.08703,3,0 AutoCoreset: An Automatic Practical Coreset Construction Framework,"Alaa Maalouf, Morad Tukan, Vladimir Braverman, Daniela Rus",http://arxiv.org/abs/2305.11980,https://github.com/alaamaalouf/AutoCoreset,https://huggingface.co/papers/2305.11980,,,,2305.11980,4,0 Learning Perturbations to Explain Time Series Predictions,Joseph Enguehard,http://arxiv.org/abs/2305.18840,,https://huggingface.co/papers/2305.18840,,,,2305.18840,1,1 On the Privacy-Robustness-Utility Trilemma in Distributed Learning,"Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan",http://arxiv.org/abs/2302.04787,,https://huggingface.co/papers/2302.04787,,,,2302.04787,5,0 Infinite Action Contextual Bandits with Reusable Data Exhaust,"Mark Rucker, Yinglun Zhu, Paul Mineiro",http://arxiv.org/abs/2302.08551,,https://huggingface.co/papers/2302.08551,,,,2302.08551,3,0 Regret Minimization and Convergence to Equilibria in General-sum Markov Games,"Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour",http://arxiv.org/abs/2207.14211,,https://huggingface.co/papers/2207.14211,,,,2207.14211,5,0 Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion,"Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Francesco Trovò, Nicola Gatti",http://arxiv.org/abs/2303.01296,,https://huggingface.co/papers/2303.01296,,,,2303.01296,6,0 Distributed Linear Bandits under Communication Constraints,"Sudeep Salgia, Qing Zhao",http://arxiv.org/abs/2211.02212,,https://huggingface.co/papers/2211.02212,,,,2211.02212,2,0 Online Mechanism Design for Information Acquisition,"Federico Cacciamani, Matteo Castiglioni, Nicola Gatti",http://arxiv.org/abs/2302.02873,,https://huggingface.co/papers/2302.02873,,,,2302.02873,3,1 Federated Online and Bandit Convex Optimization,"Kumar Kshitij Patel, Lingxiao Wang, Aadirupa Saha, Nati Srebro",,,,,,,,, Statistical Foundations of Prior-Data Fitted Networks,Thomas Nagler,http://arxiv.org/abs/2305.11097,,https://huggingface.co/papers/2305.11097,,,,2305.11097,1,0 Who Needs to Know? Minimal Knowledge for Optimal Coordination,"Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell",http://arxiv.org/abs/2306.09309,,https://huggingface.co/papers/2306.09309,,,,2306.09309,5,1 Neural networks trained with SGD learn distributions of increasing complexity,"Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt",http://arxiv.org/abs/2211.11567,,https://huggingface.co/papers/2211.11567,,,,2211.11567,3,0 Scaling Laws for Multilingual Neural Machine Translation,"Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia, Markus Freitag, Orhan Firat",http://arxiv.org/abs/2302.09650,,https://huggingface.co/papers/2302.09650,,,,2302.09650,5,0 Explaining the effects of non-convergent MCMC in the training of Energy-Based Models,"Elisabeth Agoritsas, Giovanni Catania, Aurélien Decelle, Beatriz Seoane",,,,,,,,, A Three-regime Model of Network Pruning,"Yefan Zhou, Yaoqing Yang, Arin Chang, Michael Mahoney",http://arxiv.org/abs/2305.18383,,https://huggingface.co/papers/2305.18383,,,,2305.18383,4,1 Metagenomic Binning using Connectivity-constrained Variational Autoencoders,"Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas D. Nielsen",,,,,,,,, SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning,"Dongseok Shim, Seungjae Lee, H. Kim",http://arxiv.org/abs/2301.11520,,https://huggingface.co/papers/2301.11520,,,,2301.11520,3,0 Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation,"Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, Siu Ming Yiu, Ruihua Han",,,,,,,,, Context Consistency Regularization for Label Sparsity in Time Series,"Yooju Shin, Susik Yoon, Hwanjun Song, Dongmin Park, Byunghyun Kim, Jae-Gil Lee, Byung Suk Lee",,,,,,,,, Beyond In-Domain Scenarios: Robust Density-Aware Calibration,"Christian Tomani, Futa Waseda, Yuesong Shen, Daniel Cremers",http://arxiv.org/abs/2302.05118,,https://huggingface.co/papers/2302.05118,,,,2302.05118,4,0 Towards Unbiased Training in Federated Open-world Semi-supervised Learning,"Jie ZHANG, Xiaosong Ma, Song Guo, Wenchao Xu",http://arxiv.org/abs/2305.00771,,https://huggingface.co/papers/2305.00771,,,,2305.00771,4,0 Efficient Training of Language Models using Few-Shot Learning,"Sashank Jakkam Reddi, Sobhan Miryoosefi, Stefani Karp, Shankar Krishnan, Satyen Kale, Seungyeon Kim, Sanjiv Kumar",,,,,,,,, Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning,"Jaehyung Kim, Jinwoo Shin, Dongyeop Kang",http://arxiv.org/abs/2306.04925,,https://huggingface.co/papers/2306.04925,,,,2306.04925,3,0 Controlled Text Generation with Natural Language Instructions,"Wangchunshu Zhou, Yuchen Jiang, Ethan Wilcox, Ryan Cotterell, Mrinmaya Sachan",http://arxiv.org/abs/2304.14293,,https://huggingface.co/papers/2304.14293,,,,2304.14293,5,1 MAGANet: Achieving Combinatorial Generalization by Modeling a Group Action,"Geonho Hwang, Jaewoong Choi, Hyunsoo Cho, Myungjoo Kang",,,,,,,,, Identifying Useful Learnwares for Heterogeneous Label Spaces,"Lan-Zhe Guo, Zhi Zhou, Yu-Feng Li, Zhi-Hua Zhou",,,,,,,,, Do Not Train It: A Linear Neural Architecture Search of Graph Neural Networks,"Peng XU, Lin Zhang, Xuanzhou Liu, Jiaqi Sun, Yue Zhao, Haiqin Yang, Bei Yu",http://arxiv.org/abs/2305.14065,,https://huggingface.co/papers/2305.14065,,,,2305.14065,7,0 Efficient Personalized Federated Learning via Sparse Model-Adaptation,"Daoyuan Chen, Liuyi Yao, Dawei Gao, Bolin Ding, Yaliang Li",http://arxiv.org/abs/2305.02776,,https://huggingface.co/papers/2305.02776,,,,2305.02776,5,0 D2Match: Leveraging Deep Learning and Degeneracy for Subgraph Matching,"Xuanzhou Liu, Lin Zhang, Jiaqi Sun, Yujiu Yang, Haiqin Yang",http://arxiv.org/abs/2306.06380,,https://huggingface.co/papers/2306.06380,,,,2306.06380,5,1 What Makes Entities Similar? A Similarity Flooding Perspective for Multi-sourced Knowledge Graph Embeddings,"Zequn Sun, Jiacheng Huang, Xiaozhou Xu, Qijin Chen, Weijun Ren, Wei Hu",http://arxiv.org/abs/2306.02622,,https://huggingface.co/papers/2306.02622,,,,2306.02622,6,0 CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models,"Zhiyuan Yu, Yuhao Wu, Ning Zhang, Chenguang Wang, Yevgeniy Vorobeychik, Chaowei Xiao",,,,,,,,, FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization,"Zhen WANG, Weirui Kuang, Ce Zhang, Bolin Ding, Yaliang Li",,,,,,,,, A theory of continuous generative flow networks,"Salem Lahlou, Tristan Deleu, Pablo Lemos, Dinghuai Zhang, Alexandra Volokhova, Alex Hernandez-Garcia, Lena Nehale Ezzine, Yoshua Bengio, Nikolay Malkin",http://arxiv.org/abs/2301.12594,,https://huggingface.co/papers/2301.12594,,,,2301.12594,9,1 Multi-Layer Neural Networks as Trainable Ladders of Hilbert Spaces,Zhengdao Chen,,,,,,,,, Gradient Descent Finds the Global Optima of Two-Layer Physics-Informed Neural Networks,"Yihang Gao, Yiqi Gu, Michael Ng",,,,,,,,, Sampling-based Nyström Approximation and Kernel Quadrature,"Satoshi Hayakawa, Harald Oberhauser, Terry Lyons",,,,,,,,, Sample Complexity of Probability Divergences under Group Symmetry,"Ziyu Chen, Markos Katsoulakis, Luc Rey-Bellet, Wei Zhu",http://arxiv.org/abs/2302.01915,,https://huggingface.co/papers/2302.01915,,,,2302.01915,4,0 Personalized Federated Learning under Mixture of Distributions,"Yue Wu, Shuaicheng Zhang, Wenchao Yu, Yanchi Liu, Quanquan Gu, Dawei Zhou, Haifeng Chen, Wei Cheng",http://arxiv.org/abs/2305.01068,,https://huggingface.co/papers/2305.01068,,,,2305.01068,8,2 DIVISION: Memory Efficient Training via Dual Activation Precision,"Guanchu Wang, Zirui Liu, Zhimeng Jiang, Ninghao Liu, Na Zou, Xia Hu",http://arxiv.org/abs/2208.04187,,https://huggingface.co/papers/2208.04187,,,,2208.04187,6,0 Fair yet Asymptotically Equal Collaborative Learning,"Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng, Chuan-Sheng Foo, Bryan Kian Hsiang Low",http://arxiv.org/abs/2306.05764,,https://huggingface.co/papers/2306.05764,,,,2306.05764,5,0 FedCR: Personalized Federated Learning Based on Across-Client Common Representation with Conditional Mutual Information Regularization,"Hao Zhang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong",,,,,,,,, Retrosynthetic Planning with Dual Value Networks,"Guoqing Liu, Di Xue, Shufang Xie, Yingce Xia, Austin Tripp, Krzysztof Maziarz, Marwin Segler, Tao Qin, Zongzhang Zhang, Tie-Yan Liu",http://arxiv.org/abs/2301.13755,,https://huggingface.co/papers/2301.13755,,,,2301.13755,10,0 Optimizing NOTEARS Objectives via Topological Swaps,"Chang Deng, Kevin Bello, Bryon Aragam, Pradeep Ravikumar",http://arxiv.org/abs/2305.17277,https://github.com/duntrain/topo,https://huggingface.co/papers/2305.17277,,,,2305.17277,4,0 Matrix Estimation for Individual Fairness,"Cindy Zhang, Sarah Cen, Devavrat Shah",http://arxiv.org/abs/2302.02096,,https://huggingface.co/papers/2302.02096,,,,2302.02096,3,1 Understanding the Impact of Adversarial Robustness on Accuracy Disparity,"Yuzheng Hu, Fan Wu, Hongyang Zhang, Han Zhao",http://arxiv.org/abs/2211.15762,https://github.com/Accuracy-Disparity/AT-on-AD,https://huggingface.co/papers/2211.15762,,,,2211.15762,4,1 Identifiability of Label Noise Transition Matrix,"Yang Liu, Hao Cheng, Kun Zhang",http://arxiv.org/abs/2202.02016,,https://huggingface.co/papers/2202.02016,,,,2202.02016,3,0 Confidence and Dispersity Speak: Characterizing Prediction Matrix for Unsupervised Accuracy Estimation,"Weijian Deng, Yumin Suh, Stephen Gould, Liang Zheng",,,,,,,,, Learning Neural Constitutive Laws from Motion Observations for Generalizable PDE Dynamics,"Pingchuan Ma, Peter Yichen Chen, Bolei Deng, Josh Tenenbaum, Tao Du, Chuang Gan, Wojciech Matusik",http://arxiv.org/abs/2304.14369,,https://huggingface.co/papers/2304.14369,,,,2304.14369,7,1 Random Grid Neural Processes for Parametric Partial Differential Equations,"Arnaud Vadeboncoeur, Ieva Kazlauskaite, Yanni Papandreou, Fehmi Cirak, Mark Girolami, Omer Deniz Akyildiz",http://arxiv.org/abs/2301.11040,,https://huggingface.co/papers/2301.11040,,,,2301.11040,6,0 Revisiting Structured Variational Autoencoders,"Yixiu Zhao, Scott Linderman",http://arxiv.org/abs/2305.16543,,https://huggingface.co/papers/2305.16543,,,,2305.16543,2,0 MODeL: Memory Optimizations for Deep Learning,"Benoit Steiner, Mostafa Elhoushi, Jacob Kahn, James Hegarty",,,,,,,,, Bi-directional Masks for Efficient N:M Sparse Training,"Yuxin Zhang, Yiting Luo, Mingbao Lin, Yunshan Zhong, JingJing Xie, Fei Chao, Rongrong Ji",http://arxiv.org/abs/2302.06058,https://github.com/zyxxmu/Bi-Mask,https://huggingface.co/papers/2302.06058,,,,2302.06058,7,0 Differentially Private Optimization on Large Model at Small Cost,"Zhiqi Bu, Yu-Xiang Wang, Sheng Zha, George Karypis",http://arxiv.org/abs/2210.00038,,https://huggingface.co/papers/2210.00038,,,,2210.00038,4,0 KDEformer: Accelerating Transformers via Kernel Density Estimation,"Amir Zandieh, Insu Han, Insu Han, Majid Daliri, Amin Karbasi",http://arxiv.org/abs/2302.02451,,https://huggingface.co/papers/2302.02451,,,,2302.02451,4,1 No One Idles: Efficient Heterogeneous Federated Learning with Parallel Edge and Server Computation,"Feilong Zhang, Xianming Liu, Shiyi Lin, Gang Wu, Xiong Zhou, Junjun Jiang, Xiangyang Ji",,,,,,,,, RSC: Accelerate Graph Neural Networks Training via Randomized Sparse Computations,"Zirui Liu, CHEN SHENGYUAN, Kaixiong Zhou, Daochen Zha, Xiao Huang, Xia Hu",,,,,,,,, GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming,"Huigen Ye, Hua Xu, Hongyan Wang, Chengming WANG, Yu Jiang",,,,,,,,, Symmetry-Aware Robot Design with Structured Subgroups,"Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang",http://arxiv.org/abs/2306.00036,,https://huggingface.co/papers/2306.00036,,,,2306.00036,4,0 Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition,"Dongqi Cai, Yangyuxuan Kang, Anbang Yao, Yurong Chen",,,,,,,,, Online Prototype Alignment for Few-shot Policy Transfer,"Qi Yi, Rui Zhang, Jiaming Guo, Shaohui Peng, Yunkai Gao, Kaizhao Yuan, Ruizhi Chen, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen",http://arxiv.org/abs/2306.07307,,https://huggingface.co/papers/2306.07307,,,,2306.07307,13,0 DDGR: Continual Learning with Deep Diffusion-based Generative Replay,"Rui Gao, Weiwei Liu",,,,,,,,, Less is More: Task-aware Layer-wise Distillation for Language Model Compression,"Chen Liang, Simiao Zuo, Qingru Zhang, Pengcheng He, Weizhu Chen, Tuo Zhao",http://arxiv.org/abs/2210.01351,https://github.com/cliang1453/task-aware-distillation,https://huggingface.co/papers/2210.01351,,,,2210.01351,6,1 Proper Losses for Discrete Generative Models,"Rafael Frongillo, Dhamma Kimpara, Bo Waggoner",http://arxiv.org/abs/2211.03761,,https://huggingface.co/papers/2211.03761,,,,2211.03761,3,0 ClusterFuG: Clustering Fully connected Graphs by Multicut,"Ahmed Abbas, Paul Swoboda",http://arxiv.org/abs/2301.12159,,https://huggingface.co/papers/2301.12159,,,,2301.12159,2,0 Multi-Task Off-Policy Learning from Bandit Feedback,"Joey Hong, Branislav Kveton, Manzil Zaheer, Sumeet Katariya, Mohammad Ghavamzadeh",http://arxiv.org/abs/2212.04720,,https://huggingface.co/papers/2212.04720,,,,2212.04720,5,0 Efficient Graph Field Integrators Meet Point Clouds,"Krzysztof Choromanski, Arijit Sehanobish, Han Lin, YUNFAN ZHAO, Eli Berger, Tetiana Parshakova, Qingkai Pan, David Watkins, Tianyi Zhang, Valerii Likhosherstov, Somnath Basu Roy Chowdhury, Kumar Avinava Dubey, Deepali Jain, Tamas Sarlos, Snigdha Chaturvedi, Adrian Weller",http://arxiv.org/abs/2302.00942,,https://huggingface.co/papers/2302.00942,,,,2302.00942,16,0 Moccasin: Efficient Tensor Rematerialization for Neural Networks,"Burak Bartan, Haoming Li, Harris Teague, Christopher Lott, Bistra Dilkina",http://arxiv.org/abs/2304.14463,,https://huggingface.co/papers/2304.14463,,,,2304.14463,5,0 STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning,"Souradip Chakraborty, Amrit Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha",,,,,,,,, The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning,"Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan Murphy, Finale Doshi-Velez",,,,,,,,, Towards Understanding and Improving GFlowNet Training,"Max Shen, Emmanuel Bengio, Ehsan Hajiramezanali, Andreas Loukas, Kyunghyun Cho, Tommaso Biancalani",http://arxiv.org/abs/2305.07170,,https://huggingface.co/papers/2305.07170,,,,2305.07170,6,1 Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy,"Xing Liu, Andrew Duncan, Axel Gandy",http://arxiv.org/abs/2304.14762,,https://huggingface.co/papers/2304.14762,,,,2304.14762,3,0 Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons,"Banghua Zhu, Michael Jordan, Jiantao Jiao",http://arxiv.org/abs/2301.11270,,https://huggingface.co/papers/2301.11270,,,,2301.11270,3,1 Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics,"Shenao Zhang, Wanxin Jin, Zhaoran Wang",,,,,,,,, Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic,"Wesley A. Suttle, Amrit Bedi, Bhrij Patel, Brian Sadler, Alec Koppel, Dinesh Manocha",http://arxiv.org/abs/2301.12083,,https://huggingface.co/papers/2301.12083,,,,2301.12083,6,0 Provable Data Subset Selection For Efficient Neural Networks Training,"Morad Tukan, Samson Zhou, Alaa Maalouf, Daniela Rus, Vladimir Braverman, Dan Feldman",,,,,,,,, BNN-DP: Robustness Certification of Bayesian Neural Networks via Dynamic Programming,"Steven Adams, Andrea Patane, Morteza Lahijanian, Luca Laurenti",,,,,,,,, Semi-Dual Unbalanced Quadratic Optimal Transport: fast statistical rates and convergent algorithm.,"Adrien Vacher, François-Xavier Vialard",,,,,,,,, Existence and Estimation of Critical Batch Size for Training Generative Adversarial Networks with Two Time-Scale Update Rule,"Naoki Sato, Hideaki Iiduka",http://arxiv.org/abs/2201.11989,,https://huggingface.co/papers/2201.11989,,,,2201.11989,2,0 Does a Neural Network Really Encode Symbolic Concepts?,"Mingjie Li, Quanshi Zhang",,,,,,,,, Bayesian Neural Networks Avoid Encoding Complex and Perturbation-Sensitive Concepts,"Qihan Ren, Huiqi Deng, Yunuo Chen, Siyu Lou, Quanshi Zhang",,,,,,,,, DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design,"Jiaqi Guan, Xiangxin Zhou, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu",,,,,,,,, FLEX: an Adaptive Exploration Algorithm for Nonlinear Systems,"Matthieu Blanke, Marc Lelarge",http://arxiv.org/abs/2304.13426,,https://huggingface.co/papers/2304.13426,,,,2304.13426,2,0 Efficient and Equivariant Graph Networks for Predicting Quantum Hamiltonian,"Haiyang Yu, Zhao Xu, Xiaofeng Qian, Xiaoning Qian, Shuiwang Ji",http://arxiv.org/abs/2306.04922,https://github.com/divelab/AIRS,https://huggingface.co/papers/2306.04922,,,,2306.04922,5,0 Unifying Molecular and Textual Representations via Multi-task Language Modelling,"Dimitrios Christofidellis, Giorgio Giannone, Jannis Born, Ole Winther, Teodoro Laino, Matteo Manica",http://arxiv.org/abs/2301.12586,,https://huggingface.co/papers/2301.12586,,,,2301.12586,6,1 On the Functional Similarity of Robust and Non-Robust Neural Representations,"András Balogh, Mark Jelasity",,,,,,,,, Probabilistic Concept Bottleneck Models,"Eunji Kim, Dahuin Jung, Sangha Park, Siwon Kim, Sungroh Yoon",http://arxiv.org/abs/2306.01574,https://github.com/ejkim47/prob-cbm,https://huggingface.co/papers/2306.01574,,,,2306.01574,5,1 Domain Adaptation Under Relaxed Label Shift,"Saurabh Garg, Nick Erickson, University of California James Sharpnack, Alex Smola, Sivaraman Balakrishnan, Zachary Lipton",,,,,,,,, Learning Noisy OR Bayesian Networks with Max-Product Belief Propagation,"Antoine Dedieu, Guangyao Zhou, Dileep George, Miguel Lazaro-Gredilla",,,,,,,,, Scaling Laws for Reward Model Overoptimization,"Leo Gao, John Schulman, Jacob Hilton",http://arxiv.org/abs/2210.10760,,https://huggingface.co/papers/2210.10760,,,,2210.10760,3,0 Vertical Federated Graph Neural Network for Recommender System,"Peihua Mai, Yan (James) Pang",http://arxiv.org/abs/2303.05786,,https://huggingface.co/papers/2303.05786,,,,2303.05786,2,0 Parallel Online Clustering of Bandits via Hedonic Game,"Xiaotong Cheng, Cheng Pan, Setareh Maghsudi",,,,,,,,, Generalization Analysis for Contrastive Representation Learning,"Yunwen Lei, Tianbao Yang, Yiming Ying, Ding-Xuan Zhou",http://arxiv.org/abs/2302.12383,,https://huggingface.co/papers/2302.12383,,,,2302.12383,4,0 Benign Overfitting in Two-layer ReLU Convolutional Neural Networks,"Yiwen Kou, Zixiang Chen, Yuanzhou Chen, Quanquan Gu",,,,,,,,, Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples,"Dongyoon Yang, Insung Kong, Yongdai Kim",http://arxiv.org/abs/2206.03353,,https://huggingface.co/papers/2206.03353,,,,2206.03353,3,0 Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias,"Ryo Karakida, Tomoumi Takase, Tomohiro Hayase, Kazuki Osawa",http://arxiv.org/abs/2210.02720,,https://huggingface.co/papers/2210.02720,,,,2210.02720,4,0 Gradient Descent Converges Linearly for Logistic Regression on Separable Data,"Kyriakos Axiotis, Maxim Sviridenko",,,,,,,,, Federated Adversarial Learning: A Framework with Convergence Analysis,"Xiaoxiao Li, Zhao Song, Jiaming Yang",http://arxiv.org/abs/2208.03635,,https://huggingface.co/papers/2208.03635,,,,2208.03635,3,0 Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs,"Junkai Zhang, Weitong Zhang, Quanquan Gu",http://arxiv.org/abs/2303.10165,,https://huggingface.co/papers/2303.10165,,,,2303.10165,3,1 Policy Regularization with Dataset Constraint for Offline Reinforcement Learning,"Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu",http://arxiv.org/abs/2306.06569,https://github.com/LAMDA-RL/PRDC,https://huggingface.co/papers/2306.06569,,,,2306.06569,5,0 Beyond Reward: Offline Preference-guided Policy Optimization,"Yachen Kang, Diyuan Shi, Jinxin Liu, Li He, Donglin Wang",http://arxiv.org/abs/2305.16217,,https://huggingface.co/papers/2305.16217,,,,2305.16217,5,0 Solving Linear Program with Fast Online Learning Algorithms,"Wenzhi Gao, Dongdong Ge, Chunlin Sun, Yinyu Ye",,,,,,,,, Semi-Parametric Contextual Pricing Algorithm using Cox Proportional Hazards Model,"Young-Geun Choi, Gi-Soo Kim, Choi Yunseo, Wooseong Cho, Myunghee Cho Paik, Min-hwan Oh",,,,,,,,, Poisoning Language Models During Instruction Tuning,"Alexander Wan, Eric Wallace, Sheng Shen, Dan Klein",http://arxiv.org/abs/2305.00944,,https://huggingface.co/papers/2305.00944,,,,2305.00944,4,0 Automatically Auditing Large Language Models via Discrete Optimization,"Erik Jones, Anca Dragan, Aditi Raghunathan, Jacob Steinhardt",http://arxiv.org/abs/2303.04381,,https://huggingface.co/papers/2303.04381,,,,2303.04381,4,1 Data Structures for Density Estimation,"Anders Aamand, Alexandr Andoni, Justin Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal",,,,,,,,, Provably Invariant Learning without Domain Information,"Xiaoyu Tan, Yong LIN, Shengyu Zhu, Chao Qu, Xihe Qiu, Xu Yinghui, Peng Cui, Yuan Qi",,,,,,,,, Online Platt Scaling with Calibeating,"Chirag Gupta, Aaditya Ramdas",http://arxiv.org/abs/2305.00070,,https://huggingface.co/papers/2305.00070,,,,2305.00070,2,0 An Effective Meaningful Way to Evaluate Survival Models,"Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner",http://arxiv.org/abs/2306.01196,,https://huggingface.co/papers/2306.01196,,,,2306.01196,8,1 Evaluating Unsupervised Denoising Requires Unsupervised Metrics,"Adrià Marcos Morales, Matan Leibovich, Sreyas Mohan, Joshua Vincent, Piyush Haluai, Mai Tan, Peter Crozier, Carlos Fernandez-Granda",http://arxiv.org/abs/2210.05553,,https://huggingface.co/papers/2210.05553,,,,2210.05553,8,0 Hidden symmetries of ReLU networks,"Elisenda Grigsby, Kathryn Lindsey, David Rolnick",http://arxiv.org/abs/2306.06179,,https://huggingface.co/papers/2306.06179,,,,2306.06179,3,0 Modeling Dynamic Environments with Scene Graph Memory,"Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Emily Jin, Chengshu Li, Ruohan Zhang, Li Fei-Fei, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín",http://arxiv.org/abs/2305.17537,,https://huggingface.co/papers/2305.17537,,,,2305.17537,10,2 Motion Question Answering via Modular Motion Programs,"Mark Endo, Joy Hsu, Jiaman Li, Jiajun Wu",http://arxiv.org/abs/2305.08953,,https://huggingface.co/papers/2305.08953,,,,2305.08953,4,2 Homomorphism AutoEncoder --- Learning Group Structured Representations from Observed Transitions,"Hamza Keurti, Hsiao-Ru Pan, Michel Besserve, Benjamin F. Grewe, Bernhard Schölkopf",,,,,,,,, MEWL: Few-shot multimodal word learning with referential uncertainty,"Guangyuan Jiang, Manjie Xu, Shiji Xin, Wei Liang, Yujia Peng, Chi Zhang, Yixin Zhu",http://arxiv.org/abs/2306.00503,,https://huggingface.co/papers/2306.00503,,,,2306.00503,7,1 Neural Stochastic Differential Games for Time-series Analysis,"Sung Woo Park, Byoungwoo Park, Moontae Lee, Changhee Lee",,,,,,,,, Learning Physical Models that Can Respect Conservation Laws,"Derek Hansen, Danielle Robinson, Shima Alizadeh, Gaurav Gupta, Michael Mahoney",http://arxiv.org/abs/2302.11002,,https://huggingface.co/papers/2302.11002,,,,2302.11002,5,0 Implicit Neural Spatial Representations for Time-dependent PDEs,"Honglin Chen, Rundi Wu, Eitan Grinspun, Changxi Zheng, Peter Yichen Chen",http://arxiv.org/abs/2210.00124,,https://huggingface.co/papers/2210.00124,,,,2210.00124,5,1 On the Estimation of Gaussian Mixture Copula Models,Ashu Tewari,,,,,,,,, Patch-level Contrastive Learning via Positional Query for Visual Pre-training,"Shaofeng Zhang, Qiang Zhou, Zhibin Wang, Fan Wang, Junchi Yan",,,,,,,,, Quantifying the Knowledge in GNNs for Reliable Distillation into MLPs,"Lirong Wu, Haitao Lin, Yufei Huang, Stan Z Li",http://arxiv.org/abs/2306.05628,,https://huggingface.co/papers/2306.05628,,,,2306.05628,4,1 Improving Visual Prompt Tuning for Self-supervised Vision Transformers,"Seungryong Yoo, Eunji Kim, Dahuin Jung, JUNGBEOM LEE, Sungroh Yoon",http://arxiv.org/abs/2306.05067,https://github.com/ryongithub/GatedPromptTuning,https://huggingface.co/papers/2306.05067,,,,2306.05067,5,1 Model-Aware Contrastive Learning: Towards Escaping the Dilemmas,"Zizheng Huang, Haoxing Chen, Ziqi Wen, Chao Zhang, Huaxiong Li, Bo Wang, Chunlin Chen",http://arxiv.org/abs/2207.07874,,https://huggingface.co/papers/2207.07874,,,,2207.07874,7,2 A Closer Look at Few-shot Classification Again,"Xu Luo, Hao Wu, Ji Zhang, Lianli Gao, Jing Xu, Jingkuan Song",http://arxiv.org/abs/2301.12246,https://github.com/Frankluox/CloserLookAgainFewShot,https://huggingface.co/papers/2301.12246,,,,2301.12246,6,1 Structured Cooperative Learning with Graphical Model Priors,"Shuangtong Li, Tianyi Zhou, Xinmei Tian, Dacheng Tao",http://arxiv.org/abs/2306.09595,https://github.com/ShuangtongLi/SCooL,https://huggingface.co/papers/2306.09595,,,,2306.09595,4,1 On Penalty-based Bilevel Gradient Descent Method,"Han Shen, Tianyi Chen",http://arxiv.org/abs/2302.05185,,https://huggingface.co/papers/2302.05185,,,,2302.05185,3,0 Beyond Uniform Lipschitz Condition in Differentially Private Optimization,"Rudrajit Das, Satyen Kale, Zheng Xu, Tong Zhang, Sujay Sanghavi",http://arxiv.org/abs/2206.10713,,https://huggingface.co/papers/2206.10713,,,,2206.10713,5,0 Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input,"Shokichi Takakura, Taiji Suzuki",http://arxiv.org/abs/2305.18699,,https://huggingface.co/papers/2305.18699,,,,2305.18699,2,0 Masked Bayesian Neural Networks : Theoretical Guarantee and its Posterior Inference,"Insung Kong, Dongyoon Yang, Jongjin Lee, Ilsang Ohn, GYUSEUNG BAEK, Yongdai Kim",http://arxiv.org/abs/2305.14765,,https://huggingface.co/papers/2305.14765,,,,2305.14765,6,0 Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron,"Jingfeng Wu, Difan Zou, Zixiang Chen, Vladimir Braverman, Quanquan Gu, Sham Kakade",,,,,,,,, Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation,"Uri Sherman, Tomer Koren, Yishay Mansour",http://arxiv.org/abs/2301.13087,,https://huggingface.co/papers/2301.13087,,,,2301.13087,3,0 Online Learning with Feedback Graphs: The True Shape of Regret,"Tomáš Kocák, Alexandra Carpentier",http://arxiv.org/abs/2306.02971,,https://huggingface.co/papers/2306.02971,,,,2306.02971,2,0 A Scalable Frank-Wolfe-Based Algorithm for the Max-Cut SDP,"Chi Bach Pham, Wynita Griggs, James Saunderson",,,,,,,,, Contextual Combinatorial Bandits with Probabilistically Triggered Arms,"Xutong Liu, Jinhang Zuo, Siwei Wang, John C.S. Lui, Mohammad Hajiesmaili, Adam Wierman, Wei Chen",http://arxiv.org/abs/2303.17110,,https://huggingface.co/papers/2303.17110,,,,2303.17110,7,0 CRISP: Curriculum based Sequential neural decoders for Polar code family,"S Ashwin Hebbar, Viraj Nadkarni, Ashok Vardhan Makkuva, Suma Bhat, Sewoong Oh, Pramod Viswanath",http://arxiv.org/abs/2210.00313,,https://huggingface.co/papers/2210.00313,,,,2210.00313,6,1 On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits,"Weitong Zhang, Jiafan He, Jiafan He, Zhiyuan Fan, Quanquan Gu",http://arxiv.org/abs/2303.09390,,https://huggingface.co/papers/2303.09390,,,,2303.09390,4,1 Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3 On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0 Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,1 A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,1 Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0 Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,1 On the Robustness of Randomized Ensembles to Adversarial Perturbations,"Hassan Dbouk, Naresh Shanbhag",http://arxiv.org/abs/2302.01375,https://github.com/hsndbk4/BARRE,https://huggingface.co/papers/2302.01375,,,,2302.01375,2,0 Graph Contrastive Backdoor Attacks,"Hangfan Zhang, Jinghui Chen, Lu Lin, Jinyuan Jia, Dinghao Wu",,,,,,,,, Exploring Model Dynamics for Accumulative Poisoning Discovery,"Jianing Zhu, Xiawei Guo, Jiangchao Yao, Chao Du, LI He, Shuo Yuan, Tongliang Liu, Liang Wang, Bo Han",http://arxiv.org/abs/2306.03726,https://github.com/tmlr-group/Memorization-Discrepancy,https://huggingface.co/papers/2306.03726,,,,2306.03726,9,0 Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score,"Shuhai Zhang, Feng Liu, Jiahao Yang, 逸凡 杨, Changsheng Li, Bo Han, Mingkui Tan",http://arxiv.org/abs/2305.16035,,https://huggingface.co/papers/2305.16035,,,,2305.16035,7,0 Progressive Purification for Instance-Dependent Partial Label Learning,"Ning Xu, biao liu, JIAQI LYU, Congyu Qiao, Xin Geng",http://arxiv.org/abs/2206.00830,,https://huggingface.co/papers/2206.00830,,,,2206.00830,5,0 Safe Offline Reinforcement Learning with Real-Time Budget Constraints,"qian lin, Tang Bo, Zifan Wu, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang",http://arxiv.org/abs/2306.00603,,https://huggingface.co/papers/2306.00603,,,,2306.00603,8,0 Semi-Offline Reinforcement Learning for Optimized Text Generation,"Changyu Chen, Xiting Wang, Yiqiao Jin, Victor Ye Dong, Li Dong, Rui Yan, Jie Cao, Yi Liu",http://arxiv.org/abs/2306.09712,,https://huggingface.co/papers/2306.09712,,,,2306.09712,8,0 LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework,"WOOJUN KIM, Jeonghye Kim, Youngchul Sung",,,,,,,,, Multi-task Representation Learning for Pure Exploration in Linear Bandits,"Yihan Du, Longbo Huang, Wen Sun",http://arxiv.org/abs/2302.04441,,https://huggingface.co/papers/2302.04441,,,,2302.04441,3,0 Multi-Objective GFlowNets,"Moksh Jain, Sharath Chandra Raparthy, Alex Hernandez-Garcia, Jarrid Rector-Brooks, Yoshua Bengio, Santiago Miret, Emmanuel Bengio",http://arxiv.org/abs/2210.12765,,https://huggingface.co/papers/2210.12765,,,,2210.12765,7,2 Long-Term Rhythmic Video Soundtracker,"Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao",http://arxiv.org/abs/2305.01319,https://github.com/OpenGVLab/LORIS,https://huggingface.co/papers/2305.01319,,,,2305.01319,5,1 Global Context Vision Transformers,"Ali Hatamizadeh, Hongxu Yin, Greg Heinrich, Jan Kautz, Pavlo Molchanov",http://arxiv.org/abs/2206.09959,,https://huggingface.co/papers/2206.09959,,,,2206.09959,4,1 Modality-Agnostic Variational Compression of Implicit Neural Representations,"Jonathan Richard Schwarz, Jihoon Tack, Yee-Whye Teh, Jaeho Lee, Jinwoo Shin",http://arxiv.org/abs/2301.09479,,https://huggingface.co/papers/2301.09479,,,,2301.09479,5,2 Diffusion Based Representation Learning,"Sarthak Mittal, Korbinian Abstreiter, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou",,,,,,,,, Adaptively Weighted Data Augmentation Consistency Regularization for Robust Optimization under Concept Shift,"Yijun Dong, Yuege Xie, Rachel Ward",http://arxiv.org/abs/2210.01891,,https://huggingface.co/papers/2210.01891,,,,2210.01891,3,0 Neural Diffusion Processes,"Vincent Dutordoir, Alan Saul, Zoubin Ghahramani, Fergus Simpson",http://arxiv.org/abs/2206.03992,,https://huggingface.co/papers/2206.03992,,,,2206.03992,4,0 Constrained Causal Bayesian Optimization,"Virginia Aglietti, Alan Malek, Ira Ktena, Silvia Chiappa",http://arxiv.org/abs/2305.20011,,https://huggingface.co/papers/2305.20011,,,,2305.20011,4,0 On Data Manifolds Entailed by Structural Causal Models,"Ricardo Dominguez-Olmedo, Amir-Hossein Karimi, Georgios Arvanitidis, Bernhard Schölkopf",,,,,,,,, Comparison of meta-learners for estimating multi-valued treatment heterogeneous effects,"Naoufal Acharki, Ramiro Lugo, Antoine Bertoncello, Josselin Garnier",http://arxiv.org/abs/2205.14714,,https://huggingface.co/papers/2205.14714,,,,2205.14714,4,0 LinSATNet: The Positive Linear Satisfiability Neural Networks,"Runzhong Wang, Yunhao Zhang, Ziao Guo, Tianyi Chen, Xiaokang Yang, Junchi Yan",,,,,,,,, On the Complexity of Bayesian Generalization,"Yu-Zhe Shi, Manjie Xu, John Hopcroft, Kun He, Josh Tenenbaum, Song-Chun Zhu, Ying Nian Wu, Wenjuan Han, Yixin Zhu",http://arxiv.org/abs/2211.11033,,https://huggingface.co/papers/2211.11033,,,,2211.11033,9,0 QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark,"Xudong Lu, Kaisen Pan, Ge Yan, Jiaming Shan, Wenjie Wu, Junchi Yan",,,,,,,,, Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits,Markus Bläser,,,,,,,,, PAL: Program-aided Language Models,"Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig",http://arxiv.org/abs/2211.10435,,https://huggingface.co/papers/2211.10435,,,,2211.10435,8,2 Tighter Bounds on the Expressivity of Transformer Encoders,"David Chiang, Peter Cholak, Anand Pillay",http://arxiv.org/abs/2301.10743,,https://huggingface.co/papers/2301.10743,,,,2301.10743,3,0 Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation,"Joonhyuk Yang, Shin Dongpil, Hye Won Chung",http://arxiv.org/abs/2305.19666,,https://huggingface.co/papers/2305.19666,,,,2305.19666,3,0 Causal Discovery with Latent Confounders Based on Higher-Order Cumulants,"Ruichu Cai, Zhiyi Huang, Wei Chen, Zhifeng Hao, Kun Zhang",http://arxiv.org/abs/2305.19582,,https://huggingface.co/papers/2305.19582,,,,2305.19582,5,0 The Computational Complexity of Concise Hypersphere Classification,"Eduard Eiben, Robert Ganian, Iyad Kanj, Sebastian Ordyniak, Stefan Szeider",,,,,,,,, Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning,"Sébastien Lachapelle, Tristan Deleu, Divyat Mahajan, Ioannis Mitliagkas, Yoshua Bengio, Simon Lacoste-Julien, Quentin Bertrand",http://arxiv.org/abs/2211.14666,,https://huggingface.co/papers/2211.14666,,,,2211.14666,7,1 Accelerated Stochastic Optimization Methods under Quasar-convexity,"Qiang Fu, Dongchu Xu, Ashia Wilson",http://arxiv.org/abs/2305.04736,,https://huggingface.co/papers/2305.04736,,,,2305.04736,3,0 Learning Rate Schedules in the Presence of Distribution Shift,"Matthew Fahrbach, Adel Javanmard, Vahab Mirrokni, Pratik Worah",http://arxiv.org/abs/2303.15634,,https://huggingface.co/papers/2303.15634,,,,2303.15634,4,1 Scalable Safe Policy Improvement via Monte Carlo Tree Search,"Alberto Castellini, Federico Bianchi, Edoardo Zorzi, Thiago Simão, Alessandro Farinelli, Matthijs T. J. Spaan",,,,,,,,, Provable Reset-free Reinforcement Learning by No-Regret Reduction,"Hoai-An Nguyen, Ching-An Cheng",http://arxiv.org/abs/2301.02389,,https://huggingface.co/papers/2301.02389,,,,2301.02389,2,0 Gibbsian Polar Slice Sampling,"Philip Schär, Michael Habeck, Daniel Rudolf",http://arxiv.org/abs/2302.03945,,https://huggingface.co/papers/2302.03945,,,,2302.03945,3,0 Stochastic Gradient Descent under Markov-Chain Sampling Schemes,Mathieu Even,,,,,,,,, Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian Regret Bounds,"Shion Takeno, Yu Inatsu, Masayuki Karasuyama",http://arxiv.org/abs/2302.01511,,https://huggingface.co/papers/2302.01511,,,,2302.01511,3,0 Enabling First-Order Gradient-Based Learning for Equilibrium Computation in Markets,"Nils Kohring, Fabian R. Pieroth, Martin Bichler",http://arxiv.org/abs/2303.09500,,https://huggingface.co/papers/2303.09500,,,,2303.09500,3,0 Semiparametrically Efficient Off-Policy Evaluation in Linear Markov Decision Processes,"Chuhan Xie, Wenhao Yang, Zhihua Zhang",,,,,,,,, Estimation Beyond Data Reweighting: Kernel Method of Moments,"Heiner Kremer, Yassine Nemmour, Bernhard Schölkopf, Jia-Jie Zhu",http://arxiv.org/abs/2305.10898,,https://huggingface.co/papers/2305.10898,,,,2305.10898,4,0 PCA-based Multi-Task Learning: a Random Matrix Approach,"Malik TIOMOKO, Romain COUILLET, Frederic Pascal",,,,,,,,, Sparsity by Redundancy: Solving $L_1$ with SGD,"Liu Ziyin, Zihao Wang",,,,,,,,, On the convergence of the MLE as an estimator of the learning rate in the Exp3 algorithm,"Julien Aubert, Luc Lehéricy, Patricia Reynaud-Bouret",http://arxiv.org/abs/2305.06660,,https://huggingface.co/papers/2305.06660,,,,2305.06660,3,0 Efficient Transformed Gaussian Processes for Non-Stationary Dependent Multi-class Classification,"Juan Maroñas Molano, Daniel Hernández-Lobato",http://arxiv.org/abs/2205.15008,,https://huggingface.co/papers/2205.15008,,,,2205.15008,2,0 Functional Neural Networks: Shift invariant models for functional data with applications to EEG classification,"Florian Heinrichs, Mavin Heim, Corinna Weber",http://arxiv.org/abs/2301.05869,,https://huggingface.co/papers/2301.05869,,,,2301.05869,3,0 A Deep Conjugate Direction Method for Iteratively Solving Linear Systems,"Ayano Kaneda, Osman Akar, Jingyu Chen, Victoria Kala, David Hyde, Joseph Teran",http://arxiv.org/abs/2205.10763,,https://huggingface.co/papers/2205.10763,,,,2205.10763,6,0 Free-Form Variational Inference for Gaussian Process State-Space Models,"Xuhui Fan, Edwin V Bonilla, Terence O'kane, Scott SIsson",http://arxiv.org/abs/2302.09921,,https://huggingface.co/papers/2302.09921,,,,2302.09921,4,0 "Bayesian Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation of Latent Gaussian Models","Alexander Lin, Bahareh Tolooshams, Yves Atchade, Demba Ba",,,,,,,,, Von Mises Mixture Distributions for Molecular Conformation Generation,"Kirk Swanson, Jake Williams, Eric Jonas",http://arxiv.org/abs/2306.07472,,https://huggingface.co/papers/2306.07472,,,,2306.07472,3,0 Unearthing InSights into Mars: Unsupervised Source Separation with Limited Data,"Ali Siahkoohi, Rudy Morel, Maarten de Hoop, Erwan Allys, Gregory Sainton, Taichi Kawamura",http://arxiv.org/abs/2301.11981,,https://huggingface.co/papers/2301.11981,,,,2301.11981,6,1 Eliminating Adversarial Noise via Information Discard and Robust Representation Restoration,"Dawei Zhou, Yukun Chen, Nannan Wang, Decheng Liu, Xinbo Gao, Tongliang Liu",,,,,,,,, Chameleon: Adapting to Peer Images for Planting Durable Backdoors in Federated Learning,"Yanbo Dai, Songze Li",http://arxiv.org/abs/2304.12961,,https://huggingface.co/papers/2304.12961,,,,2304.12961,2,0 Forget Unlearning: Towards True Data-Deletion in Machine Learning,"Rishav Chourasia, Neil Shah",http://arxiv.org/abs/2210.08911,,https://huggingface.co/papers/2210.08911,,,,2210.08911,2,0 Performative Recommendation: Diversifying Content via Strategic Incentives,"Itay Eilat, Nir Rosenfeld",http://arxiv.org/abs/2302.04336,,https://huggingface.co/papers/2302.04336,,,,2302.04336,2,0 Model Transferability with Responsive Decision Subjects,"Yatong Chen, Zeyu Tang, Kun Zhang, Yang Liu",http://arxiv.org/abs/2107.05911,,https://huggingface.co/papers/2107.05911,,,,2107.05911,4,0 Individually Fair Learning with One-Sided Feedback,"Yahav Bechavod, Aaron Roth",http://arxiv.org/abs/2206.04475,,https://huggingface.co/papers/2206.04475,,,,2206.04475,2,0 Online Learning in Stackelberg Games with an Omniscient Follower,"Geng Zhao, Banghua Zhu, Jiantao Jiao, Michael Jordan",http://arxiv.org/abs/2301.11518,,https://huggingface.co/papers/2301.11518,,,,2301.11518,4,1 Surface Snapping Optimization Layer for Single Image Object Shape Reconstruction,"Yuan-Ting Hu, Alex Schwing, Raymond A. Yeh",,,,,,,,, Robustness in Multimodal Learning under Train-Test Modality Mismatch,"Brandon McKinzie, Vaishaal Shankar, Joseph Cheng, Yinfei Yang, Jonathon Shlens, Alexander Toshev",,,,,,,,, Learning Representations without Compositional Assumptions,"Tennison Liu, Jeroen Berrevoets, Zhaozhi Qian, Mihaela van der Schaar",http://arxiv.org/abs/2305.19726,,https://huggingface.co/papers/2305.19726,,,,2305.19726,4,0 Making Transformers Compute-lite for CPU inference,"Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh",,,,,,,,, Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers,"Grant Strimel, Yi Xie, Brian King, martin radfar, Ariya Rastrow, Athanasios Mouchtaris",http://arxiv.org/abs/2305.04159,,https://huggingface.co/papers/2305.04159,,,,2305.04159,6,0 Expected Gradients of Maxout Networks and Consequences to Parameter Initialization,"Hanna Tseran, Guido Montufar",http://arxiv.org/abs/2301.06956,,https://huggingface.co/papers/2301.06956,,,,2301.06956,2,1 Competing for Shareable Arms in Multi-Player Multi-Armed Bandits,"Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui",http://arxiv.org/abs/2305.19158,,https://huggingface.co/papers/2305.19158,,,,2305.19158,5,1 Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs,"Raif Rustamov, Subhabrata Majumdar",http://arxiv.org/abs/2010.15285,,https://huggingface.co/papers/2010.15285,,,,2010.15285,2,1 Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets,"Yurong Chen, Zhaohua Chen, Xiaotie Deng, Zhijian Duan, Haoran Sun, Qian Wang, Xiang Yan",http://arxiv.org/abs/2306.07709,,https://huggingface.co/papers/2306.07709,,,,2306.07709,7,0 Causal Modeling of Policy Interventions From Sequences of Treatments and Outcomes using Gaussian Processes,"Çağlar Hızlı, ST John, Anne Juuti, Tuure Saarinen, Kirsi Pietiläinen, Pekka Marttinen",,,,,,,,, A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems,"Oliver Slumbers, David Mguni, Stephen Mcaleer, Stefano Blumberg, Yaodong Yang, Jun Wang",http://arxiv.org/abs/2205.15434,,https://huggingface.co/papers/2205.15434,,,,2205.15434,6,0 SurCo: Learning SURrogate costs for COmbinatorial Nonlinear Optimization Problems,"Aaron Ferber, Taoan Huang, Daochen Zha, Martin Schubert, Benoit Steiner, Bistra Dilkina, Yuandong Tian",,,,,,,,, Fast Online Node Labeling for Very Large Graphs,"Baojian Zhou, Yifan Sun, Reza Babanezhad",http://arxiv.org/abs/2305.16257,,https://huggingface.co/papers/2305.16257,,,,2305.16257,3,0 Improving the Model Consistency of Decentralized Federated Learning,"Yifan Shi, Li Shen, Kang Wei, Yan Sun, Bo Yuan, Xueqian Wang, Dacheng Tao",http://arxiv.org/abs/2302.04083,,https://huggingface.co/papers/2302.04083,,,,2302.04083,7,0 Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization,"SIJIA CHEN, Wei-Wei Tu, Peng Zhao, Lijun Zhang",http://arxiv.org/abs/2302.04552,,https://huggingface.co/papers/2302.04552,,,,2302.04552,4,0 Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization,"Lesi Chen, Jing Xu, Luo Luo",http://arxiv.org/abs/2301.06428,,https://huggingface.co/papers/2301.06428,,,,2301.06428,3,0 One-Step Estimator for Permuted Sparse Recovery,"Hang Zhang, Ping Li",,,,,,,,, Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator,Alexander Shekhovtsov,,,,,,,,, Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection,"Lorenzo Perini, Paul Buerkner, Arto Klami",http://arxiv.org/abs/2210.10487,,https://huggingface.co/papers/2210.10487,,,,2210.10487,3,0 Image generation with shortest path diffusion,"Ayan Das, Ayan Das, Stathi Fotiadis, Anil Batra, Farhang Nabiei, FengTing Liao, Sattar Vakili, Da-shan Shiu, Alberto Bernacchia",http://arxiv.org/abs/2306.00501,,https://huggingface.co/papers/2306.00501,,,,2306.00501,8,3 Deep Anomaly Detection under Labeling Budget Constraints,"Aodong Li, Chen Qiu, Padhraic Smyth, Marius Kloft, Stephan Mandt, Maja Rudolph",http://arxiv.org/abs/2302.07832,,https://huggingface.co/papers/2302.07832,,,,2302.07832,6,1 Transformed Distribution Matching for Missing Value Imputation,"He Zhao, Ke Sun, Amir Dezfouli, Edwin V Bonilla",http://arxiv.org/abs/2302.10363,,https://huggingface.co/papers/2302.10363,,,,2302.10363,4,0 Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?,"Ruisi Cai, Zhenyu Zhang, Zhangyang “Atlas” Wang",http://arxiv.org/abs/2302.12480,,https://huggingface.co/papers/2302.12480,,,,2302.12480,3,1 Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models,"Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel",,,,,,,,, Better Diffusion Models Further Improve Adversarial Training,"Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng YAN",http://arxiv.org/abs/2302.04638,https://github.com/wzekai99/DM-Improves-AT,https://huggingface.co/papers/2302.04638,,,,2302.04638,6,2 On the Expressive Power of Geometric Graph Neural Networks,"Chaitanya Joshi, Cristian Bodnar, Simon Mathis, Taco Cohen, Pietro Lió",http://arxiv.org/abs/2301.09308,https://github.com/chaitjo/geometric-gnn-dojo,https://huggingface.co/papers/2301.09308,,,,2301.09308,5,0 Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapalli,http://arxiv.org/abs/2306.04004,,https://huggingface.co/papers/2306.04004,,,,2306.04004,1,1 Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,1 Hierarchical Diffusion for Offline Decision Making,"Wenhao Li, Xiangfeng Wang, Bo Jin, Hongyuan Zha",,,,,,,,, Generated Graph Detection,"Yihan Ma, Zhikun Zhang, Ning Yu, Xinlei He, Michael Backes, Yun Shen, Yang Zhang",http://arxiv.org/abs/2306.07758,,https://huggingface.co/papers/2306.07758,,,,2306.07758,7,0 Variational Open-Domain Question Answering,"Valentin Liévin, Andreas Geert Motzfeldt, Ida Jensen, Ole Winther",http://arxiv.org/abs/2210.06345,,https://huggingface.co/papers/2210.06345,,,,2210.06345,4,2 PromptBoosting: Black-Box Text Classification with Ten Forward Passes,"Bairu Hou, Joe O'Connor, Jacob Andreas, Shiyu Chang, Yang Zhang",http://arxiv.org/abs/2212.09257,,https://huggingface.co/papers/2212.09257,,,,2212.09257,5,0 Gradient-Free Structured Pruning with Unlabeled Data,"Azade Nova, Hanjun Dai, Dale Schuurmans",http://arxiv.org/abs/2303.04185,,https://huggingface.co/papers/2303.04185,,,,2303.04185,3,0 Text-To-Concept (and Back) via Cross-Model Alignment,"Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi",http://arxiv.org/abs/2305.06386,,https://huggingface.co/papers/2305.06386,,,,2305.06386,4,2 Understand and Modularize Generator Optimization in ELECTRA-style Pretraining,"Chengyu Dong, Liyuan Liu, Hao Cheng, Jingbo Shang, Jianfeng Gao, Xiaodong Liu",,,,,,,,, Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution,"Ruofan Zhang, Ruofan Zhang, Haoyu Chen, Chao Dong, Yulun Zhang, Wenming Yang",http://arxiv.org/abs/2305.18107,,https://huggingface.co/papers/2305.18107,,,,2305.18107,6,0 Sketching for First Order Method: Efficient Algorithm for Low-Bandwidth Channel and Vulnerability,"Zhao Song, Yitan Wang, Zheng Yu, Lichen Zhang",http://arxiv.org/abs/2210.08371,,https://huggingface.co/papers/2210.08371,,,,2210.08371,4,0 Maximum Optimality Margin: A Unified Approach for Contextual Linear Programming and Inverse Linear Programming,"Chunlin Sun, Shang Liu, Xiaocheng Li",http://arxiv.org/abs/2301.11260,,https://huggingface.co/papers/2301.11260,,,,2301.11260,3,0 Quantum Policy Gradient Algorithm with Optimized Action Decoding,"Nico Meyer, Daniel Scherer, Axel Plinge, Christopher Mutschler, Michael Hartmann",http://arxiv.org/abs/2212.06663,,https://huggingface.co/papers/2212.06663,,,,2212.06663,5,0 Computational Doob h-transforms for Online Filtering of Discretely Observed Diffusions,"Nicolas Chopin, Andras Fulop, Jeremy Heng, Alex Thiery",,,,,,,,, Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster $\text{L}$-/$\text{L}^\natural$-Convex Function Minimization,"Shinsaku Sakaue, Taihei Oki",,,,,,,,, On Distribution Dependent Sub-Logarithmic Query Time of Learned Indexing,"Sepanta Zeighami, Cyrus Shahabi",,,,,,,,, Lower Bounds for Learning in Revealing POMDPs,"Fan Chen, Huan Wang, Caiming Xiong, Song Mei, Yu Bai",http://arxiv.org/abs/2302.01333,,https://huggingface.co/papers/2302.01333,,,,2302.01333,5,1 On Computing Optimal Tree Ensembles,"Christian Komusiewicz, Pascal Kunz, Frank Sommer, Manuel Sorge",http://arxiv.org/abs/2306.04423,,https://huggingface.co/papers/2306.04423,,,,2306.04423,4,0 Partially Observable Multi-agent RL with Provable (Quasi-)Efficiency: Information-Sharing to the Rescue,"Xiangyu Liu, Kaiqing Zhang",,,,,,,,, Cross-Entropy Loss Functions: Theoretical Analysis and Applications,"Anqi Mao, Mehryar Mohri, Yutao Zhong",http://arxiv.org/abs/2304.07288,,https://huggingface.co/papers/2304.07288,,,,2304.07288,3,0 Statistical Inference and A/B Testing for First-Price Pacing Equilibria,"Luofeng Liao, Christian Kroer",http://arxiv.org/abs/2301.02276,,https://huggingface.co/papers/2301.02276,,,,2301.02276,2,0 On Many-Actions Policy Gradient,"Michal Nauman, Marek Cygan",http://arxiv.org/abs/2210.13011,,https://huggingface.co/papers/2210.13011,,,,2210.13011,2,0 Blossom: an Anytime Algorithm for Computing Optimal Decision Trees,"Emir Demirović, Emmanuel Hebrard, Louis Jean",,,,,,,,, Multi-Agent Best Arm Identification with Private Communications,"Alexandre Rio, Merwan Barlier, Igor Colin, Marta Soare",,,,,,,,, The Test of Tests: A Framework for Differentially Private Hypothesis Testing,"Zeki Kazan, Kaiyan Shi, Adam Groce, Andrew Bray",http://arxiv.org/abs/2302.04260,,https://huggingface.co/papers/2302.04260,,,,2302.04260,4,0 Understanding the Role of Feedback in Online Learning with Switching Costs,"Duo Cheng, Xingyu Zhou, Bo Ji",http://arxiv.org/abs/2306.09588,,https://huggingface.co/papers/2306.09588,,,,2306.09588,3,0 On Coresets for Clustering in Small Dimensional Euclidean spaces,"Lingxiao Huang, Ruiyuan Huang, Zengfeng Huang, Xuan Wu",http://arxiv.org/abs/2302.13737,,https://huggingface.co/papers/2302.13737,,,,2302.13737,4,0 Regret-Minimizing Double Oracle for Extensive-Form Games,"Xiaohang Tang, Le Cong Dinh, Stephen Mcaleer, Yaodong Yang",http://arxiv.org/abs/2304.10498,,https://huggingface.co/papers/2304.10498,,,,2304.10498,4,1 Prometheus: Taming Sample and Communication Complexities in Constrained Decentralized Stochastic Bilevel Learning,"Zhuqing Liu, Xin Zhang, Prashant Khanduri, Songtao Lu, Jia Liu",,,,,,,,, GC-Flow: A Graph-Based Flow Network for Effective Clustering,"Tianchun Wang, Farzaneh Mirzazadeh, Xiang Zhang, Jie Chen",,,,,,,,, The Saddle-Point Method in Differential Privacy,"Wael Alghamdi, Shahab Asoodeh, Flavio Calmon, Juan Gomez, Oliver Kosut, Lalitha Sankar",,,,,,,,, Geometric Autoencoders - What You See is What You Decode,"Philipp Nazari, Sebastian Damrich, Fred Hamprecht",,,,,,,,, Mitigating Memorization of Noisy Labels by Clipping the Model Prediction,"Hongxin Wei, HUIPING ZHUANG, RENCHUNZI XIE, LEI FENG, Gang Niu, Bo An, Yixuan Li",http://arxiv.org/abs/2212.04055,,https://huggingface.co/papers/2212.04055,,,,2212.04055,7,0 Dink-Net: Neural Clustering on Large Graphs,"Yue Liu, KE LIANG, Jun Xia, sihang zhou, xihong yang, Xinwang Liu, Stan Z Li",,,,,,,,, RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation,"Liming Zhao, Liming Zhao, Kecheng Zheng, Yun Zheng, Deli Zhao, Jingren Zhou",,,,,,,,, Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders,"Oskar Kviman, Ricky Molén, Alexandra Hotti, Semih Kurt, Víctor Elvira, Jens Lagergren",,,,,,,,, Social learning spontaneously emerges by searching optimal heuristics with deep reinforcement learning,"Seungwoong Ha, Hawoong Jeong",http://arxiv.org/abs/2204.12371,,https://huggingface.co/papers/2204.12371,,,,2204.12371,2,0 Fast Excess Risk Rates via Offset Rademacher Complexity,"Chenguang Duan, Yuling Jiao, Lican Kang, Xiliang Lu, Jerry Yang",,,,,,,,, Continual Learning in Linear Classification on Separable Data,"Itay Evron, Edward Moroshko, Gon Buzaglo, Maroun Khriesh, Badea Marjieh, Nati Srebro, Daniel Soudry",http://arxiv.org/abs/2306.03534,,https://huggingface.co/papers/2306.03534,,,,2306.03534,7,0 Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning,"aqeel labash, Florian Stelzer, Daniel Majoral Lopez, Raul Vicente",,,,,,,,, Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving,"Wei-Xin Li, Xiaodong Yang",http://arxiv.org/abs/2306.07276,,https://huggingface.co/papers/2306.07276,,,,2306.07276,2,0 Computational Asymmetries in Robust Classification,"Samuele Marro, Michele Lombardi",,,,,,,,, Curriculum Co-disentangled Representation Learning across Multiple Environments for Social Recommendation,"Xin Wang, Zirui Pan, Yuwei Zhou, Hong Chen, Chendi Ge, Wenwu Zhu",,,,,,,,, Better Training of GFlowNets with Local Credit and Incomplete Trajectories,"Ling Pan, Nikolay Malkin, Dinghuai Zhang, Yoshua Bengio",http://arxiv.org/abs/2302.01687,,https://huggingface.co/papers/2302.01687,,,,2302.01687,4,2 Multi-channel Autobidding with Budget and ROI Constraints,"Yuan Deng, Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni",http://arxiv.org/abs/2302.01523,,https://huggingface.co/papers/2302.01523,,,,2302.01523,5,0 Random Teachers are Good Teachers,"Felix Sarnthein, Gregor Bachmann, Sotiris Anagnostidis, Thomas Hofmann",http://arxiv.org/abs/2302.12091,,https://huggingface.co/papers/2302.12091,,,,2302.12091,4,0 Efficient Distribution-Free Predictive Inference for Standard and Feedback Covariate Shift,"Andrew Prinster, Suchi Saria, Anqi Liu",,,,,,,,, Equivariant Polynomials for Graph Neural Networks,"Omri Puny, Derek Lim, Bobak T Kiani, Haggai Maron, Yaron Lipman",http://arxiv.org/abs/2302.11556,,https://huggingface.co/papers/2302.11556,,,,2302.11556,5,0 Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally Coupled Oscillatory Recurrent Neural Networks,"T. Anderson Keller, Max Welling",,,,,,,,, Fair and Accurate Decision Making through Group-Aware Learning,"Ramtin Hosseini, Li Zhang, Bhanu Garg, Pengtao Xie",,,,,,,,, Nonlinear Advantage: Trained Networks Might Not Be As Complex as You Think,"Christian H.X. Ali Mehmeti-Göpel, Jan Disselhoff",http://arxiv.org/abs/2211.17180,,https://huggingface.co/papers/2211.17180,,,,2211.17180,2,0 Bidirectional Learning for Offline Model-based Biological Sequence Design,"Can Chen, Yingxue Zhang, Xue Liu, Mark Coates",http://arxiv.org/abs/2301.02931,,https://huggingface.co/papers/2301.02931,,,,2301.02931,4,0 SeedGNN: Graph Neural Network for Supervised Seeded Graph Matching,"Liren Yu, Xiaojun Lin, Jiaming Xu",,,,,,,,, Loss Balancing for Fair Supervised Learning,"Mahdi Khalili, Xueru Zhang, Mahed Abroshan",,,,,,,,, TAN Without a Burn: Scaling Laws of DP-SGD,"Tom Sander, Tom Sander, Pierre Stock, Alexandre Sablayrolles",http://arxiv.org/abs/2210.03403,,https://huggingface.co/papers/2210.03403,,,,2210.03403,3,1 Controllable Neural Symbolic Regression,"Tommaso Bendinelli, Luca Biggio, Pierre-Alexandre Kamienny",http://arxiv.org/abs/2304.10336,,https://huggingface.co/papers/2304.10336,,,,2304.10336,3,0 Predictable MDP Abstraction for Unsupervised Model-Based RL,"Seohong Park, Sergey Levine",http://arxiv.org/abs/2302.03921,,https://huggingface.co/papers/2302.03921,,,,2302.03921,2,0 Deep Laplacian-based Options for Temporally-Extended Exploration,"Martin Klissarov, Marlos C. Machado",http://arxiv.org/abs/2301.11181,,https://huggingface.co/papers/2301.11181,,,,2301.11181,2,0 Structure Learning of Latent Factors via Clique Search on Correlation Thresholded Graphs,"Dale Kim, Qing Zhou",http://arxiv.org/abs/2203.01471,,https://huggingface.co/papers/2203.01471,,,,2203.01471,2,0 Rethinking Visual Reconstruction: Experience-Based Content Completion Guided by Visual Cues,"Jiaxuan Chen, Yu Qi, Gang Pan",,,,,,,,, Continuously Parameterized Mixture Models,"Christopher Bender, Yifeng Shi, Marc Niethammer, Junier Oliva",,,,,,,,, Estimating Heterogeneous Treatment Effects: Mutual Information Bounds and Learning Algorithms,"Xingzhuo Guo, Yuchen Zhang, Jianmin Wang, Mingsheng Long",,,,,,,,, Best Arm Identification in Multi-Agent Multi-Armed Bandits,"Filippo Vannella, Alexandre Proutiere, Jaeseong Jeong",,,,,,,,, Label differential privacy and private training data release,"Robert Busa-Fekete, andres munoz, Umar Syed, Sergei Vassilvitskii",,,,,,,,, Data Efficient Neural Scaling Law via Model Reusing,"Peihao Wang, Rameswar Panda, Zhangyang “Atlas” Wang",,,,,,,,, Truncating Trajectories in Monte Carlo Reinforcement Learning,"Riccardo Poiani, Alberto Maria Metelli, Marcello Restelli",http://arxiv.org/abs/2305.04361,,https://huggingface.co/papers/2305.04361,,,,2305.04361,3,0 The Value of Out-of-Distribution Data,"Ashwin De Silva, Rahul Ramesh, Carey Priebe, Pratik Chaudhari, Joshua Vogelstein",http://arxiv.org/abs/2208.10967,,https://huggingface.co/papers/2208.10967,,,,2208.10967,5,1 Actor-Critic Alignment for Offline-to-Online Reinforcement Learning,"Zishun Yu, Xinhua Zhang",,,,,,,,, Locally Regularized Neural Differential Equations: Some Black Boxes were meant to remain closed!,"Avik Pal, Alan Edelman, Christopher Rackauckas",http://arxiv.org/abs/2303.02262,,https://huggingface.co/papers/2303.02262,,,,2303.02262,3,1 ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval,"Kexun Zhang, Xianjun Yang, William Wang, Lei Li",http://arxiv.org/abs/2302.02285,,https://huggingface.co/papers/2302.02285,,,,2302.02285,4,2 The Monge Gap: A Regularizer to Learn All Transport Maps,"Théo Uscidda, Marco Cuturi",http://arxiv.org/abs/2302.04953,,https://huggingface.co/papers/2302.04953,,,,2302.04953,2,0 AbODE: Ab initio antibody design using conjoined ODEs,"Yogesh Verma, Markus Heinonen, Vikas K Garg",http://arxiv.org/abs/2306.01005,,https://huggingface.co/papers/2306.01005,,,,2306.01005,3,0 Learning-augmented private algorithms for multiple quantile release,"Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii",http://arxiv.org/abs/2210.11222,,https://huggingface.co/papers/2210.11222,,,,2210.11222,4,0 Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds,"Shengshi Li, Lin Yang",,,,,,,,, Variational Autoencoding Neural Operators,"Jacob H. Seidman, Georgios Kissas, George J. Pappas, Paris Perdikaris",http://arxiv.org/abs/2302.10351,,https://huggingface.co/papers/2302.10351,,,,2302.10351,4,1 Efficient Parametric Approximations of Neural Network Function Space Distance,"Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse",http://arxiv.org/abs/2302.03519,,https://huggingface.co/papers/2302.03519,,,,2302.03519,4,0 Theory on Forgetting and Generalization of Continual Learning,"Sen Lin, Peizhong Ju, Yingbin LIANG, Ness Shroff",http://arxiv.org/abs/2302.05836,,https://huggingface.co/papers/2302.05836,,,,2302.05836,4,0 Trapdoor Normalization with Irreversible Ownership Verification,"Hanwen Liu, Zhenyu Weng, Yuesheng Zhu, Yadong Mu",,,,,,,,, Discover and Cure: Concept-aware Mitigation of Spurious Correlation,"Yingxin Wu, Mert Yuksekgonul, Linjun Zhang, James Zou",http://arxiv.org/abs/2305.00650,https://github.com/Wuyxin/DISC,https://huggingface.co/papers/2305.00650,,,,2305.00650,4,0 DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization,"Adel Nabli, Edouard Oyallon",http://arxiv.org/abs/2208.00779,,https://huggingface.co/papers/2208.00779,,,,2208.00779,2,0 DoCoFL: Downlink Compression for Cross-Device Federated Learning,"Ron Dorfman, Shay Vargaftik, Yaniv Ben Itzhak, Kfir Levy",,,,,,,,, Compositional Exemplars for In-context Learning,"Jiacheng Ye, Zhiyong Wu, Jiangtao Feng, Tao Yu, Lingpeng Kong",http://arxiv.org/abs/2302.05698,https://github.com/HKUNLP/icl-ceil,https://huggingface.co/papers/2302.05698,,,,2302.05698,5,1 Generating Language Corrections for Teaching Physical Control Tasks,"Megha Srivastava, Noah Goodman, Dorsa Sadigh",http://arxiv.org/abs/2306.07012,,https://huggingface.co/papers/2306.07012,,,,2306.07012,3,1 Does Continual Learning Equally Forget All Parameters?,"Haiyan Zhao, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang",http://arxiv.org/abs/2304.04158,,https://huggingface.co/papers/2304.04158,,,,2304.04158,5,1 Sequential Changepoint Detection via Backward Confidence Sequences,"Shubhanshu Shekhar, Aaditya Ramdas",,,,,,,,, simple diffusion: End-to-end diffusion for high resolution images,"Emiel Hoogeboom, Jonathan Heek, Tim Salimans",http://arxiv.org/abs/2301.11093,,https://huggingface.co/papers/2301.11093,,,,2301.11093,3,0 Layered State Discovery for Incremental Autonomous Exploration,"Liyu Chen, Andrea Tirinzoni, Alessandro Lazaric, Matteo Pirotta",http://arxiv.org/abs/2302.03789,,https://huggingface.co/papers/2302.03789,,,,2302.03789,4,0 CataBEEM: Integrating Latent Interaction Categories in Node-wise Community Detection Models for Network Data,"Yuhua Zhang, Walter Dempsey",,,,,,,,, QuantumDARTS: Differentiable Quantum Architecture Search for Variational Quantum Algorithms,"Wenjie Wu, Ge Yan, Xudong Lu, Kaisen Pan, Junchi Yan",,,,,,,,, Language Instructed Reinforcement Learning for Human-AI Coordination,"Hengyuan Hu, Dorsa Sadigh",http://arxiv.org/abs/2304.07297,,https://huggingface.co/papers/2304.07297,,,,2304.07297,2,1 A Neural PDE Solver with Temporal Stencil Modeling,"Zhiqing Sun, Yiming Yang, Shinjae Yoo",http://arxiv.org/abs/2302.08105,https://github.com/Edward-Sun/TSM-PDE,https://huggingface.co/papers/2302.08105,,,,2302.08105,3,0 Deep Clustering with Incomplete Noisy Pairwise Annotations: A Geometric Regularization Approach,"Tri Nguyen, Shahana Ibrahim, Xiao Fu",http://arxiv.org/abs/2305.19391,,https://huggingface.co/papers/2305.19391,,,,2305.19391,3,0 Distance Weighted Supervised Learning for Offline Interaction Data,"Joey Hejna, Jensen Gao, Dorsa Sadigh",http://arxiv.org/abs/2304.13774,,https://huggingface.co/papers/2304.13774,,,,2304.13774,3,0 Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing,"Hyeonsu Jeong, Hye Won Chung",http://arxiv.org/abs/2301.00006,,https://huggingface.co/papers/2301.00006,,,,2301.00006,2,0 Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees,"Pengfei Li, Jianyi Yang, Shaolei Ren",http://arxiv.org/abs/2306.00172,https://github.com/Ren-Research/LOMAR,https://huggingface.co/papers/2306.00172,,,,2306.00172,3,0 Learning Intuitive Policies Using Action Features,"Mingwei Ma, Jizhou Liu, Samuel Sokota, Max Kleiman-Weiner, Jakob Foerster",http://arxiv.org/abs/2201.12658,,https://huggingface.co/papers/2201.12658,,,,2201.12658,5,0 Long-Tailed Recognition by Mutual Information Maximization between Latent Features and Ground-Truth Labels,"Min-Kook Suh, Seung-Woo Seo",http://arxiv.org/abs/2305.01160,,https://huggingface.co/papers/2305.01160,,,,2305.01160,2,0 Improving Graph Neural Networks with Learnable Propagation Operators,"Moshe Eliasof, Lars Ruthotto, Eran Treister",http://arxiv.org/abs/2210.17224,,https://huggingface.co/papers/2210.17224,,,,2210.17224,3,0 Effective Neural Topic Modeling with Embedding Clustering Regularization,"Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Anh Tuan Luu",http://arxiv.org/abs/2306.04217,,https://huggingface.co/papers/2306.04217,,,,2306.04217,4,1 Supported Trust Region Optimization for Offline Reinforcement Learning,"Yixiu Mao, Hongchang Zhang, Chen Chen, Yi Xu, Xiangyang Ji",,,,,,,,, Generative Adversarial Symmetry Discovery,"Jianke Yang, Robin Walters, Nima Dehmamy, Rose Yu",http://arxiv.org/abs/2302.00236,,https://huggingface.co/papers/2302.00236,,,,2302.00236,4,1 On Excess Mass Behavior in Gaussian Mixture Models with Orlicz-Wasserstein Distances,"Aritra Guha, Nhat Ho, XuanLong Nguyen",http://arxiv.org/abs/2301.11496,,https://huggingface.co/papers/2301.11496,,,,2301.11496,3,0 Repository-Level Prompt Generation for Large Language Models of Code,"Disha Shrivastava, Hugo Larochelle, Daniel Tarlow",http://arxiv.org/abs/2206.12839,https://github.com/shrivastavadisha/repo_level_prompt_generation,https://huggingface.co/papers/2206.12839,,,,2206.12839,3,1 Offline Learning in Markov Games with General Function Approximation,"Yuheng Zhang, Yu Bai, Nan Jiang",http://arxiv.org/abs/2302.02571,,https://huggingface.co/papers/2302.02571,,,,2302.02571,3,1 Fairness in Matching under Uncertainty,"Siddartha Devic, David Kempe, Vatsal Sharan, Aleksandra Korolova",http://arxiv.org/abs/2302.03810,,https://huggingface.co/papers/2302.03810,,,,2302.03810,4,1 Tensor Gaussian Process with Contraction for Multi-Channel Imaging Analysis,"Hu Sun, Ward Manchester, Meng Jin, Yang Liu, Yang Chen",http://arxiv.org/abs/2301.11203,,https://huggingface.co/papers/2301.11203,,,,2301.11203,5,0 A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints,"Ming Shi, Yingbin LIANG, Ness Shroff",http://arxiv.org/abs/2302.04375,,https://huggingface.co/papers/2302.04375,,,,2302.04375,3,0 How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy,"Raphaël Olivier, Bhiksha Raj",http://arxiv.org/abs/2207.04129,,https://huggingface.co/papers/2207.04129,,,,2207.04129,2,1 Understanding Self-Distillation in the Presence of Label Noise,"Rudrajit Das, Sujay Sanghavi",http://arxiv.org/abs/2301.13304,,https://huggingface.co/papers/2301.13304,,,,2301.13304,2,0 Data-Driven Subgroup Discovery for Linear Regression,"Zachary Izzo, Ruishan Liu, James Zou",,,,,,,,, Smooth Non-stationary Bandits ,"Su Jia, Qian Xie, Nathan Kallus, Peter I Frazier",,,,,,,,, Formalizing Preferences Over Runtime Distributions,"Devon Graham, Kevin Leyton-Brown, Tim Roughgarden",http://arxiv.org/abs/2205.13028,,https://huggingface.co/papers/2205.13028,,,,2205.13028,3,1 Second-order regression models exhibit progressive sharpening to the edge of stability,"Atish Agarwala, Fabian Pedregosa, Jeffrey Pennington",http://arxiv.org/abs/2210.04860,,https://huggingface.co/papers/2210.04860,,,,2210.04860,3,0 Scaling Spherical CNNs,"Carlos Esteves, Jean-Jacques Slotine, Ameesh Makadia",http://arxiv.org/abs/2306.05420,https://github.com/google-research/spherical-cnn,https://huggingface.co/papers/2306.05420,,,,2306.05420,3,0 Hardware-Aware Compression with Random Operation Access Specific Tile (ROAST) Hashing,"Aditya Desai, Keren Zhou, Anshumali Shrivastava",,,,,,,,, A Toy Model of Universality: Reverse Engineering how Networks Learn Group Operations,"Bilal Chughtai, Lawrence Chan, Neel Nanda",http://arxiv.org/abs/2302.03025,,https://huggingface.co/papers/2302.03025,,,,2302.03025,3,1 Fully Bayesian Autoencoders with Latent Sparse Gaussian Processes,"Ba-Hien Tran, Babak Shahbaba, Stephan Mandt, Maurizio Filippone",http://arxiv.org/abs/2302.04534,,https://huggingface.co/papers/2302.04534,,,,2302.04534,4,0 Why Random Pruning Is All We Need to Start Sparse,"Advait Gadhikar, Sohom Mukherjee, Rebekka Burkholz",http://arxiv.org/abs/2210.02412,,https://huggingface.co/papers/2210.02412,,,,2210.02412,3,0 SpotEM: Efficient Video Search for Episodic Memory,"Santhosh Kumar Ramakrishnan, Ziad Al-Halah, Kristen Grauman",,,,,,,,, Multi-task Hierarchical Adversarial Inverse Reinforcement Learning,"Jiayu Chen, Dipesh Tamboli, Tian Lan, Vaneet Aggarwal",http://arxiv.org/abs/2305.12633,,https://huggingface.co/papers/2305.12633,,,,2305.12633,4,1 Unscented Autoencoder,"Faris Janjoš, Lars Rosenbaum, Maxim Dolgov, J. Marius Zoellner",http://arxiv.org/abs/2306.05256,,https://huggingface.co/papers/2306.05256,,,,2306.05256,4,0 Vector-Valued Control Variates,"Zhuo Sun, Alessandro Barp, Francois-Xavier Briol",http://arxiv.org/abs/2109.08944,,https://huggingface.co/papers/2109.08944,,,,2109.08944,3,0 Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost,"Sanae Amani, Tor Lattimore, Andras Gyorgy, Lin Yang",http://arxiv.org/abs/2205.13170,,https://huggingface.co/papers/2205.13170,,,,2205.13170,4,0 Fair Densities via Boosting the Sufficient Statistics of Exponential Families,"Alexander Soen, Hisham Husain, Richard Nock",http://arxiv.org/abs/2012.00188,,https://huggingface.co/papers/2012.00188,,,,2012.00188,3,0 Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes,"Runlong Zhou, Ruosong Wang, Simon Du",http://arxiv.org/abs/2210.11604,,https://huggingface.co/papers/2210.11604,,,,2210.11604,3,0 A Fast Optimistic Method for Monotone Variational Inequalities,"Michael Sedlmayer, Dang-Khoa Nguyen, Radu Ioan Bot",,,,,,,,, Revisiting Simple Regret: Fast Rates for Returning a Good Arm,"Yao Zhao, Connor J Stephens, Csaba Szepesvari, Kwang-Sung Jun",http://arxiv.org/abs/2210.16913,,https://huggingface.co/papers/2210.16913,,,,2210.16913,4,0 Escaping saddle points in zeroth-order optimization: the power of two-point estimators,"Zhaolin Ren, Yujie Tang, Na Li",http://arxiv.org/abs/2209.13555,,https://huggingface.co/papers/2209.13555,,,,2209.13555,3,1 Beyond the Edge of Stability via Two-step Gradient Updates,"Lei Chen, Joan Bruna",,,,,,,,, Perturbation Analysis of Neural Collapse,"Tom Tirer, Haoxiang Huang, Jonathan Niles-Weed",http://arxiv.org/abs/2210.16658,,https://huggingface.co/papers/2210.16658,,,,2210.16658,3,0 The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent,"Lei Wu, Weijie Su",http://arxiv.org/abs/2305.17490,,https://huggingface.co/papers/2305.17490,,,,2305.17490,2,0 Conformal Inference is (almost) Free for Neural Networks Trained with Early Stopping,"Ziyi Liang, Yanfei Zhou, Matteo Sesia",http://arxiv.org/abs/2301.11556,,https://huggingface.co/papers/2301.11556,,,,2301.11556,3,0 Tight Certification of Adversarially Trained Neural Networks via Nonconvex Low-Rank Semidefinite Relaxations,"Hong-Ming Chiu, Richard Zhang",http://arxiv.org/abs/2211.17244,,https://huggingface.co/papers/2211.17244,,,,2211.17244,2,0 Learning Subpocket Prototypes for Generalizable Structure-based Drug Design,"Zaixi Zhang, Qi Liu",http://arxiv.org/abs/2305.13997,,https://huggingface.co/papers/2305.13997,,,,2305.13997,2,0 Learning Antidote Data to Individual Unfairness,"Peizhao Li, Ethan Xia, Hongfu Liu",http://arxiv.org/abs/2211.15897,,https://huggingface.co/papers/2211.15897,,,,2211.15897,3,1 MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations,"Anqi Li, Byron Boots, Ching-An Cheng",http://arxiv.org/abs/2303.17156,,https://huggingface.co/papers/2303.17156,,,,2303.17156,3,0 Answering Complex Logical Queries on Knowledge Graphs via Query Computation Tree Optimization,"Yushi Bai, Xin Lv, Juanzi Li, Lei Hou",http://arxiv.org/abs/2212.09567,https://github.com/bys0318/QTO,https://huggingface.co/papers/2212.09567,,,,2212.09567,4,1 Rethink DARTS Search Space and Renovate a New Benchmark,"Jiuling Zhang, Zhingming Ding",http://arxiv.org/abs/2306.06852,https://github.com/chaoji90/LHD,https://huggingface.co/papers/2306.06852,,,,2306.06852,2,1 Why Target Networks Stabilise Temporal Difference Methods,"Matthew Smith, Mattie Fellows, Shimon Whiteson",http://arxiv.org/abs/2302.12537,,https://huggingface.co/papers/2302.12537,,,,2302.12537,3,0 SGD-induced drift of representation in a two-layer neural network,"Farhad Pashakhanloo, Alexei Koulakov",,,,,,,,, Polyhedral Complex Extraction from ReLU Networks using 1-skeleton,Arturs Berzins,,,,,,,,, "Trainability, Expressivity and Interpretability in Gated Neural ODEs","Tim Kim, Tankut U Can, Kamesh Krishnamurthy",,,,,,,,, Task-specific experimental design for treatment effect estimation,"Bethany Connolly, Kimberley Moore, Tobias Schwedes, Alexander Adam, Gary Willis, Ilya Feige, Christopher Frye",http://arxiv.org/abs/2306.05484,,https://huggingface.co/papers/2306.05484,,,,2306.05484,7,0 Accelerated Cyclic Coordinate Dual Averaging with Extrapolation for Composite Convex Optimization,"Cheuk Yin Lin, Chaobing Song, Jelena Diakonikolas",http://arxiv.org/abs/2303.16279,,https://huggingface.co/papers/2303.16279,,,,2303.16279,3,0 Subequivariant Graph Reinforcement Learning in 3D Environments,"Runfa Chen, Jiaqi Han, Fuchun Sun, Wenbing Huang",http://arxiv.org/abs/2305.18951,,https://huggingface.co/papers/2305.18951,,,,2305.18951,4,0 Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL,"Zakaria Mhammedi, Dylan Foster, Alexander Rakhlin",http://arxiv.org/abs/2304.05889,,https://huggingface.co/papers/2304.05889,,,,2304.05889,3,0 Do Perceptually Aligned Gradients Imply Robustness?,"Roy Ganz, Bahjat Kawar, Michael Elad",,,,,,,,, Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels,"Sai Rajeswar, Pietro Mazzaglia, Tim Verbelen, Alex Piche, Bart Dhoedt, Aaron Courville, Alexandre Lacoste",http://arxiv.org/abs/2209.12016,,https://huggingface.co/papers/2209.12016,,,,2209.12016,7,1 Unifying Nesterov's Accelerated Gradient Methods for Convex and Strongly Convex Objective Functions,"Jungbin Kim, Insoon Yang",,,,,,,,, "Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models","Hong Liu, Sang Michael Xie, Zhiyuan Li, Tengyu Ma",http://arxiv.org/abs/2210.14199,,https://huggingface.co/papers/2210.14199,,,,2210.14199,4,1 Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models,"Luke Vilnis, Yury Zemlyanskiy, Patrick Murray, Alexandre Passos, Sumit Sanghai",http://arxiv.org/abs/2210.15458,,https://huggingface.co/papers/2210.15458,,,,2210.15458,5,1 Cross-Modal Fine-Tuning: Align then Refine,"Junhong Shen, Liam Li, Lucio Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar",,,,,,,,, Why does Throwing Away Data Improve Worst-Group Error?,"Kamalika Chaudhuri, Kartik Ahuja, Martin Arjovsky, David Lopez-Paz",http://arxiv.org/abs/2205.11672,,https://huggingface.co/papers/2205.11672,,,,2205.11672,4,0 Interventional Causal Representation Learning,"Kartik Ahuja, Divyat Mahajan, Yixin Wang, Yoshua Bengio",http://arxiv.org/abs/2209.11924,,https://huggingface.co/papers/2209.11924,,,,2209.11924,4,0 Reparameterized Policy Learning for Multimodal Trajectory Optimization,"Zhiao Huang, Litian Liang, Zhan Ling, Xuanlin Li, Chuang Gan, Hao Su",,,,,,,,, Learning Signed Distance Functions from Noisy 3D Point Clouds via Noise to Noise Mapping,"Baorui Ma, Yushen Liu, Zhizhong Han",http://arxiv.org/abs/2306.01405,https://github.com/mabaorui/Noise2NoiseMapping,https://huggingface.co/papers/2306.01405,,,,2306.01405,3,0 Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points,"Ziye Ma, Igor Molybog, Javad Lavaei, Somayeh Sojoudi",http://arxiv.org/abs/2302.07828,,https://huggingface.co/papers/2302.07828,,,,2302.07828,4,0 Sequential Underspecified Instrument Selection for Cause-Effect Estimation,"Elisabeth Ailer, Jason Hartford, Niki Kilbertus",http://arxiv.org/abs/2302.05684,,https://huggingface.co/papers/2302.05684,,,,2302.05684,3,1 Brauer's Group Equivariant Neural Networks,Edward Pearce-Crump,http://arxiv.org/abs/2212.08630,,https://huggingface.co/papers/2212.08630,,,,2212.08630,1,1 Learning-Rate-Free Learning by D-Adaptation,"Aaron Defazio, Konstantin Mishchenko",,,,,,,,,