Edgar(@edgarriba) 's Twitter Profileg
Edgar

@edgarriba

Creator of @kornia_foss and co-founder of https://t.co/mYuSg1ClqG | Computer Vision Researcher

ID:249701367

linkhttps://github.com/edgarriba calendar_today09-02-2011 16:06:48

1,8K Tweets

1,6K Followers

1,2K Following

Kornia(@kornia_foss) 's Twitter Profile Photo

Specularity Factorization for Low-Light Enhancement

Saurabh Saini, P J Narayanan

tl;dr: estimate multiple (model-based) noise factors to image enhancement.
used for differentiable bilateral filtering.
arxiv.org/abs/2404.01998…
sophont01.github.io/data/projects/…

Specularity Factorization for Low-Light Enhancement Saurabh Saini, P J Narayanan tl;dr: estimate multiple (model-based) noise factors to image enhancement. #kornia used for differentiable bilateral filtering. arxiv.org/abs/2404.01998… sophont01.github.io/data/projects/…
account_circle
Kornia(@kornia_foss) 's Twitter Profile Photo

MuST: Robust Image Watermarking for Multi-Source Tracing

Guanjie Wang, Zehua Ma,Chang Liu, Xi Yang, Han Fang, Weiming Zhang, Nenghai Yu

tl;dr: watermarks for collages - to trace all inputs. used for differentiable data augmentation.

ojs.aaai.org/index.php/AAAI…

MuST: Robust Image Watermarking for Multi-Source Tracing Guanjie Wang, Zehua Ma,Chang Liu, Xi Yang, Han Fang, Weiming Zhang, Nenghai Yu tl;dr: watermarks for collages - to trace all inputs. #kornia used for differentiable data augmentation. ojs.aaai.org/index.php/AAAI…
account_circle
Jao(@JoaoAtkn) 's Twitter Profile Photo

we've also added a small feature that allows us to use dictionaries in the AugmentationSequential container instead of data keys.

more details in the docs: kornia.readthedocs.io/en/stable/augm…

we've also added a small feature that allows us to use dictionaries in the AugmentationSequential container instead of data keys. more details in the docs: kornia.readthedocs.io/en/stable/augm…
account_circle
Kornia(@kornia_foss) 's Twitter Profile Photo

0.7.2 is out!
- Added DeDoDe features (thanks Johan Edstedt )
- LightGlue models, available nowhere else - DeDoDe (B/G), KeyNet-HardNet
- KMeans implementation
- New augmentations: RandomGaussianIllumination, RandomLinearIllumination, RandomLinearCorner
1/2
github.com/kornia/kornia/…

0.7.2 is out! - Added DeDoDe features (thanks @Parskatt ) - LightGlue models, available nowhere else - DeDoDe (B/G), KeyNet-HardNet - KMeans implementation - New augmentations: RandomGaussianIllumination, RandomLinearIllumination, RandomLinearCorner 1/2 github.com/kornia/kornia/…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Multilinear Operator Networks

Yixin Cheng, Grigoris Chrysos, Markos Georgopoulos, Volkan Cevher

tl;dr: element-wise multiplication and layer norm is all you need (pun intended)

arxiv.org/abs/2401.17992

Multilinear Operator Networks Yixin Cheng, @Grigoris_c, Markos Georgopoulos, @CevherLIONS tl;dr: element-wise multiplication and layer norm is all you need (pun intended) arxiv.org/abs/2401.17992
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Grandmaster-Level Chess Without Search

Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Kevin Li, Elliot Catt, John Reid, Tim Genewein

tl;dr: supervised learning labeled by Stockfish FTW.
Many ablations, also scaling experiment
arxiv.org/abs/2402.04494…

Grandmaster-Level Chess Without Search @anianruoss, @gregdeletang, @activelifetribe, @jordigraumo, @liwenliang, Elliot Catt, @__Reidy__, Tim Genewein tl;dr: supervised learning labeled by Stockfish FTW. Many ablations, also scaling experiment arxiv.org/abs/2402.04494…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Natasha Butt, Blaze(j) Manczak 🇵🇱🇱🇺🇪🇺, Auke Wiggers, Corrado Rainone, David Zhang, Michaël Defferrard, Taco Cohen

tl;dr: sample a program, try it, add to the replay pool.
New sota on ARC
arxiv.org/abs/2402.04858…

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay @NatashaEve4, @blazejmanczak, @aukejw, Corrado Rainone, David Zhang, @m_deff, @TacoCohen tl;dr: sample a program, try it, add to the replay pool. New sota on ARC arxiv.org/abs/2402.04858…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

A Single Simple Patch is All You Need for AI-generated Image Detection

Jiaxuan Chen, Jieteng Yao, Li Niu
arxiv.org/abs/2402.01123

tl;dr: image generators produce 'cleaner' images than real, makes easy to detect.
My comment: compare vs high-quality stock photo, not ImageNet?

A Single Simple Patch is All You Need for AI-generated Image Detection Jiaxuan Chen, Jieteng Yao, Li Niu arxiv.org/abs/2402.01123 tl;dr: image generators produce 'cleaner' images than real, makes easy to detect. My comment: compare vs high-quality stock photo, not ImageNet?
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Extreme Two-View Geometry From Object Poses
with Diffusion Models

Yujing Sun, Caiyi Sun, Yuan Liu, Yuexin Ma, Siu Ming Yiu

tl;dr:
ASIFT/MODS view generation for matching meets diffusion.
arxiv.org/abs/2402.02800

Extreme Two-View Geometry From Object Poses with Diffusion Models Yujing Sun, Caiyi Sun, Yuan Liu, Yuexin Ma, Siu Ming Yiu tl;dr: ASIFT/MODS view generation for matching meets diffusion. arxiv.org/abs/2402.02800
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao, Yuqun Wu, Sethuraman T V, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem

tl;dr: in title, SLIC helps SAM.
arxiv.org/pdf/2402.02352…

Region-Based Representations Revisited Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao, Yuqun Wu, Sethuraman T V, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem tl;dr: in title, SLIC helps SAM. arxiv.org/pdf/2402.02352…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

CLIP Can Understand Depth

Dunam Kim, Seokju Lee

tl;dr: learn input embedding for the monodepth, then use it for the input -> not sota, but quite good, unlike previous CLIP based depth.
arxiv.org/abs/2402.03251…

CLIP Can Understand Depth Dunam Kim, Seokju Lee tl;dr: learn input embedding for the monodepth, then use it for the input -> not sota, but quite good, unlike previous CLIP based depth. arxiv.org/abs/2402.03251…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

MESA: Matching Everything by Segmenting Anything

Yesheng Zhang, Xu Zhao

tl;dr: use SAM as region detector, match regions, then get point correspondences using traditional matchers.
No eval on IMC. Also, no eval with SG or LightGlue.

arxiv.org/abs/2401.16741

MESA: Matching Everything by Segmenting Anything Yesheng Zhang, Xu Zhao tl;dr: use SAM as region detector, match regions, then get point correspondences using traditional matchers. No eval on IMC. Also, no eval with SG or LightGlue. arxiv.org/abs/2401.16741
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation

Xiaoming Zhao, Xingming Wu, Weihai Chen, Peter C. Y. Chen, Qingsong Xu, Zhengguo Li

tl;dr: Journal ALIKE, many arch ablations. as good as DISK, but much faster

arxiv.org/abs/2304.03608…

ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation Xiaoming Zhao, Xingming Wu, Weihai Chen, Peter C. Y. Chen, Qingsong Xu, Zhengguo Li tl;dr: Journal ALIKE, many arch ablations. as good as DISK, but much faster arxiv.org/abs/2304.03608…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Xinlei Chen, Zhuang Liu Saining Xie Kaiming He

tl;dr: adding the noise in the low-dim space (even PCA works) is the most important of diffusion models for representation learning.

arxiv.org/abs/2401.14404

Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen, @liuzhuang1234 @sainingxie Kaiming He tl;dr: adding the noise in the low-dim space (even PCA works) is the most important of diffusion models for representation learning. arxiv.org/abs/2401.14404
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation
Yuanwen Yue Sabarinath Mahadevan Jonas Schult Francis Engelmann Bastian Leibe Konrad Schindler TheodoraKontogianni
tl;dr:if you are doing interactive segmentation - pre-extract features

arxiv.org/abs/2306.00977…

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation @YueYuanwen Sabarinath Mahadevan @JonasSchultCV @FrancisEngelman Bastian Leibe Konrad Schindler @DoraKontog tl;dr:if you are doing interactive segmentation - pre-extract features #ICLR2024 arxiv.org/abs/2306.00977…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Kudos to Philipp Lindenberger and Paul-Edouard Sarlin for adding DoGHardNet model to official LightGlue trained by Kornia team.
That is a significant upgrade, if you have to use DoG(SIFT) detector.
As simple as
matcher = LightGlue(features='doghardnet').eval()

Kudos to @PhilippCSE and @pesarlin for adding DoGHardNet model to official LightGlue trained by @kornia_foss team. That is a significant upgrade, if you have to use DoG(SIFT) detector. As simple as matcher = LightGlue(features='doghardnet').eval()
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually

Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad

tl;dr: using segmentation+BLIP to find NSFW parts of image
arxiv.org/abs/2401.11035…

Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad tl;dr: using segmentation+BLIP to find NSFW parts of image arxiv.org/abs/2401.11035…
account_circle
Dmytro Mishkin 🇺🇦(@ducha_aiki) 's Twitter Profile Photo

PhotoBot: Reference-Guided Interactive Photography via Natural Language

Oliver Limoyo, Jimmy Li, Dmitriy Rivkin, Jonathan Kelly, Gregory Dudek

tl;dr: LLM + DiNO + RANSAC + robot to copy people's poses from the internet.

arxiv.org/abs/2401.11061…

PhotoBot: Reference-Guided Interactive Photography via Natural Language Oliver Limoyo, Jimmy Li, Dmitriy Rivkin, Jonathan Kelly, Gregory Dudek tl;dr: LLM + DiNO + RANSAC + robot to copy people's poses from the internet. arxiv.org/abs/2401.11061…
account_circle