Edgar (@edgarriba) Twitter Tweets • TwiCopy

Edgar

@edgarriba

+ Follow

Creator of @kornia_foss and co-founder of https://t.co/mYuSg1ClqG | Computer Vision Researcher

ID:249701367

linkhttps://github.com/edgarriba calendar_today09-02-2011 16:06:48

1,8K Tweets

1,6K Followers

1,2K Following

Kornia

2 weeks ago

Specularity Factorization for Low-Light Enhancement

Saurabh Saini, P J Narayanan

tl;dr: estimate multiple (model-based) noise factors to image enhancement.
#kornia used for differentiable bilateral filtering.
arxiv.org/abs/2404.01998…
sophont01.github.io/data/projects/…

Specularity Factorization for Low-Light Enhancement Saurabh Saini, P J Narayanan tl;dr: estimate multiple (model-based) noise factors to image enhancement. #kornia used for differentiable bilateral filtering. arxiv.org/abs/2404.01998… sophont01.github.io/data/projects/…

thumb_up_off_alt18

chat_bubble_outline0

account_circle

Kornia

2 weeks ago

MuST: Robust Image Watermarking for Multi-Source Tracing

Guanjie Wang, Zehua Ma,Chang Liu, Xi Yang, Han Fang, Weiming Zhang, Nenghai Yu

tl;dr: watermarks for collages - to trace all inputs. #kornia used for differentiable data augmentation.

ojs.aaai.org/index.php/AAAI…

MuST: Robust Image Watermarking for Multi-Source Tracing Guanjie Wang, Zehua Ma,Chang Liu, Xi Yang, Han Fang, Weiming Zhang, Nenghai Yu tl;dr: watermarks for collages - to trace all inputs. #kornia used for differentiable data augmentation. ojs.aaai.org/index.php/AAAI…

thumb_up_off_alt10

chat_bubble_outline0

account_circle

Edgar

2 weeks ago

kornia-rs is the project I’m focused working right now. Porting computer vision algorithms to safe #rust

thumb_up_off_alt31

chat_bubble_outline0

account_circle

Jao

1 month ago

we've also added a small feature that allows us to use dictionaries in the AugmentationSequential container instead of data keys.

more details in the docs: kornia.readthedocs.io/en/stable/augm…

we've also added a small feature that allows us to use dictionaries in the AugmentationSequential container instead of data keys. more details in the docs: kornia.readthedocs.io/en/stable/augm…

thumb_up_off_alt8

chat_bubble_outline0

account_circle

Edgar

1 month ago

New release of kornia is out

thumb_up_off_alt11

chat_bubble_outline0

account_circle

Kornia

1 month ago

0.7.2 is out!
- Added DeDoDe features (thanks Johan Edstedt )
- LightGlue models, available nowhere else - DeDoDe (B/G), KeyNet-HardNet
- KMeans implementation
- New augmentations: RandomGaussianIllumination, RandomLinearIllumination, RandomLinearCorner
1/2
github.com/kornia/kornia/…

0.7.2 is out! - Added DeDoDe features (thanks @Parskatt ) - LightGlue models, available nowhere else - DeDoDe (B/G), KeyNet-HardNet - KMeans implementation - New augmentations: RandomGaussianIllumination, RandomLinearIllumination, RandomLinearCorner 1/2 github.com/kornia/kornia/…

thumb_up_off_alt130

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Multilinear Operator Networks

Yixin Cheng, Grigoris Chrysos, Markos Georgopoulos, Volkan Cevher

tl;dr: element-wise multiplication and layer norm is all you need (pun intended)

arxiv.org/abs/2401.17992

Multilinear Operator Networks Yixin Cheng, @Grigoris_c, Markos Georgopoulos, @CevherLIONS tl;dr: element-wise multiplication and layer norm is all you need (pun intended) arxiv.org/abs/2401.17992

thumb_up_off_alt9

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Grandmaster-Level Chess Without Search

Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Kevin Li, Elliot Catt, John Reid, Tim Genewein

tl;dr: supervised learning labeled by Stockfish FTW.
Many ablations, also scaling experiment
arxiv.org/abs/2402.04494…

Grandmaster-Level Chess Without Search @anianruoss, @gregdeletang, @activelifetribe, @jordigraumo, @liwenliang, Elliot Catt, @__Reidy__, Tim Genewein tl;dr: supervised learning labeled by Stockfish FTW. Many ablations, also scaling experiment arxiv.org/abs/2402.04494…

thumb_up_off_alt14

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Natasha Butt, Blaze(j) Manczak 🇵🇱🇱🇺🇪🇺, Auke Wiggers, Corrado Rainone, David Zhang, Michaël Defferrard, Taco Cohen

tl;dr: sample a program, try it, add to the replay pool.
New sota on ARC
arxiv.org/abs/2402.04858…

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay @NatashaEve4, @blazejmanczak, @aukejw, Corrado Rainone, David Zhang, @m_deff, @TacoCohen tl;dr: sample a program, try it, add to the replay pool. New sota on ARC arxiv.org/abs/2402.04858…

thumb_up_off_alt36

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

A Single Simple Patch is All You Need for AI-generated Image Detection

Jiaxuan Chen, Jieteng Yao, Li Niu
arxiv.org/abs/2402.01123

tl;dr: image generators produce 'cleaner' images than real, makes easy to detect.
My comment: compare vs high-quality stock photo, not ImageNet?

A Single Simple Patch is All You Need for AI-generated Image Detection Jiaxuan Chen, Jieteng Yao, Li Niu arxiv.org/abs/2402.01123 tl;dr: image generators produce 'cleaner' images than real, makes easy to detect. My comment: compare vs high-quality stock photo, not ImageNet?

thumb_up_off_alt93

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Extreme Two-View Geometry From Object Poses
with Diffusion Models

Yujing Sun, Caiyi Sun, Yuan Liu, Yuexin Ma, Siu Ming Yiu

tl;dr:
ASIFT/MODS view generation for matching meets diffusion.
arxiv.org/abs/2402.02800

Extreme Two-View Geometry From Object Poses with Diffusion Models Yujing Sun, Caiyi Sun, Yuan Liu, Yuexin Ma, Siu Ming Yiu tl;dr: ASIFT/MODS view generation for matching meets diffusion. arxiv.org/abs/2402.02800

thumb_up_off_alt93

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao, Yuqun Wu, Sethuraman T V, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem

tl;dr: in title, SLIC helps SAM.
arxiv.org/pdf/2402.02352…

Region-Based Representations Revisited Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao, Yuqun Wu, Sethuraman T V, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem tl;dr: in title, SLIC helps SAM. arxiv.org/pdf/2402.02352…

thumb_up_off_alt40

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

CLIP Can Understand Depth

Dunam Kim, Seokju Lee

tl;dr: learn input embedding for the monodepth, then use it for the input -> not sota, but quite good, unlike previous CLIP based depth.
arxiv.org/abs/2402.03251…

CLIP Can Understand Depth Dunam Kim, Seokju Lee tl;dr: learn input embedding for the monodepth, then use it for the input -> not sota, but quite good, unlike previous CLIP based depth. arxiv.org/abs/2402.03251…

thumb_up_off_alt184

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

MESA: Matching Everything by Segmenting Anything

Yesheng Zhang, Xu Zhao

tl;dr: use SAM as region detector, match regions, then get point correspondences using traditional matchers.
No eval on IMC. Also, no eval with SG or LightGlue.

arxiv.org/abs/2401.16741

MESA: Matching Everything by Segmenting Anything Yesheng Zhang, Xu Zhao tl;dr: use SAM as region detector, match regions, then get point correspondences using traditional matchers. No eval on IMC. Also, no eval with SG or LightGlue. arxiv.org/abs/2401.16741

thumb_up_off_alt102

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

1 year ago

ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation

Xiaoming Zhao, Xingming Wu, Weihai Chen, Peter C. Y. Chen, Qingsong Xu, Zhengguo Li

tl;dr: Journal ALIKE, many arch ablations. as good as DISK, but much faster

arxiv.org/abs/2304.03608…

ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation Xiaoming Zhao, Xingming Wu, Weihai Chen, Peter C. Y. Chen, Qingsong Xu, Zhengguo Li tl;dr: Journal ALIKE, many arch ablations. as good as DISK, but much faster arxiv.org/abs/2304.03608…

thumb_up_off_alt85

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Xinlei Chen, Zhuang Liu Saining Xie Kaiming He

tl;dr: adding the noise in the low-dim space (even PCA works) is the most important of diffusion models for representation learning.

arxiv.org/abs/2401.14404

Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen, @liuzhuang1234 @sainingxie Kaiming He tl;dr: adding the noise in the low-dim space (even PCA works) is the most important of diffusion models for representation learning. arxiv.org/abs/2401.14404

thumb_up_off_alt26

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation
Yuanwen Yue Sabarinath Mahadevan Jonas Schult Francis Engelmann Bastian Leibe Konrad Schindler TheodoraKontogianni
tl;dr:if you are doing interactive segmentation - pre-extract features
#ICLR2024
arxiv.org/abs/2306.00977…

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation @YueYuanwen Sabarinath Mahadevan @JonasSchultCV @FrancisEngelman Bastian Leibe Konrad Schindler @DoraKontog tl;dr:if you are doing interactive segmentation - pre-extract features #ICLR2024 arxiv.org/abs/2306.00977…

thumb_up_off_alt24

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Kudos to Philipp Lindenberger and Paul-Edouard Sarlin for adding DoGHardNet model to official LightGlue trained by Kornia team.
That is a significant upgrade, if you have to use DoG(SIFT) detector.
As simple as
matcher = LightGlue(features='doghardnet').eval()

Kudos to @PhilippCSE and @pesarlin for adding DoGHardNet model to official LightGlue trained by @kornia_foss team. That is a significant upgrade, if you have to use DoG(SIFT) detector. As simple as matcher = LightGlue(features='doghardnet').eval()

thumb_up_off_alt97

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually

Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad

tl;dr: using segmentation+BLIP to find NSFW parts of image
arxiv.org/abs/2401.11035…

Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad tl;dr: using segmentation+BLIP to find NSFW parts of image arxiv.org/abs/2401.11035…

thumb_up_off_alt6

chat_bubble_outline0

account_circle

Dmytro Mishkin 🇺🇦

2 months ago

PhotoBot: Reference-Guided Interactive Photography via Natural Language

Oliver Limoyo, Jimmy Li, Dmitriy Rivkin, Jonathan Kelly, Gregory Dudek

tl;dr: LLM + DiNO + RANSAC + robot to copy people's poses from the internet.

arxiv.org/abs/2401.11061…

PhotoBot: Reference-Guided Interactive Photography via Natural Language Oliver Limoyo, Jimmy Li, Dmitriy Rivkin, Jonathan Kelly, Gregory Dudek tl;dr: LLM + DiNO + RANSAC + robot to copy people's poses from the internet. arxiv.org/abs/2401.11061…

thumb_up_off_alt8

chat_bubble_outline0

account_circle

fpc ok :)