
YOLO, SSD, Faster R-CNN, DETR, there are a lot of models for detecting objects. But, which one is the best?...
Page 1 of 3 · 21 articles
Articles about Deep Learning fundamentals and advanced methods, from supervised to self-supervised learning, including Vision Transformers, DINO, CLIP, SAM2, and more; all implemented in PyTorch.

YOLO, SSD, Faster R-CNN, DETR, there are a lot of models for detecting objects. But, which one is the best?...

We all know how powerful the SAM family is, with simple prompts, precise segmentation can be performed. In 2023, it...

U2SEG is the first unsupervised approach to combine instance, semantic, and panoptic segmentation. It uses MaskCut algorithm from CutLER to...

U-Net was published in 2015 for spotting microscopic cells in biomedical scans, and since then, it became very popular. It...

There are a lot of supervised object detection and instance segmentation models (YOLO, RCNN Family, DETR ...). Pipeline is the...

When we talk about depth in computer vision, we think about stereo cameras, time-of-flight sensors, and LiDAR. These methods don’t...

"a tiny vision language model that kicks ass and runs anywhere", that is exactly how the creators of Moondream defined...

SAM3 is just announced, and everybody is talking about it. But what is this hype all about? As you probably...

→ _An article explaining Grounding DINO and how to detect objects with text prompts_. I have so many articles about...

→ Step-by-step guide for training DETR(Detection Transformer) object detection models in PyTorch with any dataset. When it comes to object...