tezvyn:

Computer Vision

Image/video models, diffusion, OCR, multimodal

27 bites

Computer Vision30 sec read

Homography: Mapping Flat Surfaces Between Images

A homography is a 3x3 matrix that maps points between two images of a flat surface. It's used for panorama stitching and perspective correction. The footgun: it only works for planes or pure camera rotation, failing on scenes with depth.

Computer Vision30 sec read

Point Cloud: A 3D Shape as a Dust Cloud of Data

A point cloud represents a 3D object as a cloud of individual data points in space. It's the raw output from 3D scanners, used to create CAD models or GIS maps. The footgun is assuming it's a solid model; it has no surfaces, only disconnected.

Computer Vision30 sec read

The Sobel Operator: Fast, Cheap Edge Detection

The Sobel operator finds image edges by measuring how fast pixel brightness changes horizontally and vertically. It's a fast, cheap first pass for edge detection in computer vision. The footgun is treating it as precise; it's a crude approximation.

Computer Vision30 sec read

Median Filter: Smoothing Images Without Blurring Edges

A median filter cleans image noise by replacing each pixel with the median value of its neighbors, ignoring outliers. It's used to remove "salt-and-pepper" noise before object detection.

Computer Vision30 sec read

BRDF: Modeling How Surfaces Reflect Light

A BRDF is a function that describes a surface's appearance by defining how it reflects light from any incoming direction to any viewing direction. It's used in rendering engines to create materials like brushed metal or velvet.

Computer Vision30 sec read

CIELAB Color Space: Measuring Color Beyond RGB

CIELAB models color not by how a screen displays it (like RGB), but by how a human perceives it: lightness, red/green, and yellow/blue. It's used in industry to measure and match colors precisely. The footgun is assuming it's perfectly 'perceptually uniform'.

Computer Vision30 sec read

Digital Images as Grids of Pixels

Think of a digital image as a mosaic of tiny colored tiles called pixels. This 'raster' method stores the exact color of each point, making it perfect for photos. The footgun: scaling up reveals the grid, causing blurriness or pixelation.