Clustering and geometrical features for classi cation of humans in three dimensional data

Publicerad

Typ

Examensarbete för masterexamen

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Computer vision is a rich research field that uses images from normal or technically specific cameras to perform tasks ranging from surveillance to autonomous driving. Computer vision with depth images are however relatively new. Depth images add a third dimension to the image by giving every pixel a depth value and can be produced with several different camera types. The implications for depth imagery is that if an object can be classified in a depth image as e.g a trashcan, a cat or a human, the nature of the data immediately also gives us the distance and position of that object. The extra data from these cameras can enable estimating the volumes and sizes of objects in an image with some extra processing. The focus of this master thesis is on the processing and analysis of this depth data to enable object identification and human classification. Much research has been on analysing 3D data from the 2D perspective, in this thesis the captured data is first converted to Cartesian coordinates before attempting classification, yielding further possibilities. The goal of this thesis was to find if there are some anthropomorphic-geometrical features that can describe the human body well enough to accurately classify humans in the Cartesian data. The features are used in two ways, as a Heuristical-geometrical filter and as features for a support vector machine. Furthermore the thesis presents a successful dynamic adaption of the fast-DBSCAN (Density Based Spatial Clustering of Applications with Noise clustering) algorithm for 3D Cartesian data and a slice method for finding local maxima of point cloud objects. The results show that anthropomorphic-geometrical features can to an extent be used to classify Cartesian point cloud data. Low resolution cameras has potential for classification purposes as resolution seem to have little effect on geometrical classification as long as human resolution is no less then 20px vertically. Some further work would be needed to create a anthropomorphic-geometrical for real world application.

Beskrivning

Ämne/nyckelord

Depth camera, moving platform, human machine interaction, localisation, clustering, human classification, 3D

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced