Clustering and geometrical features for classi cation of humans in three dimensional data

Typ
Examensarbete för masterexamen
Program
Biomedical engineering (MPBME), MSc
Publicerad
2019
Författare
Hedin, David
Wendel, Johan
Modellbyggare
Tidskriftstitel
ISSN
Volymtitel
Utgivare
Sammanfattning
Computer vision is a rich research field that uses images from normal or technically specific cameras to perform tasks ranging from surveillance to autonomous driving. Computer vision with depth images are however relatively new. Depth images add a third dimension to the image by giving every pixel a depth value and can be produced with several different camera types. The implications for depth imagery is that if an object can be classified in a depth image as e.g a trashcan, a cat or a human, the nature of the data immediately also gives us the distance and position of that object. The extra data from these cameras can enable estimating the volumes and sizes of objects in an image with some extra processing. The focus of this master thesis is on the processing and analysis of this depth data to enable object identification and human classification. Much research has been on analysing 3D data from the 2D perspective, in this thesis the captured data is first converted to Cartesian coordinates before attempting classification, yielding further possibilities. The goal of this thesis was to find if there are some anthropomorphic-geometrical features that can describe the human body well enough to accurately classify humans in the Cartesian data. The features are used in two ways, as a Heuristical-geometrical filter and as features for a support vector machine. Furthermore the thesis presents a successful dynamic adaption of the fast-DBSCAN (Density Based Spatial Clustering of Applications with Noise clustering) algorithm for 3D Cartesian data and a slice method for finding local maxima of point cloud objects. The results show that anthropomorphic-geometrical features can to an extent be used to classify Cartesian point cloud data. Low resolution cameras has potential for classification purposes as resolution seem to have little effect on geometrical classification as long as human resolution is no less then 20px vertically. Some further work would be needed to create a anthropomorphic-geometrical for real world application.
Beskrivning
Ämne/nyckelord
Depth camera , moving platform , human machine interaction , localisation , clustering , human classification , 3D
Citation
Arkitekt (konstruktör)
Geografisk plats
Byggnad (typ)
Byggår
Modelltyp
Skala
Teknik / material
Index