CSC 553-CV Lec 01

CSC 553: Computer Vision
Lec 01: Introduction to Computer Vision
Faculty: Dr. M. Rokonuzzaman
State-of-the-art
 The field of computer vision can be characterized as immature and
diverse. Even though earlier work exists, it was not until the late
1970s that a more focused study of the field started when computers
could manage the processing of large data sets such as images.
 However, these studies usually originated from various other fields,
and consequently there is no standard formulation of "the computer
vision problem".
 Also, and to an even larger extent, there is no standard formulation of
how computer vision problems should be solved. Instead, there exists
an abundance of methods for solving various well-defined computer
vision tasks, where the methods are often very task-specific and
can seldom be generalized over a wide range of applications.
 Many of the methods and applications are still in the state of basic
research, but more and more methods have found their way into
commercial products, where they often constitute a part of a larger
system which can solve complex tasks (e.g., in the area of medical
images, or quality control and measurements in industrial processes).
 In most practical computer vision applications, the computers are
pre-programmed to solve a particular task, but methods based on
learning are now becoming increasingly common.
Why study Computer Vision?
 Images and video are everywhere
 Fast-growing collection of useful applications
 matching and modifying images from digital cameras
 film special effects and post-processing
 building representations of the 3D world from pictures
 medical imaging, household robots, security, traffic control,
cell phone location, face finding, video game interfaces, ...
 Various deep and attractive scientific mysteries
 what can we know from an image?
 how does object recognition work?
 Greater understanding of human vision and the brain
 about 25% of the human brain is devoted to vision
Computer Vision with Different Names:
Purpose of Computer Vision:
 The classical problem in computer vision, image
processing and machine vision is that of determining
whether or not the image data contains some specific
object, feature, or activity.
 This task can normally be solved robustly and without
effort by a human, but is still not satisfactorily solved
in computer vision for the general case: arbitrary
objects in arbitrary situations.
 The existing methods for dealing with this problem
can at best solve it only for specific objects, such as
simple geometric objects (e.g., polyhedrons), human
faces, printed or hand-written characters, or vehicles,
and in specific situations, typically described in terms
of well-defined illumination, background, and pose of
the object relative to the camera.
Different varieties of the recognition problem are described in
the literature:
 Recognition: one or several pre-specified or learned
objects or object classes can be recognized, usually
together with their 2D positions in the image or 3D poses in
the scene.
 Identification: An individual instance of an object is
recognized. Examples: identification of a specific person's
face or fingerprint, or identification of a specific vehicle.
 Detection: the image data is scanned for a specific
condition. Examples: detection of possible abnormal cells or
tissues in medical images or detection of a vehicle in an
automatic road toll system. Detection based on relatively
simple and fast computations is sometimes used for finding
smaller regions of interesting image data which can be
further analyzed by more computationally demanding
techniques to produce a correct interpretation.
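The two-stage idea in the Detection item (a cheap screen first, a costlier check only on the regions that pass) can be sketched as follows. This is a minimal illustration, not a method from the lecture: the tile size, both thresholds and all function names are assumptions chosen for the example.

```python
# Two-stage detection sketch. All names and thresholds are illustrative assumptions.

def tiles(image, size):
    """Yield (row, col, tile) for non-overlapping size x size tiles."""
    for r in range(0, len(image) - size + 1, size):
        for c in range(0, len(image[0]) - size + 1, size):
            yield r, c, [row[c:c + size] for row in image[r:r + size]]

def cheap_screen(tile, min_mean=50):
    """Stage 1 (fast): keep tiles whose mean brightness is at least min_mean."""
    total = sum(sum(row) for row in tile)
    return total / (len(tile) * len(tile[0])) >= min_mean

def expensive_check(tile, level=200, min_area=4):
    """Stage 2 (slower): require a 4-connected blob of >= min_area bright pixels."""
    h, w = len(tile), len(tile[0])
    seen = [[False] * w for _ in range(h)]
    best = 0
    for r in range(h):
        for c in range(w):
            if tile[r][c] < level or seen[r][c]:
                continue
            stack, area = [(r, c)], 0
            seen[r][c] = True
            while stack:  # flood fill over bright neighbours
                y, x = stack.pop()
                area += 1
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and not seen[ny][nx] and tile[ny][nx] >= level):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
            best = max(best, area)
    return best >= min_area

def detect(image, size=4):
    """Run the cheap screen on every tile, then the costly check on survivors."""
    candidates = [(r, c, t) for r, c, t in tiles(image, size) if cheap_screen(t)]
    return [(r, c) for r, c, t in candidates if expensive_check(t)]

# Synthetic 8x8 image: a compact 2x2 blob and four isolated bright pixels.
image = [[0] * 8 for _ in range(8)]
for y, x in ((1, 1), (1, 2), (2, 1), (2, 2)):
    image[y][x] = 255
for y, x in ((4, 4), (4, 6), (6, 4), (6, 6)):
    image[y][x] = 255

hits = detect(image)
```

Both bright tiles pass the cheap screen, but only the tile with the compact blob survives the second stage, so the costly check runs on two tiles instead of four.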
Several specialized tasks based on recognition exist, such as:
 Content-based image retrieval: finding all images in a larger
set of images which have a specific content. The content can be
specified in different ways, for example in terms of similarity
relative to a target image (give me all images similar to image X), or
in terms of high-level search criteria given as text input (give me
all images which contain many houses, are taken during winter,
and have no cars in them).
 Pose estimation: estimating the position or orientation of a
specific object relative to the camera. An example application for
this technique would be assisting a robot arm in retrieving objects
from a conveyor belt in an assembly line situation.
 Optical character recognition (or OCR): identifying characters
in images of printed or handwritten text, usually with a view to
encoding the text in a format more amenable to editing or
indexing (e.g. ASCII).
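The "give me all images similar to image X" query from the content-based image retrieval item can be sketched with grey-level histograms compared by histogram intersection. The similarity measure, the bin count and the tiny in-memory database are assumptions made for this example; real systems use much richer descriptors.

```python
# Content-based retrieval sketch using grey-level histograms (illustrative only).

def histogram(image, bins=4, max_value=256):
    """Normalised grey-level histogram of an image given as a list of rows."""
    counts = [0] * bins
    n = 0
    for row in image:
        for v in row:
            counts[v * bins // max_value] += 1
            n += 1
    return [c / n for c in counts]

def similarity(h1, h2):
    """Histogram intersection: 1.0 for identical histograms, 0.0 for disjoint ones."""
    return sum(min(a, b) for a, b in zip(h1, h2))

def retrieve(query, database, top_k=2):
    """Return the names of the top_k database images most similar to the query."""
    qh = histogram(query)
    ranked = sorted(
        database,
        key=lambda name: similarity(qh, histogram(database[name])),
        reverse=True,
    )
    return ranked[:top_k]

# Hypothetical 2x2 "images" standing in for a real collection.
database = {
    "dark": [[10, 20], [30, 40]],
    "bright": [[200, 220], [240, 250]],
    "mixed": [[10, 250], [20, 240]],
}
query = [[15, 25], [100, 230]]
result = retrieve(query, database)  # ranks "mixed" first, then "dark"
```

A histogram ignores where pixels are located, so two very different scenes can score as similar; that limitation is one reason text-level criteria such as "many houses, no cars" need recognition rather than simple statistics.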
Motion
 Several tasks relate to motion estimation, in which an
image sequence is processed to produce an estimate
of the velocity either at each point in the image or in
the 3D scene. Examples of such tasks are:
 Egomotion: determining the 3D rigid motion of the
camera.
 Tracking: following the movements of objects (e.g.
vehicles or humans).
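A velocity estimate for one image block between two frames can be sketched with exhaustive block matching, which picks the shift with the smallest sum of absolute differences (SAD). Block matching is only one of several motion estimation techniques and is used here as an assumption; the frame size, block size and search range are illustrative.

```python
# Block-matching motion estimation sketch (illustrative parameters).

def sad(frame_a, frame_b, top, left, dy, dx, size):
    """Sum of absolute differences between a block in frame_a and the
    same-sized block in frame_b shifted by (dy, dx)."""
    return sum(
        abs(frame_a[top + y][left + x] - frame_b[top + dy + y][left + dx + x])
        for y in range(size)
        for x in range(size)
    )

def estimate_motion(frame_a, frame_b, top, left, size=2, search=2):
    """Return the (dy, dx) in pixels per frame, within +/- search, that best
    maps the block at (top, left) in frame_a onto frame_b."""
    h, w = len(frame_b), len(frame_b[0])
    best, best_cost = (0, 0), None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            inside = (0 <= top + dy and top + dy + size <= h
                      and 0 <= left + dx and left + dx + size <= w)
            if not inside:
                continue  # shifted block would leave the frame
            cost = sad(frame_a, frame_b, top, left, dy, dx, size)
            if best_cost is None or cost < best_cost:
                best, best_cost = (dy, dx), cost
    return best

def make_frame(top, left, size=6):
    """Synthetic frame: a textured 2x2 patch on a black background (test data)."""
    frame = [[0] * size for _ in range(size)]
    patch = [[10, 20], [30, 40]]
    for y in range(2):
        for x in range(2):
            frame[top + y][left + x] = patch[y][x]
    return frame

# The patch moves one row down and two columns right between the frames.
motion = estimate_motion(make_frame(1, 1), make_frame(2, 3), top=1, left=1)
```

Repeating the estimate for every block gives a velocity field over the image; following one block from frame to frame is a very simple form of tracking.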
 Scene reconstruction
 Given one or (typically) more images of a scene, or a
video, scene reconstruction aims at computing a
3D model of the scene. In the simplest case the
model can be a set of 3D points. More sophisticated
methods produce a complete 3D surface model.
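The "set of 3D points" case can be sketched for two rectified cameras, where a point seen at column u_left in the left image and u_right in the right image has depth Z = f * B / (u_left - u_right). The rectified two-camera setup, the pinhole model and all numeric values (focal length, baseline, principal point) are assumptions for this example; the slides do not fix a camera configuration.

```python
# Sparse 3D reconstruction sketch for a rectified stereo pair (assumed setup).

def reconstruct_points(matches, focal_px, baseline_m, cx, cy):
    """Turn matched pixels into 3D points in the left camera frame.

    matches: iterable of (u_left, v, u_right) pixel coordinates of the same
             scene point in the left and right image (same row v after
             rectification).
    Returns a list of (X, Y, Z) tuples in metres.
    """
    points = []
    for u_left, v, u_right in matches:
        disparity = u_left - u_right
        if disparity <= 0:
            continue  # point at infinity or a bad match; skip it
        z = focal_px * baseline_m / disparity      # depth from disparity
        x = (u_left - cx) * z / focal_px           # back-project the pixel
        y = (v - cy) * z / focal_px
        points.append((x, y, z))
    return points

# Hypothetical matches and camera parameters.
matches = [(340.0, 240.0, 320.0), (400.0, 200.0, 360.0), (300.0, 100.0, 300.0)]
points = reconstruct_points(matches, focal_px=800.0, baseline_m=0.1,
                            cx=320.0, cy=240.0)
```

Larger disparity means a closer point: the first match (20 pixels) lies at about 4 m, the second (40 pixels) at about 2 m, and the zero-disparity match is dropped.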
Image restoration
 The aim of image restoration is the removal of noise
(sensor noise, motion blur, etc.) from images. The
simplest approach to noise removal is to apply a
filter, such as a low-pass filter or a
median filter.
 More sophisticated methods assume a model of what
the local image structures look like, a model which
distinguishes them from the noise.
 By first analysing the image data in terms of the local
image structures, such as lines or edges, and then
controlling the filtering based on local information
from the analysis step, a better level of noise removal
is usually obtained compared to the simpler
approaches.
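The two simple filters named above can be compared on a single noisy pixel. This is a minimal sketch: the 3x3 window, the choice to copy border pixels unchanged and the mean filter standing in for a low-pass filter are assumptions made for the example.

```python
# 3x3 median filter versus 3x3 mean (low-pass) filter on impulse noise.
from statistics import median

def filter3x3(image, reduce):
    """Replace each interior pixel by reduce() of its 3x3 neighbourhood.
    Border pixels are copied unchanged to keep the sketch short."""
    h, w = len(image), len(image[0])
    out = [row[:] for row in image]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            window = [image[y + dy][x + dx]
                      for dy in (-1, 0, 1) for dx in (-1, 0, 1)]
            out[y][x] = reduce(window)
    return out

def mean(values):
    """Arithmetic mean, used here as the simplest low-pass filter."""
    return sum(values) / len(values)

# Flat grey image with one bright "salt" pixel in the centre.
noisy = [[100] * 5 for _ in range(5)]
noisy[2][2] = 255

median_out = filter3x3(noisy, median)
mean_out = filter3x3(noisy, mean)
```

The median filter removes the outlier completely, while the mean filter spreads it over all nine neighbouring pixels. Neither looks at local structure, which is why both also blur fine lines and edges.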
Application: Augmented Reality
Application areas:
 Film production (the
“match move” problem)
 Heads-up display for cars
 Tourism
 Architecture
 Training
Technical challenges:
 Recognition of scene
 Accurate sub-pixel 3-D pose
 Real-time, low latency
Application: Medical Augmented Reality
Visually guided surgery: recognition and registration

Application: Automobile navigation
[Figures: lane departure warning; pedestrian detection]
Mobileye (see mobileye.com)

 Other applications: intelligent cruise control, lane change
assist, collision mitigation
 Systems already used in trucks and high-end cars
Vision-based System as Intelligent Machine:
 Machines with sensing, perceiving, reasoning and
acting capabilities similar to those of humans, built to make
human presence on this planet more meaningful.
 We intend to develop intelligent machines in order to
improve quality of life. Intelligent machines are not
meant to be competitors to human beings.
 Intelligent machines are being developed in order to
increase safety, improve productivity and enhance
the capabilities of society.
Structure of an Intelligent Machine:
[Figure: Sensor 1 ... Sensor n → Perception → Reasoning → Action → Effector 1 ... Effector n]
Structure of an Intelligent Machine (continued):
Agents interact with environments through sensors and effectors.
[Figure: the environment feeds the agent's sensors (perception); the agent's effectors act on the environment (action)]
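The sensor, perception, reasoning, action and effector stages can be sketched as one control loop. Everything below is a toy assumption made for illustration: the one-dimensional world, the class and function names and the step limit do not come from the lecture.

```python
# Sense -> perceive -> reason -> act loop for a toy agent (illustrative only).

class Environment:
    """Toy 1-D world: the agent is at `position`, a target is at `target`."""
    def __init__(self, position, target):
        self.position = position
        self.target = target

def sense(env):
    """Sensor: raw measurement, here the signed offset from agent to target."""
    return env.target - env.position

def perceive(reading):
    """Perception: turn the raw reading into a symbolic description."""
    if reading > 0:
        return "target_right"
    if reading < 0:
        return "target_left"
    return "at_target"

def reason(percept):
    """Reasoning: choose a motion command for the current percept."""
    return {"target_right": 1, "target_left": -1, "at_target": 0}[percept]

def act(env, command):
    """Effector: apply the command, which changes the environment."""
    env.position += command

def run(env, max_steps=20):
    """Repeat the loop until the target is reached or max_steps is hit."""
    steps = 0
    while steps < max_steps:
        command = reason(perceive(sense(env)))
        if command == 0:
            break
        act(env, command)
        steps += 1
    return steps

env = Environment(position=2, target=5)
steps = run(env)  # the agent needs three unit moves to reach the target
```

In a vision-based machine the sensor is a camera and the perception stage is where the computer vision algorithms of this course sit; the loop structure stays the same.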
An example:
Vision challenges in the shown machine:
 Changing lighting conditions
 Objects of different shapes, colors and sizes
 Objects may partially overlap one another
 Many more…
Vision-based machine for exploration in harsh environments:
Vision challenges in the shown machine:
 Natural environment
 Non-geometric objects
 Non-uniform lighting conditions
 Rugged surface makes image
stabilization difficult
 Many more…
Vision-based machine for textile defect detection:
Vision challenges in the shown machine:
 Defects are non-geometric in
nature
 Defects could be very small
 Defects could be similar to textile
patterns
 Many more…
Vision-based parts assembly:
Vision challenges in the shown machine:
 Fine precision in measurement
 Alignment detection
 Depth detection
 Many more…
Vision system for grading food products:
Vision challenges in the shown machine:
 Random background
 Defects may resemble food ingredients
 High speed of inspection
 Hazardous work environment
 Many more…
Vision and Semiconductor Inspection:
Vision challenges in the shown machine:
 Very high resolution sensing (imaging)
 High precision filtering
 High speed
 Many more…
Vision-based Vehicle Collision Avoidance System:
 It should work in different
lighting and weather conditions
 High speed
 Tracking capability
 Many more…
Examples of Algorithms of Computer Vision: Filtering
[Figure: noisy image and filtered image]
Examples of Algorithms of Computer Vision: Segmentation
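The slide images for this example are not available, so the algorithm originally shown is unknown. As a stand-in, one of the simplest segmentation methods is global thresholding: every pixel is labelled object or background by comparing it with a single grey level. The sketch below picks that level with Otsu's method (maximising the between-class variance); the choice of Otsu's method and the tiny test image are assumptions for illustration.

```python
# Segmentation by global thresholding, with the threshold chosen by Otsu's method.

def otsu_threshold(pixels, levels=256):
    """Return the grey level t that maximises the between-class variance
    when pixels <= t are background and pixels > t are foreground."""
    hist = [0] * levels
    for v in pixels:
        hist[v] += 1
    total = len(pixels)
    sum_all = sum(i * n for i, n in enumerate(hist))
    w_b = 0        # number of background pixels so far
    sum_b = 0      # sum of background grey levels so far
    best_t, best_var = 0, -1.0
    for t in range(levels):
        w_b += hist[t]
        sum_b += t * hist[t]
        if w_b == 0:
            continue           # no background pixels yet
        w_f = total - w_b
        if w_f == 0:
            break              # no foreground pixels left
        m_b = sum_b / w_b
        m_f = (sum_all - sum_b) / w_f
        var = w_b * w_f * (m_b - m_f) ** 2
        if var > best_var:
            best_t, best_var = t, var
    return best_t

def segment(image):
    """Return (threshold, mask) where mask is 1 for foreground pixels."""
    t = otsu_threshold([v for row in image for v in row])
    return t, [[1 if v > t else 0 for v in row] for row in image]

# Dark background with a bright object in the lower right corner.
image = [[10, 12, 200],
         [11, 210, 205]]
threshold, mask = segment(image)
```

A single global threshold works only when object and background differ clearly in brightness, which ties back to the lighting challenges listed for the example machines.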
Examples of Algorithms of Computer Vision: Feature Identification
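The slide images for this example are also missing, and the slides do not name a feature detector. Edges are among the most common low-level features, so the sketch below uses the Sobel operator as an assumed illustration; it approximates the gradient magnitude by |gx| + |gy| and leaves border pixels at zero.

```python
# Edge feature sketch using the 3x3 Sobel operator (assumed example detector).

def sobel_magnitude(image):
    """Return an edge-strength map: |gx| + |gy| at interior pixels, 0 on the border."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = ((image[y - 1][x + 1] + 2 * image[y][x + 1] + image[y + 1][x + 1])
                  - (image[y - 1][x - 1] + 2 * image[y][x - 1] + image[y + 1][x - 1]))
            # Vertical gradient: bottom row minus top row.
            gy = ((image[y + 1][x - 1] + 2 * image[y + 1][x] + image[y + 1][x + 1])
                  - (image[y - 1][x - 1] + 2 * image[y - 1][x] + image[y - 1][x + 1]))
            out[y][x] = abs(gx) + abs(gy)
    return out

# Vertical step edge: columns 0-2 are dark, columns 3-5 are bright.
image = [[0, 0, 0, 100, 100, 100] for _ in range(5)]
edges = sobel_magnitude(image)
edge_columns = sorted({x for row in edges for x, v in enumerate(row) if v > 0})
```

The response is non-zero only in the two columns on either side of the step, so thresholding the map gives the location of the edge.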
Basic Steps of Computer Vision Application Development
 Information to be extracted
 Scene analysis and feature selection
 Lighting design
 Camera design
 Image capturing and processing
 Feature extraction
 Feature classification
 Information derivation.
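The software side of these steps (processing, feature extraction, feature classification, information derivation) can be sketched end to end. The two features, the nearest-centroid classifier, the labels and the synthetic images are all assumptions chosen to keep the example small; lighting and camera design happen before any code runs and are not modelled.

```python
# Feature extraction + nearest-centroid classification sketch (illustrative only).

def extract_features(image, level=128):
    """Feature extraction: (bright-pixel count, mean brightness)."""
    pixels = [v for row in image for v in row]
    area = sum(1 for v in pixels if v >= level)
    return (area, sum(pixels) / len(pixels))

def train(examples):
    """Average the feature vectors of each label into one centroid per class."""
    sums, counts = {}, {}
    for image, label in examples:
        features = extract_features(image)
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: tuple(v / counts[label] for v in acc)
            for label, acc in sums.items()}

def classify(image, centroids):
    """Feature classification: label of the nearest centroid (squared distance)."""
    features = extract_features(image)
    return min(
        centroids,
        key=lambda label: sum((a - b) ** 2
                              for a, b in zip(features, centroids[label])),
    )

def blob(n_bright, size=4):
    """Synthetic size x size image with n_bright pixels set to 255 (test data)."""
    flat = [255] * n_bright + [0] * (size * size - n_bright)
    return [flat[i * size:(i + 1) * size] for i in range(size)]

# Hypothetical labelled examples: small parts versus large parts.
examples = [(blob(2), "small"), (blob(3), "small"),
            (blob(10), "large"), (blob(12), "large")]
centroids = train(examples)

# Information derivation: the label is the information the application needs.
label = classify(blob(9), centroids)
```

The features are used unscaled here, which is acceptable only because both grow together in this toy data; in practice features are normalised so that no single one dominates the distance.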
