Anda di halaman 1dari 17

SPATIAL MINING

ATTRIBUTES

Pratheepchanther T
I M.Sc Data Analytics
Agenda
Spatial Mining – Introduction

Spatial Data Mining Architecture

Spatial Data – Properties

Spatial Databases

Spatial Mining – Attributes

Spatial Mining - Attribute values

Spatial Mining - Attribute types


Spatial Mining - Introduction
Spatial data mining is the application of data mining to spatial models. In spatial data mining, analysts
use geographical or spatial information to produce business intelligence or other results. This requires specific
techniques and resources to get the geographical data into relevant and useful formats.

What is : core goal of a spatial data mining project is to distinguish the information in order to build real,
actionable patterns to present, excluding things like statistical coincidence, randomized spatial modeling or
irrelevant results. One way analysts may do this is by combing through data looking for "same-object" or "object-
equivalent" models to provide accurate comparisons of different geographic locations.

What is not : setting up a visual map of geographic data

Ref: https://www.techopedia.com/definition/30595/spatial-data-mining
Spatial Data Mining Architecture
HUMAN COMPUTER INTERACTION SYSTEM

SPATIAL DATA MINING


DISCOVERABLE KNOWLEDGE
SYSTEM
KNOWLEDGE BASE
DATA RELATED TO PROBLEM MANAGEMENT SYSTEM

SPATIAL DATA BASE DOMAIN KNOWLEDGE


MANAGEMENT SYSTEM DATABASE

SPATIAL DATA BASE

Ref:spatialdatamining-150327100401-conversion-gate01.pdf
Spatial Data - Properties

• Spatial • Spatial • Implicit Spatial


autocorrelation heterogeneity Relations

Ref:spatialdatamining-150327100401-conversion-gate01.pdf
Spatial Data - Properties - Spatial autocorrelation
Items in a traditional data are independent of each other, – whereas
properties of locations in a map are often “auto-correlated”.

First law of Geography [Tobler]: – Everything is related to everything, but


nearby things are more related than distant things.

Example:
• People with similar backgrounds tend to live in the same area.
• Economies of nearby regions tend to be similar
• Changes in temperature occur gradually over space

Ref:spatialdatamining-150327100401-conversion-gate01.pdf
Spatial Data - Properties - Spatial heterogeneity
• Auto correlation
• Patterns usually have to be defined in the spatial attribute subspace
and not in the complete attribute space
• Longitude and latitude (or other coordinate systems) are the glue that
link different data collections together.
• People are used to maps in GIS therefore, data mining results have to
summarized on the top of maps.
• Patterns not only refer to points, but can also refer to lines, or
polygons or other higher order geometrical objects

Ref:spatialdatamining-150327100401-conversion-gate01.pdf
Spatial Data - Properties - Spatial Relations
Spatial databases do not store spatial relations explicitly – Additional functionality
required to compute them

Three types of spatial relations specified by the Open Geospatial Consortium (OGC)
reference model

– Distance relations
Euclidean distance between two spatial features

– Direction relations In mathematics, topology is


Ordering of spatial features in space concerned with the properties of a
geometric object that are
preserved under continuous
– Topological relations deformations, such as stretching,
twisting, crumpling and bending,
Characterize the type of intersection between spatial features but not tearing or gluing.

Ref:spatialdatamining-150327100401-conversion-gate01.pdf
Spatial Databases
Stores a large amount of space-related data Few Spatial databases:
• Maps SQL, MySQL(geometry),
• Remote Sensing Neo4J, IBM Informix,
Oracle Spatial, H2, SAP
• Medical Imaging HANA,GeoMesa
• VLSI chip layout (Apache’s spatio-
temporal database)

Have Topological and distance information


• Require spatial indexing
• Data access
• Reasoning
• Geometric computation
• Knowledge representation techniques

Ref:1) spatialdatamining-150327100401-conversion-gate01.pdf
2) https://en.wikipedia.org/wiki/Spatial_database
Spatial Mining - Attributes
Spatial Information - are generally multi-dimensional and correlated
• Spatial attribute: geographically referenced
• Neighbourhood and extent
• Location, e.g., longitude, latitude, elevation

Non-spatial Information - are generally one dimensional and independent


• Same as data in traditional data mining
• Numerical, Categorical, Ordinal, Boolean, etc
e.g., city name, city population

Ref:spatialdatamining-150327100401-conversion-gate01.pdf
Spatial Mining - Attributes
Discrete attributes
Has only a finite or countable infinite set of values
Example:
• Zip codes,
• Count or set of words in a collection of documents
• Often represented as integer variables
• Note: Binary attributes are a special case of discrete attributes

Continuous attributes
Has real numbers as attribute values. Real values can only be measured and
represented using finite number of digits and typically represented as floating
point variables
Example:
• Temperature
• Height or Weight
Ref:http://user.it.uu.se/~kostis/Teaching/DM-05/Slides/lecture02.pdf
Spatial Mining - Attribute values
Numbers or Symbols assigned to an attribute
Attribute v/s Attribute values
Same attribute can be mapped to different values
Example: Height can be measured in feet or meters

Different attribute can be mapped to same set of values


Example: Attribute values for ID and age are integers

But properties of attribute values can be different


Example: ID has no limit but age has min & max

Ref: http://www.jaist.ac.jp/~bao/talks/SpatialDM.pdf
Spatial Mining - Attribute types

Ref: https://sqlmentalist.com/2012/06/28/bisql-100-sql-server-with-spatial-dataopportunity-use-new-features-and-example-with-few-queries/
SPATIAL ATTRIBUTE DATA

SPATIAL data ATTRIBUTE data


All types of data objects or elements Characters of geographical feature
present in geographical space that at qualitative and quantitative in
nature
Describe the absolute and relative Describe the characteristics of
location of geographical feature geographical feature
Satellite images, scanned maps etc Town planning departments, fire
help to obtain spatial data departments, environmental groups
etc help to obtain attribute data

Ref: https://pediaa.com/what-is-the-difference-between-attribute-data-and-spatial-data/
Spatial Mining - Data types

Ref: https://sqlmentalist.com/2012/06/28/bisql-100-sql-server-with-spatial-dataopportunity-use-new-features-and-example-with-few-queries/
Spatial Mining - Attribute types
Attribute
Description Transformation Operation Comments Example
type
The values of a nominal attribute
Nominal

are just different names, i.e., mode, entropy, If all employee ID numbers zip codes, employee ID
Any permutation of
nominal attributes provide only contingency were reassigned, would it make numbers, eye color,
values
enough information to distinguish correlation, χ2 test any difference? sex: {male, female}
one object from another. (=, ≠)
An order preserving median, An attribute encompassing the
hardness of minerals,
Ordinal

The values of an ordinal attribute change of values, i.e., percentiles, rank notion of good, better
{good, better, best},
provide enough information to new_value = correlation, run best can be represented
grades, street numbers
order objects. (<, >) f(old_value) where f is tests, sign tests equally well by the values {1, 2,
a monotonic function. 3} or by {0.5, 1, 10}.
mean, standard
Thus the Fahrenheit and
For interval attributes, the new_value =a * deviation,
Interval

Celsius temperature scales calendar dates,


differences between values are old_value + b where a Pearson's
differ in terms of where their temperature in Celsius
meaningful, i.e., a unit of and b are constants correlation, t and F
zero value is and the size of a or Fahrenheit
measurement exists. (+, - ) tests
unit (degree).

temperature in Kelvin,
geometric mean,
For ratio variables, both monetary quantities,
Ratio

new_value = a * harmonic mean, Length can be measured in


differences and ratios are counts, age, mass,
old_value percent variation meters or feet
meaningful. (*, /) length, electrical
current

Ref: http://www.jaist.ac.jp/~bao/talks/SpatialDM.pdf
Thank you

Anda mungkin juga menyukai