Kumpulan facta
Data Mining
• Data mining (pencarian pengetahuan dari data)
• Definisi Data Mining:
Mengekstrak secara otomatis pola atau pengetahuan yang
menarik (tidak sederhana, tersembunyi, tidak diketahui
sebelumnya, berpotensi berguna) dari data dalam jumlah
sangat besar.
The Information Continuum
• Origin
• Authenticity
• Trustworthiness
• Completeness
• Integrity
Big Data Empowering AI and Machine Learning
Industry 4.0
Six Design Principles
• Interoperability: the ability of cyber-physical systems (i.e. work piece carriers,
assembly stations and products), humans and Smart Factories to connect and
communicate with each other via the Internet of Things and the Internet of Services
• Virtualization: a virtual copy of the Smart Factory which is created by linking sensor
data (from monitoring physical processes) with virtual plant models and simulation
models
• Decentralization: the ability of cyber-physical systems within Smart Factories to make
decisions on their own
• Real-Time Capability: the capability to collect and analyze data and provide the
insights immediately
• Service Orientation: offering of services (of cyber-physical systems, humans and Smart
Factories) via the Internet of Services
• Modularity: flexible adaptation of Smart Factories for changing requirements of
individual modules
SINTA: Towards Big Data
Current Progress
Data Integration:
A Higher-level Abstraction
Query Independence of:
• source & location
Mediated Schema • data model, syntax
• semantic variations
•…
Semantic
Mappings
SSN
S1 Name Category SSN CID S2 S3
123-45-6789 Charles undergrad 123-45-6789 CSE444 <cd> <title> The best of … </title>
… …
234-56-7890 Dan grad 123-45-6789 CSE444
… … 234-56-7890 CSE142 <artist> Carreras </artist>
… <artist> Pavarotti </artist>
CID Name Quarter <artist> Domingo </artist>
CSE444 Databases fall
CSE541 Operating systems winter <price> 19.95 </price>
</cd>
Interoperability
• The exchange of information that preserves the meaning and
relationships of the data exchanged.
• Interoperability is the property that allows for the unrestricted
sharing of resources between different systems. This can refer to the
ability to share data between different components or machines,
both via software and hardware, or it can be defined as the exchange
of information and resources between different computers
Data aggregation
• Data aggregation is any process in which information is gathered and
expressed in a summary form, for purposes such as statistical
analysis. A common aggregation purpose is to get more information
about particular groups based on specific variables such as age,
profession, or income.
Interoperability Source
PD-DIKTI
Pemeringkatan Author: overall vs 3 years
Top Afiliasi : 3 years, Overall years
Profil Afiliasi
Profil Author
Collaboration Network
(author)
Author
(2 level
network)
PD-DIKTI
V-4: Veracity
Kepercayaan data tergantung dari:
• Validity profile by PD-DIKTI
• Level of trusted indexer: Scopus, WoS, Crossref, GARUDA, GS
Next SINTA Work
• Machine Learning Applied
• Field Area categorization
• Prediction of next
• Normalized Field Area of Author performance
• Personality behavior detection
• Publication Pattern
• Plagiarism Detection
Terimakasih
imam@unissula.ac.id