Sparse vector products

[Figure: the token «word» (id 1337) encoded as a 1-hot vector over n tokens — all zeros except a 1 at position 1337 — feeding a linear layer with one weight row per token, W_1 … W_n]
Sparse vector products

[Figure: the 1-hot vector for «word» (id 1337) multiplied by the weight matrix W_ij, i = 1…n, j = 1…h — what does this dot product compute?]
Embedding

[Figure: the same product resolved — since only component 1337 of the 1-hot vector is non-zero, the dot product with W_ij (i = 1…n, j = 1…h) simply returns row 1337 of W]
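A minimal sketch of this equivalence, assuming NumPy; the sizes n, h match the slide's notation, and the concrete values are illustrative:

import numpy as np

n, h = 10_000, 128            # vocabulary size, embedding size (assumed values)
W = np.random.randn(n, h)

one_hot = np.zeros(n)
one_hot[1337] = 1.0           # «word» has id 1337

# The full dot product over all n rows...
via_dot = one_hot @ W
# ...equals simply taking row 1337 — no multiplication needed.
via_lookup = W[1337]
assert np.allclose(via_dot, via_lookup)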
Embedding: word2vec

“Peace is a lie, there is only passion”

[Figure: the word2vec architecture — a 1-hot input over n tokens is projected by W_ij (i = 1…n, j = 1…h) into a hidden layer of h units, then by W_jk (j = 1…h, k = 1…n) back to an n-way output over the vocabulary, trained to predict the 1-hot vectors of surrounding context words]
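A minimal skip-gram-style sketch of this two-matrix architecture, assuming PyTorch; the sizes and word ids are made up for illustration:

import torch
import torch.nn as nn

n, h = 10_000, 128                     # vocabulary and hidden sizes (assumed)

model = nn.Sequential(
    nn.Embedding(n, h),                # W_ij: 1-hot input -> h-unit hidden layer
    nn.Linear(h, n, bias=False),       # W_jk: hidden layer -> n output scores
)

center = torch.tensor([1337])          # id of the input word
logits = model(center)                 # shape (1, n): one score per vocabulary token
context = torch.tensor([42])           # id of an observed context word (made up)
loss = nn.functional.cross_entropy(logits, context)  # softmax over all n tokens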
Embedding: word2vec

The distributional hypothesis: similar context = similar meaning.

[Figure: the same architecture, annotated — on the input side, the “embedding layer” just takes a row from the matrix (super fast); on the output side, the softmax problem: a dense layer over ~10^5 units (your CPUs gonna burn); in between, the hidden layer of h units]
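A minimal sketch of that asymmetry, assuming NumPy (sizes are illustrative): the input side is one row fetch, while the output softmax must touch every vocabulary token on every training example:

import numpy as np

n, h = 100_000, 128           # ~10^5 vocabulary tokens, hidden size (assumed)
W_in = np.random.randn(n, h)
W_out = np.random.randn(h, n)

# Input side: the "embedding layer" is a single row fetch — O(h), super fast.
hidden = W_in[1337]

# Output side: the softmax needs scores for ALL n tokens — O(n·h) per example.
scores = hidden @ W_out
probs = np.exp(scores - scores.max())
probs /= probs.sum()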
More word embeddings

Faster softmax:
• hierarchical softmax, negative sampling (sketch below), …
• learn more

Alternative models: GloVe

Sentence level:
• Doc2vec, skip-thought (using RNNs)
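A minimal sketch of negative sampling, assuming NumPy; the function name, number of negatives, and word ids are made up for illustration:

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def negative_sampling_loss(hidden, W_out, pos_id, neg_ids):
    # Score only the true context word and a few sampled "negative" words,
    # instead of computing a softmax over all n vocabulary tokens.
    pos = sigmoid(hidden @ W_out[:, pos_id])      # pushed towards 1
    neg = sigmoid(-(hidden @ W_out[:, neg_ids]))  # negatives pushed towards 0
    return -np.log(pos) - np.log(neg).sum()

h, n = 128, 10_000
hidden = np.random.randn(h)
W_out = np.random.randn(h, n)
loss = negative_sampling_loss(hidden, W_out, pos_id=42,
                              neg_ids=np.random.randint(0, n, size=5))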
To be continued...
in the NLP course