¤H¤u´¼¼zªºÀ³¥Î¹ê§@
§@ªÌ: ®L»F¼Ý
ªì½Z: 20220819
¦ÛµM»y¨¥³B²z(Natural language processing)
¬ì§Þ> [¤H¤u´¼¼z] CNN¡A¼v¹³¤À°Ï¶ô»PRNN
http://cubicpower.idv.tw/cubicnotes/notes-0000038.html
¤å¦r±´°É(Text Mining)
Google Tensorflow: Text
Sentiment analysis- IMDB large movie review dataset
Basic text classification
https://www.tensorflow.org/tutorials/keras/text_classification?hl=zh-tw
°ò¥»¤å¥»¤ÀÃþ
±¡ºü¤ÀªR
¤U¸ü¨Ã±´¯Á IMDB ¼Æ¾Ú¶°
¥[¸ü¼Æ¾Ú¶°
·Ç³Æ¼Æ¾Ú¶°¶i¦æ°V½m
°t¸m¼Æ¾Ú¶°¥H´£°ª©Ê¯à
³Ð«Ø¼Ò«¬
·l¥¢¨ç¼Æ©MÀu¤Æ¾¹
°V½m¼Ò«¬
µû¦ô¼Ò«¬
³Ð«ØÀH®É¶¡Åܤƪº·Ç½T«×©M·l¥¢¹Ï
¾É¥X¼Ò«¬
¹ï·s¼Æ¾Úªº±À½×
½m²ß¡GÃö©ó Stack Overflow °ÝÃDªº¦hÃþ¤ÀÃþ
Word embeddings
https://www.tensorflow.org/text/guide/word_embeddings?hl=zh-tw
µü´O¤J
±N¤å¥»ªí¥Ü¬°¼Æ¦r
One-hot ½s½X
¥Î°ß¤@ªº¼Æ¦r½s½X¨CÓ³æµü
µü´O¤J
³]¸m
¤U¸ü IMDb ¼Æ¾Ú¶°
¨Ï¥Î´O¤J¼h
¤å¥»¹w³B²z
³Ð«Ø¤ÀÃþ¼Ò«¬
½sĶ©M°V½m¼Ò«¬
À˯Á¸g¹L°V½mªºµü´O¤J¨Ã±N¥¦Ì«O¦s¨ìºÏºÐ
¥iµø¤Æ´O¤J
Text classification with an RNN
https://www.tensorflow.org/text/tutorials/text_classification_rnn?hl=zh-tw
¨Ï¥Î RNN ¶i¦æ¤å¥»¤ÀÃþ
³]¸m
³]¸m¿é¤JºÞ¹D
³Ð«Ø¤å¥»½s½X¾¹
³Ð«Ø¼Ò«¬
°V½m¼Ò«¬
°ïÅ|¨âөΦhÓ LSTM ¼h
Classify text with BERT
https://www.tensorflow.org/text/tutorials/classify_text_with_bert?hl=zh-tw
¨Ï¥Î BERT ¹ï¤å¥»¶i¦æ¤ÀÃþ
Ãö©ó BERT
±¡ºü¤ÀªR
±q TensorFlow Hub ¥[¸ü¼Ò«¬
¿ï¾Ü¤@Ó BERT ¼Ò«¬¶i¦æ·L½Õ
¹w³B²z¼Ò«¬
¨Ï¥Î BERT ¼Ò«¬
©w¸q§Aªº¼Ò«¬
¼Ò«¬°V½m
·l¥¢¨ç¼Æ
Àu¤Æ¾¹
¥[¸ü BERT ¼Ò«¬¨Ã¶i¦æ°V½m
µû¦ô¼Ò«¬
ø»sÀH®É¶¡Åܤƪº·Ç½T©Ê©M·l¥¢
¾É¥X±À²z
Search: LogisticRegression
[Day 9] ÅÞ¿è°jÂk(Logistic Regression) - iT ¨¹À°¦£
https://ithelp.ithome.com.tw › articles
[Python¹ê§@]ÅÞ¿è´µ°jÂk¼Ò«¬Logistic Regression - PyInvest
https://pyecontech.com › 2020/02/06 › python_logistic...
Search: kaggle µn¤J
¨âºØ¨ú¥ÎKaggle ¸ê®Æ¶°ªº¤èªk - Nancy SW
https://nancysw.medium.com › ¨âºØ¨ú¥Î-kaggle-¸ê®Æ...
Search: ²á¤Ñ¾÷¾¹¤H python
¦bPython ¤¤¹ê§@¹ï¸Ü«¬²á¤Ñ¾÷¾¹¤H - WANcatServer
https://wancat.cc › post › python-chatbot-context
Search: chatbot python github
zake7749/Chatbot: °ò©ó¦V¶q¤Ç°tªº±¡¹Ò¦¡²á¤Ñ¾÷¾¹¤H - GitHub
https://github.com › zake7749 › Chatbot
Mianbot
¤Ç°t¥Ü¨Ò
Àô¹Ò»Ý¨D
¨Ï¥Î¤è¦¡
²á¤Ñ¾÷¾¹¤H
pºâ¤Ç°t«×
³W«h®æ¦¡
°Ýµª´ú¸Õ¥Î¸ê®Æ¶°
Building a Simple Chatbot from Scratch in Python (using NLTK)
https://github.com › parulnith › Building-...
Building a Simple Chatbot from Scratch in Python (using NLTK)
NLP
Import necessary libraries
Downloading and installing NLTK
Installing NLTK Packages
Reading in the corpus
Tokenisation
Preprocessing
Keyword matching
Generating Response
Bag of Words
TF-IDF Approach
Cosine Similarity
Day 12¡G§Ö³t§¹¦¨¤@Ó¡y¹ï¸Ü¾÷¾¹¤H¡z(ChatBot)
Day 13¡G§Ö³t§¹¦¨¤@Ó¡y¹ï¸Ü¾÷¾¹¤H¡z(ChatBot) -- Äò
Search: ¤Ï¬~¿ú ¼Ò«¬ python
¡i¥É¤sAI¹ê¨Ò3¡j¦Û«Ø¤Ï¬~¿ú¶Â¦W³æ°»´ú¼Ò«¬¡A§Ö³t´ª¥X°ÝÃD
https://www.ithome.com.tw › news
(Top 1% Solution)¥É¤s¤H¤u´¼¼z¤½¶}¬D¾ÔÁÉ2020®L©uÁÉ - Medium
https://medium.com › ¥É¤s¤H¤u´¼¼z¤½¶}¬D¾ÔÁÉ2020®L...
Search: ¨¾¶B´Û¼Ò«¬ python
¾aAI§êºtÃa¤H¨Ó½m§L¡ISAS´¦ÅS¥ÎGAN³]pª÷¿Ä¨¾¶B´Û¼Ò«¬ªº·s ...
https://www.ithome.com.tw › news
¨BÆJ¤@¡Bºc«ØAmazon Fraud Detector ¼Ò«¬
https://pages.awscloud.com › Tech-blog_Amazon-Frau...
Search: ¶B´Û python
python «H¥Î¥d´Û¶B¼Ò«¬«Ø¥ß - µ{¦¡¤H¥Í
https://www.796t.com › content
¸ê®Æ·Ç³Æ: ¨Ó·½©óKaggle
·Ç³Æ¨Ãªì¨BÀ˵ø¸ê®Æ¶°
®É¶¡§Ç¦C¤Uªº¥æ©öµo¥ÍÀW²v¡]¤À¬°¶BÄF©M¥¿±`¡^
¶BÄF©M¥¿±`¥æ©ö¥æ©öª÷ÃBªºÀW²v¤À§G
¦U¯S¼x©M¦]ÅܼƪºÃö«Y
¥ÎÅÞ¿è°jÂk¤èªk¹ï«H¥Î¥d¸ê®Æ¶i¦æ«Ø¼Ò¤ÀªR
«H¥Î¥d¶BÄF¤ÀªR-¤£¥¿Å¸ê®Æ¤ÀªR»P³B²zkernel½Ķ-§¹¾ãª©
https://medium.com › ¾÷¾¹¾Ç²ßª¾ÃѾúµ{ › «H¥Î¥d¶BÄF...
¹w³B²z
ÁY©ñ©M¤À°t Scaling and Distributing
©î¤À¼Æ¾Ú Splitting the Data¡]±qì©lDataFrame¡^
ÀH¾÷¤í±Ä¼Ë©M¹L±Ä¼Ë
¤À§G©M¬ÛÃö©Ê Distributing and Correlating
²§±`ÀË´ú Anomaly Detection
°ºû©M¤À¸s Dimensionality Reduction and Clustering (t-SNE)
¤ÀÃþ¾¹ Classifiers
§ó²`¤J¦a¤F¸ÑÅÞ¿è¦^Âk A Deeper Look into Logistic Regression
¨Ï¥ÎSMOTE¶i¦æ¹L±Ä¼Ë Oversampling with SMOTE
´ú¸Õ
¨Ï¥ÎÅÞ¿è¦^Âk¶i¦æ´ú¸Õ Test Data with Logistic Regression
¯«¸gºôµ¸´ú¸Õ¡]¤í±Ä¼Ë»P¹L±Ä¼Ë¡^Neural Networks Testing (Undersampling vs Oversampling)
Part 5. Imbalanced Data ¤£¥¿Å¸ê®Æ - iT ¨¹À°¦£
https://ithelp.ithome.com.tw › articles
µû¦ô«ü¼Ð
A. Confusion Matrix ²V²c¯x°}
B. Precision and Recall ºë½T²v»P¥l¦^²v
C. F1 score
D. ROC(Receiver Operating Characteristic) ±µ¦¬ªÌ¾Þ§@¯S¼x¦±½u
¦±½u¤U±¿nºÙ Area Under Curve (AUC)
«²Õ¸ê®Æ
A. Oversampling ¹L±Ä¼Ë
A1. SMOTE (Synthetic Minority Oversampling Technique)
A2. Border Line SMOTE
B. Undersampling ¤í±Ä¼Ë
B1. Tomek Link
B2. Edited Nearest Neighbor
ª`·N¨Æ¶µ
A. ¥ý¤Á¤À¸ê®Æ¡A¦A¹ï°V½m¸ê®Æ±Ä¼Ë¡C
B. ±`³z¹L¥æ¤eÅçÃÒ±±¨î¹LÀÀ¦X¡C
C. Æ[¹î¤Ö¼Æ¼Ë¥»»P¦h¼Æ¼Ë¥»¤À¥¬±¡§Î¡C
Search: Credit Fraud || Dealing with Imbalanced Datasets
Credit Fraud || Dealing with Imbalanced Datasets - Kaggle
https://www.kaggle.com › janiobachmann
Credit Fraud || Dealing with Imbalanced Datasets
https://www.kaggle.com/janiobachmann/credit-fraud-dealing-with-imbalanced-datasets
https://www.kaggle.com/code/janiobachmann/credit-fraud-dealing-with-imbalanced-datasets/notebook
Search: «H¥Îµû¤À python
Python¤§«H¥Îµû¤À¥d¼Ò«¬¹ê²{ - GetIt01
°ò©óPythonªº«H¥Îµû¤À¼Ò«¬¶}µo-ªþ¸ê®Æ©Mµ{¦¡½X - ¥j¸Öµü®w
https://www.gushiciku.cn › zh-tw
±M®×¬yµ{
¸ê®ÆÀò¨ú
¸ê®Æ¹w³B²z
¯Ê¥¢È³B²z
²§±`ȳB²z
¸ê®Æ¤Á¤À: ¤À¦¨ °V½m¶°©M´ú¸Õ¶°
±´¯Á©Ê¤ÀªR
Åܼƿï¾Ü
¤À½c³B²z
WOE
¬ÛÃö©Ê¤ÀªR©MIV¿z¿ï
¼Ò«¬¤ÀªR
WOEÂà´«
Logisic¼Ò«¬«Ø¥ß
¼Ò«¬ÀËÅç
«H¥Îµû¤À
¦Û°Êµû¤À¨t²Î
Search: GiveMeSomeCredit
jwu424/GiveMeSomeCredit - GitHub
https://github.com › jwu424 › GiveMeSo...
«H¥ÎÃB: line of credit¡Acredit line.
WOE¡]Weight of Evidence¡^ÃÒ¾ÚÅv«¡A±`¥Î©ó¯S¼xÅÜ´«¡A
IV¡]Information Value¡^¸ê°T»ùÈ¡A©ÎªÌ¸ê°T¶q¡A¥Î¨Ó¿Å¶q¯S¼xªº¹w´ú¯à¤O¡C
1. WOE describes the relationship between a predictive variable and a binary target variable.
2. IV measures the strength of that relationship.
1. WOE ´yz¤F¹w´úÅܶq©M¤G¤¸¥Ø¼ÐÅܶq¤§¶¡ªºÃö«Y¡C
2. IV ¿Å¶q³oºØÃö«Yªº±j«×¡C
Search: woe iv
¹ïwoe©Mivªº¤@¨Ç²z¸Ñ©M¬Ýªk - ¤H¤HµJÂI
Search: woe iv python
https://arsene5240.medium.com › woeȤÎivÈ-python...
Search: ºë·Ç¦æ¾P python
¾÷¾¹¾Ç²ß¨t¦C¤¡G½Ö·|ñ¬ù¡H¥H¡uºë·Ç¦æ¾P¼Ò«¬¡vµû¦ôÅU«È±a¨Ó ...
https://medium.com › marketingdatascience › ¾÷¾¹¾Ç²ß...
¾÷¾¹¾Ç²ßX ºë·Ç¦æ¾PKDD 2.0µ{§Ç¡G¡i¤º³¡¸ê®Æ¡j¹ê®×À³¥Î¡]ªþ ...
https://medium.com › marketingdatascience › ¾÷¾¹¾Ç²ß...
Search: «È¤á¤À¸s python
Python¥ÎK-means»EÃþºtºâªk¶i¦æ«È¤á¤À¸sªº¹ê²{ - µ{¦¡¤H¥Í
https://www.796t.com › article
Day 02¡G«È¤á¤À¸s(Customer Segmentation) -- ¨º¨Ç«È¤á¬OVIP?
«È¤á¤À¸s(Customer Segmentation) -- ¨º¨Ç«È¤á¬O§ÚªºVIP? (Äò)
https://ithelp.ithome.com.tw › articles
Search: ¤å¦rÃöÁp python
¡ipython¸ê®Æ±´°É½Òµ{¡j¤G¤Q¥|.KMeans¤å¦r»EÃþ¤ÀªR¤¬°Ê¦Ê¬ì ...
https://codertw.com › µ{¦¡»y¨¥
https://hackmd.io › python-bigdata-02
¤å¦r¶³
µ²¤Ú
¤Àµü Â_µü
µüÀW
°±¥Îµü
Search: Python ¤å¦r±´°É
¤å¥ó±´°É(Text Mining) — §â¤å¦r¥Î¼Æ¦rªí¥Ü - Medium
https://medium.com › ¥øÃZ¤]À´µ{¦¡³]p › ¤å¥ó±´°É-te...
Text Mining & ºô¸ôª¦ÂÎweb crawler | Google·s»D»P¤å³¹¤å¦r¶³
https://jamleecute.web.app › ºô¸ôª¦ÂÎ-web-crawler-text...
Search: Python ¤å¦r±´°É À³¥Î
[Python¾÷¾¹¾Ç²ß]-¦Û°Ê§PÂ_¯d¨¥¥¿tµû(¹B¥ÎBERT model¡^with ...
https://medium.com › python¾÷¾¹¾Ç²ß-google§Úªº°Ó...
Search: Python ¤å¦r±´°É §Þ³N
python¤å¦r±´°É¡A¸ê®Æ«e³B²z¬yµ{¤¶²Ð
https://dannypheobe.blogspot.com › 2016/07 › python
10 Text Mining - LearnPython - GitBook
https://datasciencetw.gitbook.io › python › 10-text-mini...
Google Tensorflow: Text
Sentiment analysis- IMDB large movie review dataset
Basic text classification
https://www.tensorflow.org/tutorials/keras/text_classification?hl=zh-tw
Word embeddings
https://www.tensorflow.org/text/guide/word_embeddings?hl=zh-tw
Text classification with an RNN
https://www.tensorflow.org/text/tutorials/text_classification_rnn?hl=zh-tw
Classify text with BERT
https://www.tensorflow.org/text/tutorials/classify_text_with_bert?hl=zh-tw