晶文摘

[Artificial Intelligence] Applied Implementations of Artificial Intelligence



Author: 夏肇毅

First draft: 20220819






Natural Language Processing

Technology > [Artificial Intelligence] CNN, Image Block Partitioning, and RNN

http://cubicpower.idv.tw/cubicnotes/notes-0000038.html


Text Mining

Google Tensorflow: Text

Sentiment analysis- IMDB large movie review dataset

Basic text classification 

https://www.tensorflow.org/tutorials/keras/text_classification?hl=zh-tw

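Sentiment analysis

Download and explore the IMDB dataset

Load the dataset

Prepare the dataset for training

Configure the dataset for performance

Create the model

Loss function and optimizer

Train the model

Evaluate the model

Create a plot of accuracy and loss over time

Export the model

Inference on new data

Exercise: multi-class classification on Stack Overflow questions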


Word embeddings

https://www.tensorflow.org/text/guide/word_embeddings?hl=zh-tw

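Representing text as numbers

One-hot encodings

Encode each word with a unique number

Word embeddings

Setup

Download the IMDb dataset

Using the Embedding layer

Text preprocessing

Create a classification model

Compile and train the model

Retrieve the trained word embeddings and save them to disk

Visualize the embeddings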
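The first two strategies named in the outline above (one-hot encodings, and encoding each word with a unique number) can be sketched in plain Python. This is only an illustration: the sample sentence and the helper names `build_vocab`, `encode_ints`, and `one_hot` are assumptions made here and do not come from the TensorFlow guide.

```python
def build_vocab(sentence):
    """Map each distinct word to an integer index, in order of first appearance."""
    vocab = {}
    for word in sentence.lower().split():
        if word not in vocab:
            vocab[word] = len(vocab)
    return vocab


def encode_ints(sentence, vocab):
    """Encode each word as its unique integer (compact, but the numbers are arbitrary)."""
    return [vocab[word] for word in sentence.lower().split()]


def one_hot(sentence, vocab):
    """Encode each word as a vector with a single 1 (length equals the vocabulary size)."""
    vectors = []
    for index in encode_ints(sentence, vocab):
        row = [0] * len(vocab)
        row[index] = 1
        vectors.append(row)
    return vectors


sentence = "The cat sat on the mat"  # illustrative sentence (assumption)
vocab = build_vocab(sentence)
int_codes = encode_ints(sentence, vocab)
one_hot_codes = one_hot(sentence, vocab)

print(vocab)
print(int_codes)
print(one_hot_codes[0])
```

Both encodings are fixed by hand and carry no notion of similarity between words. A trained embedding layer instead learns a dense vector per word during training.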


Text classification with an RNN 

https://www.tensorflow.org/text/tutorials/text_classification_rnn?hl=zh-tw

Setup

Set up the input pipeline

Create the text encoder

Create the model

Train the model

Stack two or more LSTM layers


Classify text with BERT

https://www.tensorflow.org/text/tutorials/classify_text_with_bert?hl=zh-tw

About BERT

Sentiment analysis

Loading models from TensorFlow Hub

Choose a BERT model to fine-tune

The preprocessing model

Using the BERT model

Define your model

Model training

Loss function

Optimizer

Loading the BERT model and training

Evaluate the model

Plot the accuracy and loss over time

Export for inference


Search: LogisticRegression

 

[Day 9] Logistic Regression - iT 邦幫忙

https://ithelp.ithome.com.tw › articles

[Python Implementation] Logistic Regression Model - PyInvest

https://pyecontech.com › 2020/02/06 › python_logistic...

 

 

 

 

Search: kaggle login (kaggle 登入)

Two ways to access Kaggle datasets - Nancy SW

https://nancysw.medium.com › 兩種取用-kaggle-資料...

 



Search: chatbot python (聊天機器人 python)


Implementing a conversational chatbot in Python - WANcatServer

https://wancat.cc › post › python-chatbot-context


Search:  chatbot python github

 

zake7749/Chatbot: a context-aware chatbot based on vector matching - GitHub

https://github.com › zake7749 › Chatbot

Mianbot

Matching examples

Environment requirements

Usage

Chatbot

Computing the match score

Rule format

Q&A test dataset


 

 

Building a Simple Chatbot from Scratch in Python (using NLTK)

https://github.com › parulnith › Building-...

Building a Simple Chatbot from Scratch in Python (using NLTK)

NLP

Import necessary libraries

Downloading and installing NLTK

Installing NLTK Packages

Reading in the corpus

Tokenisation

Preprocessing

Keyword matching

Generating Response

Bag of Words

TF-IDF Approach

Cosine Similarity
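The last three items in the outline above (bag of words, TF-IDF, cosine similarity) combine into a simple retrieval-style reply: pick the corpus sentence closest to the user's query. The sketch below uses only the standard library rather than NLTK. The corpus, the helper names, and the particular TF-IDF weighting `tf * (log(N / df) + 1)` are assumptions for illustration, not the linked repository's code.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, and strip simple punctuation."""
    tokens = (word.strip(".,!?") for word in text.lower().split())
    return [token for token in tokens if token]


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, weight = tf * (log(N / df) + 1)."""
    tokenized = [tokenize(doc) for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)  # bag of words for this document
        vectors.append({term: (count / len(tokens)) * (math.log(n_docs / df[term]) + 1.0)
                        for term, count in tf.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def respond(query, corpus):
    """Return the corpus sentence most similar to the query, or '' if nothing overlaps.

    Assumes a non-empty corpus.
    """
    vectors = tfidf_vectors(corpus + [query])
    query_vec = vectors[-1]
    scores = [cosine(query_vec, vec) for vec in vectors[:-1]]
    best = max(range(len(corpus)), key=lambda i: scores[i])
    return corpus[best] if scores[best] > 0 else ""


corpus = [  # illustrative corpus (assumption)
    "A chatbot is a program that simulates conversation.",
    "TF-IDF weighs a term by how rare it is across documents.",
    "Cosine similarity measures the angle between two vectors.",
]
print(respond("what is cosine similarity", corpus))
```

Returning an empty string when no term overlaps lets the caller fall back to a default reply such as "I don't understand".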

 


Day 12: Quickly building a "conversational bot" (ChatBot)

Day 13: Quickly building a "conversational bot" (ChatBot) -- continued




Search: anti-money-laundering model python (反洗錢 模型 python)

[E.SUN AI Case Study 3] An in-house anti-money-laundering blacklist detection model that quickly flags problems

https://www.ithome.com.tw › news

(Top 1% Solution) E.SUN AI Open Challenge 2020 Summer Competition - Medium

https://medium.com › 玉山人工智慧公開挑戰賽2020夏...

 

 

Search: fraud prevention model python (防詐欺模型 python)


Drilling against an AI that plays the bad guy! SAS reveals using GAN to design financial anti-fraud models: a new ...

https://www.ithome.com.tw › news

Step 1: Build an Amazon Fraud Detector model

https://pages.awscloud.com › Tech-blog_Amazon-Frau...

 

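Search: fraud python (詐欺 python)

Building a credit card fraud model in Python - 程式人生

https://www.796t.com › content

Data preparation: sourced from Kaggle

Prepare and take a first look at the dataset

Transaction frequency over time (fraudulent vs. normal)

Frequency distribution of transaction amounts for fraudulent and normal transactions

Relationship between each feature and the dependent variable

Modeling the credit card data with logistic regression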

 

 

Credit card fraud analysis: analyzing and handling imbalanced data, kernel translation, full version

https://medium.com › 機器學習知識歷程 › 信用卡詐騙...

Preprocessing

Scaling and Distributing

Splitting the Data (from the original DataFrame)

Random undersampling and oversampling

Distributing and Correlating

Anomaly Detection

Dimensionality Reduction and Clustering (t-SNE)

Classifiers

A Deeper Look into Logistic Regression

Oversampling with SMOTE

Testing

Test Data with Logistic Regression

Neural Networks Testing (Undersampling vs Oversampling)

 

 

Part 5. Imbalanced Data - iT 邦幫忙

https://ithelp.ithome.com.tw › articles

Evaluation metrics

A. Confusion Matrix

B. Precision and Recall

C. F1 score

D. ROC (Receiver Operating Characteristic) curve

The area under the curve is called the Area Under Curve (AUC)
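The metrics listed above can all be computed from a handful of counts, which makes it easy to see why accuracy alone misleads on imbalanced data. The following is a minimal standard-library sketch. The labels, scores, and the 0.5 threshold are made-up illustrative values, and the function names are assumptions rather than code from the linked article.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def precision_recall_f1(tp, fp, fn):
    """Precision, recall, and their harmonic mean (F1); 0.0 when undefined."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def auc(y_true, scores, positive=1):
    """AUC as the chance that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Illustrative data (assumption): 3 positive and 7 negative samples.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.3, 0.7, 0.2, 0.1, 0.4, 0.1, 0.2, 0.3]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # 0.5 threshold is an assumption

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
auc_value = auc(y_true, scores)
accuracy = (tp + tn) / len(y_true)
print(tp, fp, fn, tn, accuracy, precision, recall, f1, auc_value)
```

In this toy data the accuracy is 0.8 while precision and recall are both about 0.67, the kind of gap that widens as the positive class gets rarer.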

Resampling the data

A. Oversampling

A1. SMOTE (Synthetic Minority Oversampling Technique)

A2. Border Line SMOTE

B. Undersampling

B1. Tomek Link

B2. Edited Nearest Neighbor

Notes

A. Split the data first, then resample only the training data.

B. Cross-validation is commonly used to keep overfitting in check.

C. Examine how the minority and majority samples are distributed.
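Note A above is the one most often violated in practice: if duplicated or synthetic minority rows are created before the split, copies of the same sample can land in both the training and test sets and inflate the test score. Below is a minimal sketch of the intended order, using plain random oversampling in place of SMOTE to stay within the standard library. The toy rows, the 25% test ratio, and the function names are assumptions.

```python
import random
from collections import Counter


def train_test_split(rows, test_ratio=0.25, seed=0):
    """Shuffle a copy of the rows and cut off the first part as the test set."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def random_oversample(rows, seed=0):
    """Duplicate rows of smaller classes at random until every class matches the largest.

    Each row is a (features, label) pair.
    """
    rng = random.Random(seed)
    by_label = {}
    for row in rows:
        by_label.setdefault(row[1], []).append(row)
    target = max(len(group) for group in by_label.values())
    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        balanced.extend(rng.choice(group) for _ in range(target - len(group)))
    return balanced


# Toy data (assumption): 5 fraud rows (label 1) and 95 normal rows (label 0).
rows = [((i,), 1 if i < 5 else 0) for i in range(100)]

# 1) Split first, so the test set keeps the real class distribution.
train, test = train_test_split(rows)
# 2) Resample the training portion only.
balanced = random_oversample(train)

print("train:", Counter(label for _, label in train))
print("balanced train:", Counter(label for _, label in balanced))
print("test (untouched):", Counter(label for _, label in test))
```

Because the split is not stratified, a very rare class can end up entirely in one partition. A stratified split would be the natural next refinement.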



Search:  Credit Fraud || Dealing with Imbalanced Datasets

 

Credit Fraud || Dealing with Imbalanced Datasets - Kaggle

https://www.kaggle.com › janiobachmann

 

Credit Fraud || Dealing with Imbalanced Datasets

https://www.kaggle.com/janiobachmann/credit-fraud-dealing-with-imbalanced-datasets

 

https://www.kaggle.com/code/janiobachmann/credit-fraud-dealing-with-imbalanced-datasets/notebook

 

 

 


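Search: credit scoring python (信用評分 python)


Implementing a credit scorecard model in Python - GetIt01

https://www.getit01.com › ...


Developing a Python-based credit scoring model, with data and code - 古詩詞庫

https://www.gushiciku.cn › zh-tw

Project workflow

Data acquisition

Data preprocessing

Missing value handling

Outlier handling

Data split: training set and test set

Exploratory analysis

Variable selection

Binning

WOE

Correlation analysis and IV screening

Model analysis

WOE transformation

Logistic model building

Model validation

Credit scoring

Automatic scoring system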
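For the "Credit scoring" step in the workflow above, one widely used convention turns the model's good-to-bad odds into points with `score = offset + factor * ln(odds)`, where `factor = PDO / ln(2)` and PDO is the number of points that doubles the odds. The sketch below assumes that convention. The base score of 600, base odds of 50:1, and PDO of 20 are illustrative assumptions, and the linked article may use different constants or a different sign convention.

```python
import math


def scorecard_params(base_score=600.0, base_odds=50.0, pdo=20.0):
    """Return (factor, offset) so that base_odds maps to base_score
    and doubling the good:bad odds adds `pdo` points."""
    factor = pdo / math.log(2)
    offset = base_score - factor * math.log(base_odds)
    return factor, offset


def credit_score(p_bad, factor, offset):
    """Convert a predicted probability of default (0 < p_bad < 1) into a score."""
    odds_good = (1.0 - p_bad) / p_bad
    return offset + factor * math.log(odds_good)


factor, offset = scorecard_params()
score_at_base = credit_score(1.0 / 51.0, factor, offset)      # good:bad odds of 50:1
score_at_double = credit_score(1.0 / 101.0, factor, offset)   # good:bad odds of 100:1
print(round(score_at_base, 2), round(score_at_double, 2))
```

With a logistic model fitted on WOE-transformed variables, the same factor can be applied to each coefficient-times-WOE term to obtain per-bin points, which is what makes the final scorecard additive.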



Search:  GiveMeSomeCredit

jwu424/GiveMeSomeCredit - GitHub

https://github.com › jwu424 › GiveMeSo...

Credit limit: line of credit, credit line.

WOE (Weight of Evidence) is commonly used for feature transformation.

IV (Information Value), also called information amount, measures the predictive power of a feature.

1. WOE describes the relationship between a predictive variable and a binary target variable.

2. IV measures the strength of that relationship.
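The two definitions above become concrete once a variable is binned. One common convention is `WOE_i = ln(good_share_i / bad_share_i)` per bin, and `IV = sum((good_share_i - bad_share_i) * WOE_i)` over all bins. Some references flip the ratio to bad over good, which changes the sign of WOE but not IV. The sketch below assumes the good-over-bad convention and that every bin contains at least one good and one bad case. The bin counts are made-up illustrative numbers.

```python
import math


def woe_iv(bins):
    """Compute per-bin WOE and the total IV.

    bins: list of (good_count, bad_count) tuples, one per bin.
    Assumes every count is positive, since the log is undefined otherwise.
    """
    total_good = sum(good for good, _ in bins)
    total_bad = sum(bad for _, bad in bins)
    woes = []
    iv = 0.0
    for good, bad in bins:
        good_share = good / total_good   # share of all good cases in this bin
        bad_share = bad / total_bad      # share of all bad cases in this bin
        woe = math.log(good_share / bad_share)
        woes.append(woe)
        iv += (good_share - bad_share) * woe
    return woes, iv


# Illustrative bins (assumption), e.g. three ranges of one variable: (good, bad) counts.
bins = [(100, 40), (300, 40), (600, 20)]
woes, iv = woe_iv(bins)
print([round(w, 4) for w in woes], round(iv, 4))
```

Each term of the IV sum is non-negative, because the share difference and the WOE always have the same sign. IV is therefore zero only when the good and bad distributions across bins are identical.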



Search:  woe iv 

 

Some thoughts on understanding WOE and IV - 人人焦點

https://ppfocus.com › …

 

 

Search:  woe iv python


WOE and IV values | Python code

https://arsene5240.medium.com › woe值及iv值-python...

 


Search: precision marketing python (精準行銷 python)

Machine Learning Series 5: Who will sign up? Using a "precision marketing model" to evaluate what customers bring ...

https://medium.com › marketingdatascience › 機器學習...


Machine Learning x Precision Marketing KDD 2.0 process: [Internal data] practical case application (with ...

https://medium.com › marketingdatascience › 機器學習...




Search: customer segmentation python (客戶分群 python)

Implementing customer segmentation with the K-means clustering algorithm in Python - 程式人生

https://www.796t.com › article


Day 02: Customer Segmentation -- Which customers are VIPs?

Customer Segmentation -- Which customers are my VIPs? (continued)

https://ithelp.ithome.com.tw › articles



Search: text association python (文字關聯 python)

[Python data mining course] 24. KMeans text clustering analysis of 互動百科 ...

https://codertw.com › 程式語言


Python Big Data Analysis (2) - HackMD

https://hackmd.io › python-bigdata-02


Word cloud

Jieba

Word segmentation (tokenization)

Term frequency

Stop words
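The last three items above (segmentation, term frequency, stop words) are the steps that feed a word cloud: split text into tokens, drop the stop words, and count what remains. Below is a minimal standard-library sketch on English text. For Chinese text, whitespace splitting does not work, and a segmenter such as Jieba (listed above) would supply the tokens instead. The stop-word list and sample sentence are illustrative assumptions.

```python
from collections import Counter

# Tiny illustrative stop-word list (assumption); real lists are much longer.
STOP_WORDS = {"the", "a", "of", "and", "is", "to", "in"}


def term_frequencies(text, stop_words=STOP_WORDS):
    """Tokenize on whitespace, strip punctuation, drop stop words, and count terms."""
    tokens = [word.strip(".,!?;:").lower() for word in text.split()]
    return Counter(token for token in tokens if token and token not in stop_words)


text = "The model is trained on the text, and the text is split into tokens."
freqs = term_frequencies(text)
print(freqs.most_common(3))
```

A word cloud renderer then only needs the resulting term-to-count mapping to size each word.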



Search: Python text mining (Python 文字探勘)


Text Mining: representing text with numbers - Medium

https://medium.com › 企鵝也懂程式設計 › 文件探勘-te...

Text Mining & web crawler | Google News and article word clouds

https://jamleecute.web.app › 網路爬蟲-web-crawler-text...

 


Search: Python text mining applications (Python 文字探勘 應用)

[Python Machine Learning] Automatically classifying comments as positive or negative (using a BERT model) with ...

https://medium.com › python機器學習-google我的商...


Search: Python text mining techniques (Python 文字探勘 技術)

Python text mining: an introduction to the data preprocessing workflow

https://dannypheobe.blogspot.com › 2016/07 › python

 

 

10 Text Mining - LearnPython - GitBook

https://datasciencetw.gitbook.io › python › 10-text-mini...

 

 


