Review Your NLP Knowledge

Review Your NLP Knowledge

1. Abbreviated Words in NLP:

  • LSTM: Long Short Term Memory
  • Bert: Bidirectional Encoder Representations from Transformers.
  • POS: parts of speech.
  • DTM: Document Term Matrix.
  • NER: name entity recognition.
  • NLG: Natural Language Generation.
  • NLU: Natural Language Understanding.
  • TF IDF: Term Frequency–Inverse Document Frequency.
  • re: Regular expression.
  • LDA: Latent Dirichlet Allocation.
  • LSI: Latent Semantic Indexing.
  • NMF: Non-Negative Matrix Factorization.
  • NLTK: Natural Language Toolkit

2. Some Common Steps for NLP Problems:

  • Sentence Segmentation: break the text apart into separate sentences
  • Tokenization: split Sentence to words
  • Stemming: process of reducing words to their word stem for example thinking→ think
  • Lemmatizing: for example worse→ bad
  • POS tags: Predicting Parts of Speech for Each Token
  • Identifying Stop Words: like “and”, “the”
  • Name entity recognition: detect nouns with the real world concepts.
  • Text classification
  • Chunking
  • Coreference resolution

3. Applications of NLP in The Real World:

  • Personal assistant applications
  • Fighting spam
  • Chatbots
  • Managing the Advertisement
  • Sentiment analysis
  • Text classification
  • Text summarization
  • Toxicity Classification
  • Name entity recognization
  • Part of speech tagging
  • Language model building
  • Machine translation
  • Spell checking
  • Speech recognition
  • Character recognition

4. Python Library for NLP:

5. A few terms in NLP:

  • Stop words
  • Punctuation
  • Word embedding
  • Word segmentation
  • Text summarization
  • Regular expression
  • Morphological segmentation
  • Named entity recognition
  • Corpus: A collection of texts
  • Document-Term Matrix
  • n-gram: tokenize sentences by n words combination
  • Latent Dirichlet Allocation: a technique for topic modelling.

6. Word Embedding Libraries:

  • Word2vec
  • Glove
  • Fasttext
  • Genism

7. Some Useful links for Learning NLP:

8. NLP Engineer Interview Question:

9. Great Tutorials for NLTK & spaCy:

10. Some Great Topics in Kaggle

It’s a good idea to read the following topics because you can review almost all the issues that are relevant to this competition:

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章