Phishing Detection Model (FastText)

This is a lightweight FastText model trained to classify domain names as either phishing or clean. It uses supervised learning with wordNgrams=2 for better n-gram feature coverage.

Installation

Option 1: From Source

git clone https://github.com/facebookresearch/fastText.git
cd fastText
mkdir build && cd build
cmake ..
make

Option 2: Using pip (limited support)

pip install fasttext

⚠️ The pip version does not support all features. Compiling from source is recommended.

Usage

# Predict a single domain
echo "carreeffoursa.site" | ./fasttext predict phishing_model.bin -

Training Info

  • Framework: FastText
  • Labels: __label__phishing, __label__clean
  • Epochs: 10
  • Learning rate: 0.5
  • wordNgrams: 2

πŸ“Š Training Data

The model was trained on mstfknn/phishing-domain-list-2m-plus, a dataset consisting of 2.000,000 domain names labeled as either phishing or clean.

Example

Input:

carreeffoursa.site

Output:

__label__phishing

License

MIT


πŸ”— Links

Downloads last month
25
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Dataset used to train mstfknn/phishing-fasttext-model