{ "cells": [ { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "# !pip install sentence-transformers==2.0.0" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "c:\\Users\\ardit\\AppData\\Local\\Programs\\Python\\Python39\\lib\\site-packages\\torch\\onnx\\_internal\\_beartype.py:30: UserWarning: module 'beartype.roar' has no attribute 'BeartypeDecorHintPep585DeprecationWarning'\n", " warnings.warn(f\"{e}\")\n" ] } ], "source": [ "import pandas as pd\n", "from tqdm import tqdm\n", "from sentence_transformers import SentenceTransformer\n", "\n", "model = SentenceTransformer('all-mpnet-base-v2') #all-MiniLM-L6-v2 #all-mpnet-base-v2" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "df = pd.read_parquet('df_encoded.parquet')\n", "df" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [], "source": [ "from sklearn.neighbors import NearestNeighbors\n", "import numpy as np\n", "import pandas as pd\n", "\n", "from sentence_transformers import SentenceTransformer\n", "\n", "model = SentenceTransformer('all-mpnet-base-v2') #all-MiniLM-L6-v2 #all-mpnet-base-v2\n", "\n", "#prepare model\n", "# nbrs = NearestNeighbors(n_neighbors=8, algorithm='ball_tree').fit(df['text_vector_'].values.tolist())" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "

|  | shortName | location | title | hourlyRate | avgFeedbackScore | description |
|---|---|---|---|---|---|---|
| 3761 | Steven F. | New York | Database Manger / Graphics / Social Media | 35.0 | 4.939631 | A highly skilled problem solver with over 10 ... |
| 835 | Jacquelyn N. | New York | Admin support specialist | 12.0 | 4.920992 | I am a Kansas City based Virtual Assistant. I ... |
| 2787 | Mark H. | New York | WordPress Specialist - Development, Administra... | 60.0 | 4.751762 | Top Rated Plus \| Specialize in WordPress \| Inv... |
| 3402 | Carleton C. | New York | Expert freelancer with skills in Divi theme, C... | 25.0 | 4.692159 | For over 30 years, I have developed a wide ran... |
| 1156 | Andee F. | New York | Experienced Freelancer | 15.0 | 4.645855 | I have 8+ years of successfully providing admi... |
| 1556 | Laura O. | New York | Admin Expert with experience in Microsoft Suit... | 30.0 | 4.620818 | I have been passionate about my personal budge... |
| 1002 | Nicole H. | New York | Experienced admin support and customer support... | 30.0 | 4.129972 | I'm an experienced jack of all trades. I have... |
| 1626 | Drew L. | New York | Front End Web Developer | 50.0 | 0.000000 | Worked for agency with big name clients doing ... |

\n", "