File size: 4,768 Bytes
aed5dfc
135096f
69d3ad3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
aed5dfc
 
 
673c126
 
 
 
69d3ad3
aed5dfc
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
673c126
 
aed5dfc
c0bf361
9bb90ac
c0bf361
e697a24
 
aed5dfc
e697a24
64a9914
 
 
 
 
 
 
ac17402
bde2b28
 
2485d30
8448181
2485d30
96a133c
2485d30
 
 
 
 
 
 
 
 
 
 
 
aed5dfc
e697a24
aed5dfc
 
 
8448181
 
2485d30
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8852942
2485d30
 
 
 
8852942
69d3ad3
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
---
widget:
- text: |2-
     John Doe
     123 Main Street, Cityville, CA 12345
     johndoe@email.com
     (555) 123-4567
     linkedin.com/in/johndoe
     Professional Summary
     Experienced and results-driven Data Scientist with a strong background in statistical analysis, machine learning, and data visualization. Proven track record of delivering actionable insights and driving data-driven decision-making processes. Adept at leveraging advanced analytics to solve complex business problems.

    Education
     Master of Science in Data Science
     ABC University, Cityville, CA
     May 2021
     
     Bachelor of Science in Computer Science
     XYZ University, Townsville, CA
     Graduation Date: May 2018
     
     Professional Experience
     Data Scientist | Tech Innovators Inc., Cityville, CA | June 2021 - Present
     Lead data analysis projects, extracting valuable insights to inform business strategies.
     Develop and deploy machine learning models to optimize key processes, resulting in a 15% increase in efficiency.
     Collaborate with cross-functional teams to design and implement data-driven solutions.
     Utilize Python, R, and SQL for data extraction, transformation, and analysis.
     Create compelling data visualizations to communicate findings to non-technical stakeholders.

    Data Analyst | Data Solutions Co., Townsville, CA | January 2019 - May 2021
     Conducted exploratory data analysis to identify trends, patterns, and anomalies.
     Implemented data cleaning and preprocessing techniques to ensure data quality.
     Produced comprehensive reports and dashboards, aiding in executive decision-making.
     Collaborated with business units to define and refine analytical requirements.

    Skills
     Programming Languages: Python, R
     Data Analysis Tools: Pandas, NumPy
     Machine Learning: Scikit-Learn, TensorFlow, Keras
     Database Management: SQL
     Data Visualization: Matplotlib, Seaborn
     Statistical Analysis: Hypothesis testing, Regression analysis
     Communication: Strong written and verbal communication skills

    Certifications
     Certified Data Scientist (CDS)
     Machine Learning Specialist Certification
     
tags:
- spacy
- token-classification
- cv
- resume parsing
- resume extraction
- named entity recognition
- resume
language:
- en
model-index:
- name: en_cv_info_extr
  results:
  - task:
      name: NER
      type: token-classification
    metrics:
    - name: NER Precision
      type: precision
      value: 0.8333333333
    - name: NER Recall
      type: recall
      value: 0.8067729084
    - name: NER F Score
      type: f_score
      value: 0.8198380567
library_name: spacy
pipeline_tag: token-classification
---

# Information extraction from Resumes/CVs written in English

### Model Description
This model is designed for information extraction from resumes/CVs written in English. It employs a transformer-based architecture with spaCy for named entity recognition (NER) tasks. The model aims to parse various sections of resumes, including personal details, education history, professional experience, skills, and certifications, enabling users to extract structured information for further processing or analysis.

### Model Details
| Feature | Description |
| --- | --- |
| `Language` | English |
| `Task` | Named Entity Recognition (NER) |
| `Objective` | Information extraction from resumes/CVs |
| `Spacy Components` | Transformer, Named Entity Recognition (NER) |
| `Author` | [Youssef Chafiqui](https://huggingface.co/ychafiqui) |

### NER Entities
The model recognizes various entities corresponding to different sections of a resume. Below are the entities used by the model:
| Label | Description |
| --- | --- |
| 'FNAME' | First name |
| 'LNAME' | Last name |
| 'ADDRESS' | Address |
| 'CERTIFICATION' | Certification |
| 'EDUCATION' | Education section |
| 'EMAIL' | Email address |
| 'EXPERIENCE' | Experience section |
| 'HOBBY' | Hobby |
| 'HSKILL' | Hard skill |
| 'LANGUAGE' | Language |
| 'PHONE' | Phone number |
| 'PROFILE' | Profile |
| 'PROJECT' | Project section |
| 'SSKILL' | Soft skill |

### Evaluation Metrics

| Type | Score |
| --- | --- |
| `F1 score` | 81.98 |
| `Precision` | 83.33 |
| `Recall` | 80.68 |

## Usage
### Presequities
Install spaCy library
```bash
pip install spacy
```

Install Transformers library
```bash
pip install transformers
```

Download the model
```bash
pip install https://huggingface.co/ychafiqui/en_cv_info_extr/resolve/main/en_cv_info_extr-any-py3-none-any.whl
```

### Load the model
```python
import spacy
nlp = spacy.load("en_cv_info_extr")
```

### Inference using the model
```python
doc = nlp('put your resume here')

for ent in doc.ents:
  print(ent.text, "-", ent.label_)
```