--- license: apache-2.0 tags: - Token Classification widget: - text: >- The following is a bubble sort implementation taken from TeamTest57/Whack-A-Mole on github. int iro = 0; int score = 0; void bubble_sort() { int i, j; for (i = 0; i < mole_num - 1; i++) for (j = mole_num - 1; j >= i + 1; j--) if (hole_y[j] < hole_y[j - 1]) { int temp; temp = hole_y[j]; hole_y[j] = hole_y[j - 1]; hole_y[j - 1] = temp; temp = hole_x[j]; hole_x[j] = hole_x[j - 1]; hole_x[j - 1] = temp; } } example_title: example 1 - text: >- # Sample animal inherits from custom metaclass class Panda(metaclass=CustomMeta): """I bet you see this docstring printed as well""" fav_food = "Bamboo" loves_code = True def activity(self): print("Zzz...") This programming code was taken from cyberpanda/PythonStuff on GitHub and is cc0-licensed. It defines a class with member variables and methods. example_title: example 2 --- This is a distilbert-base-multilingual-cased-Model fine-tuned with a NER objective to tag tokens based on whether they belong to a code block or natural language text. The dataset of 78210 examples was generated by randomly combining code and text blocks from other permissively-licensed datasets, with some examples containing only code and some only regular text. The model achieves the following stats on the validation set: | Metric | Value | |--------------|-----------| | Loss | 0.0788 | | F1 Score | 0.8619 | | Precision | 0.8362 | | Recall | 0.8893 | | Accuracy | 0.9792 |