wav2vec2-large-mms-1b-DZ

This model is a fine-tuned version of facebook/mms-1b-all on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.1983
  • Wer: 0.1456
  • Bleu: {'bleu': 0.7492044741492486, 'precisions': [0.8730063030967389, 0.7883760255527025, 0.7152661166678835, 0.6510430304617264], 'brevity_penalty': 0.9957339815437508, 'length_ratio': 0.995743055176554, 'translation_length': 18245, 'reference_length': 18323}
  • Rouge: {'rouge1': 0.8748364322650914, 'rouge2': 0.7928723605220098, 'rougeL': 0.8745015805780032, 'rougeLsum': 0.8743113489400196}

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 8
  • eval_batch_size: 16
  • seed: 42
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 32
  • optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • num_epochs: 40
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Wer Bleu Rouge
5.9588 1.0 285 2.7873 0.9827 {'bleu': 0.0, 'precisions': [0.059154929577464786, 0.0, 0.0, 0.0], 'brevity_penalty': 0.11433624320676729, 'length_ratio': 0.3155956311912624, 'translation_length': 2485, 'reference_length': 7874} {'rouge1': 0.03875997597517414, 'rouge2': 0.0, 'rougeL': 0.03892048827301332, 'rougeLsum': 0.038793322006122906}
0.4602 2.0 570 0.3555 0.2311 {'bleu': 0.6283054199052752, 'precisions': [0.7993283417749394, 0.6805363212891854, 0.5854142185663925, 0.5068155835018908], 'brevity_penalty': 0.9912846225149247, 'length_ratio': 0.9913223817060525, 'translation_length': 18164, 'reference_length': 18323} {'rouge1': 0.7917720369498722, 'rouge2': 0.6755927340957304, 'rougeL': 0.7912377266650397, 'rougeLsum': 0.7913035186420103}
0.3842 3.0 855 0.3058 0.2090 {'bleu': 0.6605337548855493, 'precisions': [0.8175150519978106, 0.7074787393696849, 0.616208731142045, 0.5403591352859135], 'brevity_penalty': 0.99710327314591, 'length_ratio': 0.9971074605686842, 'translation_length': 18270, 'reference_length': 18323} {'rouge1': 0.8146357732958056, 'rouge2': 0.7071194587560558, 'rougeL': 0.814052758845498, 'rougeLsum': 0.8138241815295024}
0.3719 4.0 1140 0.2868 0.2010 {'bleu': 0.6712219662583173, 'precisions': [0.8250068549492734, 0.7188067932568779, 0.6293752283522105, 0.5544580419580419], 'brevity_penalty': 0.9951857415819062, 'length_ratio': 0.9951972930197021, 'translation_length': 18235, 'reference_length': 18323} {'rouge1': 0.8222029713967768, 'rouge2': 0.7197657000623603, 'rougeL': 0.8217923739918656, 'rougeLsum': 0.821719465931791}
0.3336 5.0 1425 0.2742 0.1959 {'bleu': 0.6785372107003715, 'precisions': [0.8300621254604431, 0.7259128904531457, 0.6393695014662757, 0.5666900666900667], 'brevity_penalty': 0.9926599804903764, 'length_ratio': 0.9926867870981826, 'translation_length': 18189, 'reference_length': 18323} {'rouge1': 0.8265645999560629, 'rouge2': 0.7247965384723267, 'rougeL': 0.8259582425920233, 'rougeLsum': 0.8259017198295564}
0.3442 6.0 1710 0.2642 0.1903 {'bleu': 0.6873788245362181, 'precisions': [0.832968476357268, 0.7314617981743153, 0.6461886022445708, 0.5733948950256991], 'brevity_penalty': 0.9972127357870568, 'length_ratio': 0.9972166130000546, 'translation_length': 18272, 'reference_length': 18323} {'rouge1': 0.8325100938158259, 'rouge2': 0.7340864170622121, 'rougeL': 0.83191579042451, 'rougeLsum': 0.8318541463934617}
0.3341 7.0 1995 0.2582 0.1843 {'bleu': 0.6963666431657629, 'precisions': [0.839673317254988, 0.7409495177251659, 0.6574662285505659, 0.5849270806043141], 'brevity_penalty': 0.9956791710107048, 'length_ratio': 0.9956884789608689, 'translation_length': 18244, 'reference_length': 18323} {'rouge1': 0.838537901118194, 'rouge2': 0.7427526814996277, 'rougeL': 0.8379557459558569, 'rougeLsum': 0.8379007055417265}
0.3172 8.0 2280 0.2494 0.1804 {'bleu': 0.7001144932835086, 'precisions': [0.843205001645278, 0.7451742291301078, 0.6613327487943884, 0.5895822408669813], 'brevity_penalty': 0.9951309011278089, 'length_ratio': 0.9951427168040168, 'translation_length': 18234, 'reference_length': 18323} {'rouge1': 0.8428458068475617, 'rouge2': 0.7473487117700717, 'rougeL': 0.8422200154587571, 'rougeLsum': 0.8420994714247132}
0.294 9.0 2565 0.2444 0.1784 {'bleu': 0.7015679264162828, 'precisions': [0.8458917284968398, 0.7474398441917447, 0.6644683813292299, 0.5931088900578643], 'brevity_penalty': 0.9929897872850882, 'length_ratio': 0.9930142443922938, 'translation_length': 18195, 'reference_length': 18323} {'rouge1': 0.8447332085359476, 'rouge2': 0.7490673320409132, 'rougeL': 0.8441531451631874, 'rougeLsum': 0.843987486458086}
0.2927 10.0 2850 0.2395 0.1734 {'bleu': 0.7094676103718882, 'precisions': [0.8488569705608245, 0.7529912923635909, 0.6714379610019718, 0.6010480349344978], 'brevity_penalty': 0.9955147214626567, 'length_ratio': 0.9955247503138133, 'translation_length': 18241, 'reference_length': 18323} {'rouge1': 0.8488302350839227, 'rouge2': 0.7554473763418516, 'rougeL': 0.848226337259566, 'rougeLsum': 0.8481517731134592}
0.2974 11.0 3135 0.2361 0.1757 {'bleu': 0.7073575226963291, 'precisions': [0.8489256198347107, 0.7540322580645161, 0.6735038964858109, 0.6032737833318665], 'brevity_penalty': 0.9905136020383278, 'length_ratio': 0.9905583146864596, 'translation_length': 18150, 'reference_length': 18323} {'rouge1': 0.8469188343317979, 'rouge2': 0.753826185237243, 'rougeL': 0.8462106498895938, 'rougeLsum': 0.8462690600047347}
0.2929 12.0 3420 0.2323 0.1710 {'bleu': 0.7104887814499381, 'precisions': [0.8529363110008271, 0.7578987198082866, 0.6768970339294914, 0.6069980609906575], 'brevity_penalty': 0.9896868547353578, 'length_ratio': 0.9897396714511816, 'translation_length': 18135, 'reference_length': 18323} {'rouge1': 0.851535800343566, 'rouge2': 0.7594371464363054, 'rougeL': 0.8507125679902645, 'rougeLsum': 0.8505546552290876}
0.2922 13.0 3705 0.2277 0.1690 {'bleu': 0.7147023714998528, 'precisions': [0.8544513001322168, 0.7611188106337408, 0.6811967068509261, 0.6115804294262583], 'brevity_penalty': 0.9906237838889417, 'length_ratio': 0.99066746711783, 'translation_length': 18152, 'reference_length': 18323} {'rouge1': 0.8532183741925291, 'rouge2': 0.7629739149453718, 'rougeL': 0.8526967418105296, 'rougeLsum': 0.8525344290554377}
0.2723 14.0 3990 0.2255 0.1665 {'bleu': 0.7185286342228578, 'precisions': [0.8572294801830108, 0.7651768265775705, 0.6859412933127345, 0.6166784078901022], 'brevity_penalty': 0.9900176348680043, 'length_ratio': 0.9900671287452928, 'translation_length': 18141, 'reference_length': 18323} {'rouge1': 0.8554844372155834, 'rouge2': 0.7665585766259185, 'rougeL': 0.8550678894072193, 'rougeLsum': 0.8549249072117735}
0.2618 15.0 4275 0.2216 0.1647 {'bleu': 0.7205920025415866, 'precisions': [0.8575904144223371, 0.7653304850464941, 0.6855488787923201, 0.6164635749978084], 'brevity_penalty': 0.9929348269851479, 'length_ratio': 0.9929596681766086, 'translation_length': 18194, 'reference_length': 18323} {'rouge1': 0.8572991962911986, 'rouge2': 0.7687939439030189, 'rougeL': 0.8569135698404339, 'rougeLsum': 0.8567069159809071}
0.2824 16.0 4560 0.2194 0.1625 {'bleu': 0.7248245517316763, 'precisions': [0.859585463741822, 0.7690277166739992, 0.6911516750971336, 0.6221929824561403], 'brevity_penalty': 0.9926599804903764, 'length_ratio': 0.9926867870981826, 'translation_length': 18189, 'reference_length': 18323} {'rouge1': 0.860740052677103, 'rouge2': 0.7738344427906813, 'rougeL': 0.8602549832284943, 'rougeLsum': 0.8601224055557424}
0.2584 17.0 4845 0.2184 0.1624 {'bleu': 0.7233957743502292, 'precisions': [0.8591843464878531, 0.7673410404624278, 0.689066393082222, 0.6201332865661171], 'brevity_penalty': 0.9929348269851479, 'length_ratio': 0.9929596681766086, 'translation_length': 18194, 'reference_length': 18323} {'rouge1': 0.8596520621812659, 'rouge2': 0.7716232895345339, 'rougeL': 0.8592810276384565, 'rougeLsum': 0.85911704018123}
0.2626 18.0 5130 0.2162 0.1575 {'bleu': 0.7321328546559496, 'precisions': [0.8624205218153914, 0.7733308280095202, 0.6966267523364486, 0.6292125021826436], 'brevity_penalty': 0.9956791710107048, 'length_ratio': 0.9956884789608689, 'translation_length': 18244, 'reference_length': 18323} {'rouge1': 0.8630838420026828, 'rouge2': 0.7773667756363078, 'rougeL': 0.8628126973557786, 'rougeLsum': 0.8627419660979692}
0.2518 19.0 5415 0.2132 0.1577 {'bleu': 0.7305988372304458, 'precisions': [0.8624547548535703, 0.7728127350213086, 0.6953821423352331, 0.6268461067901774], 'brevity_penalty': 0.9951309011278089, 'length_ratio': 0.9951427168040168, 'translation_length': 18234, 'reference_length': 18323} {'rouge1': 0.8638253584058928, 'rouge2': 0.7773261199114829, 'rougeL': 0.8632427380502744, 'rougeLsum': 0.8631565081641605}
0.2624 20.0 5700 0.2114 0.1561 {'bleu': 0.7336678313741997, 'precisions': [0.8624801706690006, 0.7734799725051553, 0.6966431224058837, 0.6291880602210426], 'brevity_penalty': 0.9977051698422599, 'length_ratio': 0.9977077989412214, 'translation_length': 18281, 'reference_length': 18323} {'rouge1': 0.8643711864157274, 'rouge2': 0.7789576656096648, 'rougeL': 0.864045843841154, 'rougeLsum': 0.8639432546192249}
0.274 21.0 5985 0.2125 0.1572 {'bleu': 0.7327413298256028, 'precisions': [0.8631249656989188, 0.7743210186288654, 0.6986762232136327, 0.6313303594856993], 'brevity_penalty': 0.9944177028031576, 'length_ratio': 0.9944332260001092, 'translation_length': 18221, 'reference_length': 18323} {'rouge1': 0.8637561560608497, 'rouge2': 0.7775626243197682, 'rougeL': 0.8633528770604967, 'rougeLsum': 0.8632085396165139}
0.2484 22.0 6270 0.2091 0.1552 {'bleu': 0.7357839881765593, 'precisions': [0.8648885962023927, 0.7774711490215755, 0.7019160450489981, 0.634892872759073], 'brevity_penalty': 0.9944725821787992, 'length_ratio': 0.9944878022157944, 'translation_length': 18222, 'reference_length': 18323} {'rouge1': 0.8659727708387619, 'rouge2': 0.7816164515395694, 'rougeL': 0.8657951715895931, 'rougeLsum': 0.8655840959367318}
0.2473 23.0 6555 0.2090 0.1554 {'bleu': 0.7356730614040227, 'precisions': [0.8642543859649123, 0.7763438165643403, 0.7012123867952089, 0.6340143181421337], 'brevity_penalty': 0.9954598989631315, 'length_ratio': 0.995470174098128, 'translation_length': 18240, 'reference_length': 18323} {'rouge1': 0.865456270958754, 'rouge2': 0.7804951440612873, 'rougeL': 0.8652647943705735, 'rougeLsum': 0.8649670956520807}
0.2418 24.0 6840 0.2055 0.1553 {'bleu': 0.7347926831245415, 'precisions': [0.8653296703296703, 0.7771008667252858, 0.7016554351010841, 0.634770417104802], 'brevity_penalty': 0.9932645437985937, 'length_ratio': 0.9932871254707198, 'translation_length': 18200, 'reference_length': 18323} {'rouge1': 0.8654752140097928, 'rouge2': 0.7807218611868961, 'rougeL': 0.8652326088546789, 'rougeLsum': 0.8650739915057691}
0.2577 25.0 7125 0.2040 0.1515 {'bleu': 0.7395907984479824, 'precisions': [0.8676938260774207, 0.7802005012531328, 0.7051132213294375, 0.6386026200873363], 'brevity_penalty': 0.9953502449878465, 'length_ratio': 0.9953610216667577, 'translation_length': 18238, 'reference_length': 18323} {'rouge1': 0.8685153389774867, 'rouge2': 0.7838568977155047, 'rougeL': 0.8683321242058704, 'rougeLsum': 0.8680531173155077}
0.2478 26.0 7410 0.2035 0.1523 {'bleu': 0.7389574395410246, 'precisions': [0.8673581385138843, 0.7797917711991972, 0.7048413046657891, 0.639496239286339], 'brevity_penalty': 0.9944725821787992, 'length_ratio': 0.9944878022157944, 'translation_length': 18222, 'reference_length': 18323} {'rouge1': 0.8684026564775361, 'rouge2': 0.7837147217327451, 'rougeL': 0.8680473992168228, 'rougeLsum': 0.86781442567829}
0.2345 27.0 7695 0.2036 0.1525 {'bleu': 0.7394120868432645, 'precisions': [0.8659697898423818, 0.7782293360010004, 0.7032206353832702, 0.6378124183575721], 'brevity_penalty': 0.9972127357870568, 'length_ratio': 0.9972166130000546, 'translation_length': 18272, 'reference_length': 18323} {'rouge1': 0.8680049443409963, 'rouge2': 0.7830505254725744, 'rougeL': 0.8677384137559521, 'rougeLsum': 0.8675358853961506}
0.2398 28.0 7980 0.2025 0.1513 {'bleu': 0.7410617179914926, 'precisions': [0.8681987713909609, 0.7813087626927416, 0.7069570301081555, 0.6415836392239119], 'brevity_penalty': 0.9950212112404683, 'length_ratio': 0.9950335643726465, 'translation_length': 18232, 'reference_length': 18323} {'rouge1': 0.8691516758267378, 'rouge2': 0.7855889417660622, 'rougeL': 0.8688010484175208, 'rougeLsum': 0.8685495828117757}
0.2595 29.0 8265 0.2005 0.1506 {'bleu': 0.7408702837830967, 'precisions': [0.8690567695179532, 0.7818775100401606, 0.7071564466559345, 0.6421881838074398], 'brevity_penalty': 0.9940334633019136, 'length_ratio': 0.9940511924903127, 'translation_length': 18214, 'reference_length': 18323} {'rouge1': 0.8705678413238229, 'rouge2': 0.7865174894976992, 'rougeL': 0.8703173243712404, 'rougeLsum': 0.8700254528394962}
0.224 30.0 8550 0.2009 0.1487 {'bleu': 0.7450849356742056, 'precisions': [0.8704333516182118, 0.7844784353059178, 0.7112995176143838, 0.6476140534871526], 'brevity_penalty': 0.9949115093798545, 'length_ratio': 0.994924411941276, 'translation_length': 18230, 'reference_length': 18323} {'rouge1': 0.8713511668656566, 'rouge2': 0.7878011114559871, 'rougeL': 0.8710130200505186, 'rougeLsum': 0.8707877973869348}
0.2218 31.0 8835 0.2005 0.1492 {'bleu': 0.7446394749315683, 'precisions': [0.8707180500658761, 0.7851047810264776, 0.7116622768510389, 0.6470073503675183], 'brevity_penalty': 0.9941432609952691, 'length_ratio': 0.9941603449216831, 'translation_length': 18216, 'reference_length': 18323} {'rouge1': 0.871600980620826, 'rouge2': 0.7896333858185615, 'rougeL': 0.8713249999389157, 'rougeLsum': 0.8710035822779216}
0.2249 32.0 9120 0.2002 0.1496 {'bleu': 0.7428408162780507, 'precisions': [0.8707584007039543, 0.7850361521534108, 0.7109644297763109, 0.646134947793279], 'brevity_penalty': 0.9923300656862052, 'length_ratio': 0.9923593298040714, 'translation_length': 18183, 'reference_length': 18323} {'rouge1': 0.871521799540266, 'rouge2': 0.7886835586110674, 'rougeL': 0.8711811073148175, 'rougeLsum': 0.8711031407486098}
0.2258 33.0 9405 0.2008 0.1489 {'bleu': 0.7436253569604943, 'precisions': [0.8706385780118499, 0.7836990595611285, 0.7097222222222223, 0.6447552447552447], 'brevity_penalty': 0.9948017955446742, 'length_ratio': 0.9948152595099056, 'translation_length': 18228, 'reference_length': 18323} {'rouge1': 0.8721419286921401, 'rouge2': 0.7882963455000708, 'rougeL': 0.8717333624704182, 'rougeLsum': 0.8716064100420517}
0.2301 34.0 9690 0.1997 0.1469 {'bleu': 0.7477380046910622, 'precisions': [0.871839868384974, 0.78724070940653, 0.7143274640169504, 0.6500393116100288], 'brevity_penalty': 0.9951857415819062, 'length_ratio': 0.9951972930197021, 'translation_length': 18235, 'reference_length': 18323} {'rouge1': 0.8733291718803134, 'rouge2': 0.7912828262461826, 'rougeL': 0.8730615834810505, 'rougeLsum': 0.8729508479014352}
0.2352 35.0 9975 0.1989 0.1474 {'bleu': 0.746786327866007, 'precisions': [0.8721523851347642, 0.7870004391743523, 0.7138049601287585, 0.6497506343512118], 'brevity_penalty': 0.9941981553480093, 'length_ratio': 0.9942149211373683, 'translation_length': 18217, 'reference_length': 18323} {'rouge1': 0.873008906192879, 'rouge2': 0.790428630395299, 'rougeL': 0.8727160101378854, 'rougeLsum': 0.872513685033558}
0.2368 36.0 10260 0.1987 0.1481 {'bleu': 0.7450258680082317, 'precisions': [0.8723720418271876, 0.7871255977850491, 0.7142123036264866, 0.6497408416059035], 'brevity_penalty': 0.9916148795787761, 'length_ratio': 0.9916498390001637, 'translation_length': 18170, 'reference_length': 18323} {'rouge1': 0.872557108952186, 'rouge2': 0.7899616699266603, 'rougeL': 0.8723067110327652, 'rougeLsum': 0.8722357990611533}
0.2387 37.0 10545 0.1987 0.1472 {'bleu': 0.7461895367214819, 'precisions': [0.871914426769062, 0.7861083249749248, 0.7125420260195878, 0.6478762454116413], 'brevity_penalty': 0.9949115093798545, 'length_ratio': 0.994924411941276, 'translation_length': 18230, 'reference_length': 18323} {'rouge1': 0.8734065376381056, 'rouge2': 0.7908717014566278, 'rougeL': 0.8730576045837826, 'rougeLsum': 0.8729880391818262}
0.2198 38.0 10830 0.1984 0.1460 {'bleu': 0.7480290753832786, 'precisions': [0.8731273665148439, 0.7880840388836626, 0.7149542961608775, 0.6505465675557499], 'brevity_penalty': 0.9945274585595061, 'length_ratio': 0.9945423784314795, 'translation_length': 18223, 'reference_length': 18323} {'rouge1': 0.8746790367634547, 'rouge2': 0.7924947185694857, 'rougeL': 0.8743742574864739, 'rougeLsum': 0.8741497830000428}
0.2261 39.0 11115 0.1981 0.1461 {'bleu': 0.7482856703011707, 'precisions': [0.8729975861312267, 0.7880877742946708, 0.7149853801169591, 0.6507867132867133], 'brevity_penalty': 0.9948017955446742, 'length_ratio': 0.9948152595099056, 'translation_length': 18228, 'reference_length': 18323} {'rouge1': 0.8746130363164816, 'rouge2': 0.7924162570451201, 'rougeL': 0.8742493430241225, 'rougeLsum': 0.8740282212067312}
0.2274 39.8604 11360 0.1983 0.1456 {'bleu': 0.7492044741492486, 'precisions': [0.8730063030967389, 0.7883760255527025, 0.7152661166678835, 0.6510430304617264], 'brevity_penalty': 0.9957339815437508, 'length_ratio': 0.995743055176554, 'translation_length': 18245, 'reference_length': 18323} {'rouge1': 0.8748364322650914, 'rouge2': 0.7928723605220098, 'rougeL': 0.8745015805780032, 'rougeLsum': 0.8743113489400196}

Framework versions

  • Transformers 4.49.0
  • Pytorch 2.6.0+cu124
  • Datasets 3.2.0
  • Tokenizers 0.21.0
Downloads last month
59
Safetensors
Model size
965M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ilyes25/wav2vec2-large-mms-1b-DZ

Finetuned
(268)
this model

Space using ilyes25/wav2vec2-large-mms-1b-DZ 1