piliguori commited on
Commit
7d74feb
Β·
verified Β·
1 Parent(s): 92d01da

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -9
README.md CHANGED
@@ -33,17 +33,16 @@ This organization hosts resources from multiple research projects and publicatio
33
 
34
  ### Datasets for Security & Software Engineering
35
  - **PyResBugs** – 5,007 residual Python bugs with NL descriptions
36
- - **Shellcode_IA32** – The largest curated dataset of IA-32 shellcode snippets (3 versions)
37
  - **PoisonPy** – Dataset supporting targeted data-poisoning attacks
38
  - **Human vs AI Code** – Defects, vulnerabilities, and complexity analysis at scale
39
- - **EVIL datasets** – Exploit generation datasets (assembly & Python)
40
 
41
- ### Robustness, Poisoning & Exploit Generation
42
- - **Offensive Code Generation Robustness** – Data augmentation framework
43
- - **Context-Aware Exploits** – Benchmark for NL-to-exploit generation
44
- - **AI Code Generator Poisoning** – Targeted poisoning pipelines and evaluation
45
 
46
- All repositories include code, experimental scripts, datasets, and reproducibility materials.
47
 
48
  ---
49
 
@@ -52,7 +51,7 @@ All repositories include code, experimental scripts, datasets, and reproducibili
52
  Our work spans four interconnected areas:
53
 
54
  1. **Security of AI-generated Code**
55
- Vulnerability detection, automated patching, exploit generation, robustness testing.
56
 
57
  2. **Trustworthy LLM Evaluation**
58
  Correctness, equivalence checking, symbolic execution, reproducible benchmarks.
@@ -78,7 +77,6 @@ A non-exhaustive list includes works presented at:
78
  - **Information and Software Technology (IST)**
79
  - **Automated Software Engineering (AUSE)**
80
  - **Journal of Systems and Software (JSS)**
81
- - **NLP4Prog Workshop**
82
 
83
  Full references are available inside each corresponding repository.
84
 
 
33
 
34
  ### Datasets for Security & Software Engineering
35
  - **PyResBugs** – 5,007 residual Python bugs with NL descriptions
36
+ - **Shellcode_IA32** – The largest curated dataset of IA-32 shellcode snippets
37
  - **PoisonPy** – Dataset supporting targeted data-poisoning attacks
38
  - **Human vs AI Code** – Defects, vulnerabilities, and complexity analysis at scale
 
39
 
40
+ ### Robustness, Data Quality & Industrial Code Generation
41
+ - **Residual Bug Generation from Natural Language** – Frameworks for generating realistic residual defects from NL descriptions
42
+ - **Impact of Data Quality on Code Models** – Empirical studies on robustness, poisoning resilience, and dataset quality
43
+ - **Industrial Code Generation** – Models for domain-specific code synthesis (e.g., VHDL generation from natural language)
44
 
45
+ Our repositories include code, experimental scripts, datasets, and reproducibility materials.
46
 
47
  ---
48
 
 
51
  Our work spans four interconnected areas:
52
 
53
  1. **Security of AI-generated Code**
54
+ Vulnerability detection, automated patching, exploit generation, and robustness testing.
55
 
56
  2. **Trustworthy LLM Evaluation**
57
  Correctness, equivalence checking, symbolic execution, reproducible benchmarks.
 
77
  - **Information and Software Technology (IST)**
78
  - **Automated Software Engineering (AUSE)**
79
  - **Journal of Systems and Software (JSS)**
 
80
 
81
  Full references are available inside each corresponding repository.
82