frankliu666 commited on
Commit
f191bae
1 Parent(s): df19c59

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -3
README.md CHANGED
@@ -1,3 +1,42 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data
2
+
3
+ Paper: https://arxiv.org/abs/2401.13223
4
+
5
+ Code: https://github.com/fengbinzhu/TAT-LLM
6
+
7
+
8
+ ## Introduction
9
+
10
+ We present TAT-LLM, a specialized language model crafted through the innovative Step-wise Pipeline approach, focusing on the nuanced realm of tabular and textual question answering (QA). This model is the fruit of rigorously fine-tuning the LLaMA 2 architecture with a novel dataset, autonomously generated from expertly annotated resources. TAT-LLM stands at the intersection of tabular comprehension and textual analysis, engineered to excel by embodying three fundamental phases: Extraction, Reasoning, and Execution. Our empirical findings illuminate TAT-LLM's remarkable capability to eclipse traditional benchmarks, surmounting even the most advanced models and colossal language models such as GPT-4 across a suite of demanding QA tasks like FinQA, TAT-QA, and TAT-DQA. This endeavor not only sets a new standard for task-specific language models but also paves the way for future explorations in optimizing smaller models for highly specialized functions.
11
+
12
+ | Model | Size | FINQA | TATQA | TATDQA |
13
+ | --- | --- | --- | --- | --- |
14
+ | GPT-3.5-Turbo | - | 58.00 | 59.47 | 52.74 |
15
+ | GPT-4 | - | 63.91 | 71.92 | 64.46 |
16
+ | TAT-LLM-7B | 7B | 65.13 | 76.49 | 71.38 |
17
+ | TAT-LLM-13B | 13B | 71.93 | 77.51 | 72.22 |
18
+ | TAT-LLM-70B | 70B | **76.81** | **81.42** | **76.55** |
19
+
20
+
21
+ ## Training
22
+
23
+ We train our TAT-LLM model in various sizes, including 7B, 13B, and 70B, by fine-tuning LLaMA 2 using Low-Rank Adaptation (LoRa) on a combination of the train sets from FinQA, TAT-QA and TAT-DQA datasets. To refine accuracy, we introduce an External Executor, enhancing the model by processing intermediate outputs to derive conclusive answers. Please refer to the [paper](https://arxiv.org/abs/2401.13223) for more details.
24
+
25
+ ## Inference & Evaluation
26
+
27
+ Please refer to code [here](https://github.com/fengbinzhu/TAT-LLM)
28
+
29
+ ## Citation
30
+
31
+ If you find this repository helpful, please consider citing our paper:
32
+
33
+ ```
34
+ @misc{zhu2024tatllm,
35
+ title={TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data},
36
+ author={Fengbin Zhu and Ziyang Liu and Fuli Feng and Chao Wang and Moxin Li and Tat-Seng Chua},
37
+ year={2024},
38
+ eprint={2401.13223},
39
+ archivePrefix={arXiv},
40
+ primaryClass={cs.CL}
41
+ }
42
+ ```