llama-3-reactor / README.md
Corianas's picture
Create README.md
bef2eac verified
metadata
license: apache-2.0
language:
  - en

This is a Re-act style model trained from meta-llama/Meta-Llama-3-8B

Dataset was parsed with:

def extract_trajectory_info(data):
    """
    Extracts the question, thoughts, actions, and observations from the trajectory field of the data.

    Parameters:
    data (dict): The data entry containing the trajectory field.

    Returns:
    dict: A dictionary containing the extracted question, thoughts, actions, and observations.
    """
    # Extracting the question
    question = data.get('question', '')

    # Extracting thoughts, actions, and observations using regex
    thoughts = re.findall(r'Thought \d+: (.+?)(?=Action|\Z)', data.get('trajectory', ''), re.DOTALL)
    actions = re.findall(r'Action \d+: (.+?)(?=Observation|\Z)', data.get('trajectory', ''), re.DOTALL)
    observations = re.findall(r'Observation \d+: (.+?)(?=Thought|\Z)', data.get('trajectory', ''), re.DOTALL)

    # Cleaning up the extracted data
    thoughts = [thought.strip() for thought in thoughts]
    actions = [action.strip() for action in actions]
    observations = [observation.strip() for observation in observations]

    return {
        'question': question,
        'thoughts': thoughts,
        'actions': actions,
        'observations': observations
    }
# Sample data
extracted_info = extract_trajectory_info(ds["train"][0])

Then remade into a new dataset with

# Predefine the instructions for the task
preamble = """Tools available:
(1) Search[entity], which searches the exact entity on Wikipedia and returns the first paragraph if it exists. If not, it will return some similar entities to search.
(2) Lookup[keyword], which returns the next sentence containing the keyword in the current passage.
(3) Finish[answer], which returns the answer and finishes the task.
"""
dataset = []
# Iterate through a specified number of examples in the training set
for i in range(len(ds['train'])):
    extracted_info = extract_trajectory_info(ds['train'][i])

    # Iterate through each thought in the extracted information
    for j in range(len(extracted_info['thoughts'])):
        out = f"{preamble}---\nQuestion: {extracted_info['question']}\n"
        prev = ""
        # Construct output for the first thought
        if j == 0:
            out += f"Thought: {extracted_info['thoughts'][0]}\n"
            out += f"Action: {extracted_info['actions'][0]}\nPAUSE\n\n\n\n"

        else:
            for k in range(1, j + 1):
                # Use appropriate indexing to avoid out-of-bounds errors
                prev += f"Thought:{extracted_info['thoughts'][j - k]}\n"
                prev += f"Action: {extracted_info['actions'][j - k]}\nPAUSE\n"

                prev += f"Observation: {extracted_info['observations'][j - k]}\n"

            out += prev  # Remove trailing space
            out += f"---\nThought: {extracted_info['thoughts'][j]}\n"
            out += f"Action: {extracted_info['actions'][j]}\nPAUSE\n\n\n\n"

        # Print the constructed output
        print(out)
        dataset.append(out)
        #print(len(out))