Update README.md
README.md
@@ -11,61 +11,37 @@ tags:
![](./images/GREEDRL-Logo-Original-640.png)


-#
+# 🤠GreedRL
-
-## 🏆Award


## Introduction

+***Combinatorial Optimization Problems (COPs)*** have long been an active field of research. Generally speaking, there exist two main approaches for solving COPs, each with its own pros and cons. On the one hand, *exact algorithms* can find the optimal solution, but they may be prohibitive for solving large instances because of the exponential increase in execution time.
On the other hand, *heuristic algorithms* can compute solutions efficiently, but are not able to prove the optimality of solutions.

-In the realistic business scenarios,
-proposes a generic and complete solver,
+In realistic business scenarios, COPs are usually large-scale (>= 1000 nodes) and impose very strict requirements on execution time and solution quality. To better solve these problems, we
+propose a generic and complete solver, named **🤠GreedRL**, based on **Deep Reinforcement Learning (DRL)**, which achieves faster and better solutions than *heuristic algorithms*.
+
+## 🏆Award
+
+## Main features

* **GENERAL**

+🤠GreedRL provides **a high-level abstraction of COPs** and can solve various types of problems, such as Vehicle Routing Problems (VRPs), Batching, Scheduling and Online Assignment problems. For VRPs, it also supports variants with different constraints, such as Time-Window, Pickup-Delivery, Split-Delivery and Multi-Vehicle.

* **HIGH-PERFORMANCE**

+🤠GreedRL improves DRL environment (Env) simulation speed with **CUDA and C++ implementations**. It also implements custom **operators**, such as *Masked Matrix Multiplication* and *Masked Additive Attention*, that replace native PyTorch operators to achieve the best computing performance (a sketch of the masking idea follows this feature list).

* **USER-FRIENDLY**

+🤠GreedRL **wraps commonly used modules**, such as neural network (NN) components, RL training algorithms and implementations of common COP constraints, which makes it easy to use.

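Neither operator's implementation appears in this diff. As a rough illustration of what *masked additive attention* followed by *masked softmax sampling* (both named in this README) compute, here is a minimal PyTorch sketch; the class and function names are illustrative, not GreedRL's actual CUDA operators:

```python
import torch
import torch.nn as nn

class MaskedAdditiveAttention(nn.Module):
    """Additive (Bahdanau-style) attention; infeasible nodes get a score of
    -inf, so they receive zero probability after the softmax.
    Illustrative sketch only, not GreedRL's fused CUDA operator."""

    def __init__(self, hidden_size):
        super().__init__()
        self.w_q = nn.Linear(hidden_size, hidden_size, bias=False)
        self.w_k = nn.Linear(hidden_size, hidden_size, bias=False)
        self.v = nn.Linear(hidden_size, 1, bias=False)

    def forward(self, query, keys, mask):
        # query: (B, H) decoder state; keys: (B, N, H); mask: (B, N), True = infeasible
        scores = self.v(torch.tanh(self.w_q(query).unsqueeze(1) + self.w_k(keys)))
        scores = scores.squeeze(-1).masked_fill(mask, float("-inf"))
        return torch.softmax(scores, dim=-1)  # (B, N) probabilities over nodes


def masked_softmax_sample(probs):
    # sample one feasible node per instance; masked nodes have probability 0
    return torch.multinomial(probs, num_samples=1).squeeze(-1)
```

Masking before the softmax, rather than zeroing probabilities afterwards, keeps the distribution normalized over the feasible nodes only.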
-## Architecture design
-The entire architecture is divided into three layers:
-
-* **High-performance Env framework**
-
-The constraints and optimization objectives for the problems to be solved are defined in the Reinforcement Learning (RL) Environment (Env).
-Based on performance and ease-of-use considerations, the Env framework provides two implementations: one based on **PyTorch** and one based on **CUDA C++**.
-To facilitate the definition of problems, the framework abstracts multiple variables that represent the environment's state; these are generated automatically once declared by the user, and developers can refer to the declared variables directly when defining constraints and optimization objectives (see the sketch after the diagram below).
-
-Currently, various VRP variants such as CVRP, VRPTW and PDPTW, as well as problems such as Batching, are supported.
-
-* **Pluggable NN components**
-
-The framework provides a set of neural network (NN) components, and developers can also implement custom NN components.
-
-* **High-performance NN operators**
-
-To achieve the best performance, the framework implements high-performance operators specifically for Combinatorial Optimization (CO) problems, such as Masked Additive Attention and Masked Softmax Sampling, to replace the corresponding PyTorch operators.
-
-![](./images/GREEDRL-Framwork.png)
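For a feel of how constraints and objectives can be expressed over declared state variables, here is a hedged sketch in plain PyTorch of a CVRP-style feasibility mask and tour-length objective. The function names and tensor layout are illustrative assumptions, not the Env framework's API; the real modeling interface appears in the examples later in the README.

```python
import torch

def feasible_mask(demand, used_capacity, capacity, visited):
    # A batched CVRP-style feasibility rule: a node may be chosen only if it
    # is unvisited and its demand fits into the vehicle's remaining capacity.
    # demand: (B, N); used_capacity: (B, 1); capacity: scalar; visited: (B, N) bool
    over_capacity = used_capacity + demand > capacity
    return visited | over_capacity  # True = infeasible


def route_cost(coords, tour):
    # Optimization objective: total Euclidean length of each decoded tour.
    # coords: (B, N, 2); tour: (B, T) node indices in visiting order
    idx = tour.unsqueeze(-1).expand(-1, -1, 2)
    path = coords.gather(1, idx)          # (B, T, 2) visited coordinates
    legs = path[:, 1:] - path[:, :-1]     # consecutive displacements
    return legs.norm(dim=-1).sum(dim=1)   # (B,) tour lengths
```

The returned mask plugs directly into a masked softmax such as the one sketched above.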
-
-## Network design
-The neural network adopts the Seq2Seq architecture commonly used in Natural Language Processing (NLP), with a Transformer in the encoding part and an RNN in the decoding part, as shown in the diagram below.
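As one concrete reading of that encoder-decoder split, the hedged sketch below pairs a Transformer encoder with a GRU decoder that picks one node per step through the `MaskedAdditiveAttention` module from the earlier sketch. The sizes, the mean-pooled initial state and the no-revisit masking are illustrative assumptions, not the exact network in the diagram.

```python
import torch
import torch.nn as nn

class Seq2SeqRouter(nn.Module):
    # Transformer encoder over problem nodes + RNN (GRU) decoder that emits
    # one node per step; reuses MaskedAdditiveAttention from the sketch above.
    def __init__(self, input_size=3, hidden=128, heads=8, layers=3):
        super().__init__()
        self.embed = nn.Linear(input_size, hidden)
        layer = nn.TransformerEncoderLayer(d_model=hidden, nhead=heads,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=layers)
        self.cell = nn.GRUCell(hidden, hidden)
        self.attention = MaskedAdditiveAttention(hidden)

    def forward(self, nodes, mask, steps):
        # nodes: (B, N, input_size); mask: (B, N) with True = infeasible
        keys = self.encoder(self.embed(nodes))   # (B, N, hidden) node embeddings
        state = keys.mean(dim=1)                 # initial decoder state
        last = state                             # stand-in for "last visited node"
        tour = []
        for _ in range(steps):                   # steps <= number of nodes
            state = self.cell(last, state)       # one RNN decoding step
            probs = self.attention(state, keys, mask)
            chosen = torch.multinomial(probs, 1).squeeze(-1)
            tour.append(chosen)
            last = keys[torch.arange(keys.size(0)), chosen]
            mask = mask.scatter(1, chosen.unsqueeze(1), True)  # no revisits
        return torch.stack(tour, dim=1)          # (B, steps) visiting order
```

The encoder runs once per instance while the cheap GRU cell runs once per decoding step, so most of the compute is amortized across the whole rollout.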


-## Modeling examples
+## Architecture
+![](./images/GREEDRL-Framwork.png)
+
+## COPs Modeling examples


### Capacitated Vehicle Routing Problem (CVRP)
@@ -577,7 +553,7 @@ class Objective:

#

-#
+# GreedRL-CVRP-pretrained model


## Model description