---
title: JSSP OpenEnv
emoji:
colorFrom: green
colorTo: purple
sdk: docker
pinned: false
---

# jssp_openenv

Try it live on Hugging Face Spaces

## Job shop scheduling problem (JSSP)

The Job Shop Scheduling Problem (JSSP) is a classic optimization problem in operations research. Given a set of jobs, each consisting of multiple operations that must be performed in a specific sequence, and a set of machines, the goal is to schedule the operations on machines to minimize the total completion time (makespan).

Key constraints:

- Each job consists of a sequence of operations that must be completed in order
- Each operation requires a specific machine for a given duration
- Each machine can process only one operation at a time
- Once started, an operation cannot be interrupted
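
To make the setup concrete, here is a toy two-job, two-machine instance in plain Python. The `(machine, duration)` tuple representation is purely illustrative, not the package's data format:

```python
# A toy 2-job x 2-machine JSSP instance. Each job is an ordered list of
# (machine_id, duration) operations that must run in sequence.
jobs = [
    [(0, 3), (1, 2)],  # job 0: 3 time units on machine 0, then 2 on machine 1
    [(1, 4), (0, 1)],  # job 1: 4 time units on machine 1, then 1 on machine 0
]

# A simple lower bound on the makespan: the busiest machine's total workload.
machine_load = {}
for job in jobs:
    for machine, duration in job:
        machine_load[machine] = machine_load.get(machine, 0) + duration

lower_bound = max(machine_load.values())
print(lower_bound)  # machine 1 carries 2 + 4 = 6 time units
```

Any valid schedule for this instance takes at least 6 time units, since machine 1 alone has 6 time units of work.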

This implementation uses the OpenEnv framework to create a reinforcement learning environment where an agent (policy) learns to make scheduling decisions at each time step.

> [!TIP]
> For now, only the FT06 instance is implemented and run. It is a well-known instance from the literature with a known optimal solution. The goal for training is to support arbitrary random instances.

## OpenEnv

OpenEnv is a framework from Meta PyTorch and Hugging Face for building reinforcement learning environments. It provides:

- A standardized interface for environments with Action and Observation models
- A web-based interface for interactive exploration of environments
- A client-server architecture for distributed training and evaluation
- Integration with LLM-based policies for solving complex problems

This project implements a JSSP environment using OpenEnv, allowing you to:

- Interact with the environment through a web interface
- Test different scheduling policies (FIFO, Max-Min, LLM-based)
- Train reinforcement learning agents to solve JSSP instances

## Project Architecture

The project follows a client-server architecture using the OpenEnv framework:

### Core Components

**Models** (`src/jssp_openenv/models.py`):

- `JSSPAction`: represents scheduling actions (a list of job IDs to schedule)
- `JSSPObservation`: contains the current state (machine status, job status, remaining operations)
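
A rough sketch of what these two models carry. The field names below are assumptions chosen for illustration, not the package's exact schema:

```python
from dataclasses import dataclass, field

# Illustrative sketch only: field names are assumptions, not the real schema.
@dataclass
class JSSPAction:
    job_ids: list[int] = field(default_factory=list)  # jobs to schedule now

@dataclass
class JSSPObservation:
    current_time: int = 0
    machine_busy: list[bool] = field(default_factory=list)  # per-machine status
    remaining_ops: list[int] = field(default_factory=list)  # ops left per job
    done: bool = False

obs = JSSPObservation(machine_busy=[False, False], remaining_ops=[2, 2])
action = JSSPAction(job_ids=[0])
```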

**Environment** (`src/jssp_openenv/server/jssp_environment.py`):

- `JSSPEnvironment`: the core simulation environment that:
  - Manages job progress and machine states
  - Validates actions and enforces constraints
  - Advances simulation time using SimPy
  - Returns observations and rewards

**Client** (`src/jssp_openenv/client.py`):

- `JSSPEnvClient`: HTTP client that communicates with the environment server
- Handles action serialization and observation parsing
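
The wire format is defined by the OpenEnv framework rather than reproduced here, but action serialization boils down to something like a JSON round-trip of the action's fields (the payload shape below is an assumption):

```python
import json

# Hypothetical payload shape for a scheduling action; the real wire format
# is defined by the OpenEnv framework, not shown here.
payload = json.dumps({"job_ids": [0, 2]})
decoded = json.loads(payload)
print(decoded["job_ids"])  # [0, 2]
```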

**Policies** (`src/jssp_openenv/policy.py`):

- `JSSPEnvPolicy`: abstract base class for scheduling policies
- `JSSPFifoPolicy`: First-In-First-Out scheduling (schedules jobs in ID order)
- `JSSPMaxMinPolicy`: Max-Min scheduling (prioritizes the longest operations)
- `JSSPLLMPolicy`: LLM-based scheduling using OpenAI-compatible APIs
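
The difference between the two baseline heuristics can be sketched in a few lines. The `(job_id, next_op_duration)` tuples here stand in for the real observation:

```python
# FIFO picks the schedulable job with the lowest id; Max-Min picks the job
# whose next operation is longest. (Stand-in functions, not the real classes.)

def fifo_pick(available):
    # available: list of (job_id, next_op_duration) tuples
    return min(available, key=lambda a: a[0])[0]

def maxmin_pick(available):
    return max(available, key=lambda a: a[1])[0]

available = [(2, 5), (0, 3), (1, 7)]
print(fifo_pick(available))    # 0 (lowest job id)
print(maxmin_pick(available))  # 1 (longest next operation: 7 time units)
```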

**Solver** (`src/jssp_openenv/solver.py`):

- `solve_jssp()`: orchestrates the solving process by:
  - Resetting the environment
  - Iteratively applying policy actions
  - Tracking scheduled events for visualization
  - Returning the makespan and event history
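
A minimal sketch of this reset/step/track loop, using a stub environment on a toy two-job instance (the stub and its methods are stand-ins, not the package's classes):

```python
class StubEnv:
    """Toy environment: each step() schedules one job's next operation."""

    def __init__(self):
        # job -> ordered list of (machine_id, duration) operations
        self.jobs = [[(0, 3), (1, 2)], [(1, 4), (0, 1)]]

    def reset(self):
        self.machine_free = [0, 0]  # time at which each machine becomes free
        self.job_ready = [0, 0]     # time at which each job's next op can start
        self.next_op = [0, 0]       # index of each job's next operation
        self.events = []            # (job_id, machine_id, start, end) history

    def step(self, job_id):
        machine, duration = self.jobs[job_id][self.next_op[job_id]]
        start = max(self.machine_free[machine], self.job_ready[job_id])
        end = start + duration
        self.machine_free[machine] = end
        self.job_ready[job_id] = end
        self.next_op[job_id] += 1
        self.events.append((job_id, machine, start, end))

def solve(env, order):
    """Reset, apply one scheduling decision per step, return makespan + events."""
    env.reset()
    for job_id in order:
        env.step(job_id)
    makespan = max(end for *_, end in env.events)
    return makespan, env.events

env = StubEnv()
makespan, events = solve(env, [0, 1, 0, 1])  # FIFO-like ordering
print(makespan)  # 6
```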

**Visualization** (`src/jssp_openenv/gantt.py`):

- Generates Gantt charts showing the schedule timeline
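
The real module renders image charts; this ASCII sketch only illustrates what the event history encodes, with one `(job_id, machine_id, start, end)` tuple per scheduled operation:

```python
# Render a tiny schedule as text rows, one per machine: each cell shows which
# job occupies the machine at that time unit ("." means idle).
events = [(0, 0, 0, 3), (1, 1, 0, 4), (0, 1, 4, 6), (1, 0, 4, 5)]
horizon = max(end for *_, end in events)

lines = []
for m in range(2):
    row = ["."] * horizon
    for job, machine, start, end in events:
        if machine == m:
            for t in range(start, end):
                row[t] = str(job)
    lines.append(f"M{m} |{''.join(row)}|")

print("\n".join(lines))
# M0 |000.1.|
# M1 |111100|
```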

## How to use

### Install

Install the package and its dependencies:

```
pip install -e .
```

For development with additional tools (pytest, ruff, etc.):

```
pip install -e ".[dev]"
```

Note: for LLM-based policies, you'll need to set the `HF_TOKEN` environment variable with your Hugging Face API token:

```
export HF_TOKEN=your_token_here
```

### Run server

To play with the environment locally, run

```
python app.py
```

and go to http://0.0.0.0:8000/web.

### Run policy

FIFO policy (always schedule the first available job):

```
python run.py fifo
```

Max-Min policy (always schedule the job with the longest operation first):

```
python run.py maxmin
```

LLM policy (ask an LLM to solve the problem):

```
python run.py llm --model-id "openai/gpt-oss-20b:groq"
python run.py llm --model-id "openai/gpt-oss-120b:cerebras"
python run.py llm --model-id "Qwen/Qwen3-32B:groq"
```

### Check results

The solver resolves the problem using the chosen policy and then plots a Gantt chart of the solution in the `./charts` folder.

Here is an example:

*FIFO Policy Gantt Chart*

## Current Results

Results as of Nov. 7, 2024 on the FT06 problem instance. Note: non-scientific results; only 1 episode was run per policy.

| Policy                       | Makespan |
| ---------------------------- | -------- |
| Optimal solution             | 55       |
| openai/gpt-oss-20b:groq      | 61       |
| FIFO                         | 68       |
| openai/gpt-oss-120b:cerebras | 69       |
| Qwen/Qwen3-32B:groq          | 69       |
| Max-Min                      | 77       |

## Run with Docker

Build the Docker image:

```
docker build -t jssp-openenv .
```

Run the container:

```
docker run -p 7860:7860 jssp-openenv
```

The web interface will be available at http://localhost:7860/web.

## TODO

- Run on other benchmark instances (FT10, FT20)
- Run on random instances
- Run multiple policies and summarize the results
- Implement a trainer