Commit Graph

3 Commits (6de135680eca81353892abe4f6d34c2c04f7dd5d)

Author SHA1 Message Date
mgjeon 6de135680e feat: Enhance action selection and Q-table initialization
Key changes:
- Add random Q-table initialization with small values (0-0.1)
- Implement action masking mechanism to prevent repeated actions
- Add debug information to show available actions and Q-values
- Add epsilon-greedy selection with action masking
- Add tests for policy and agent behavior
2025-09-24 16:51:44 +09:00
mgjeon a81e1d4232 feat: 학습 및 평가 프로세스 개선
- data_collector.py: JSON 형식 로깅 추가
- train_offline.py: Q-table 저장 형식 개선
- evaluate.py: 평가 지표 상세화
- usecases/: 평가 로직 개선
- tests/: 테스트 케이스 업데이트
2025-09-22 16:36:07 +09:00
mgjeon 0c2ec47c6b Initial commit 2025-09-18 08:46:44 +09:00