'분류 전체보기' 카테고리의 글 목록 (9 Page)

728x90

분류 전체보기 138

아래의 코드는 기존의 gym library를 이용한 경우지만, custom Env의 경우는 이전 글을 참조하면 된다. https://tw0226.tistory.com/80 import gym from stable_baselines3 import DQN from stable_baselines3.common.evaluation import evaluate_policy # Create environment env = gym.make('LunarLander-v2') # Instantiate the agent model = DQN('MlpPolicy', env, verbose=1) # Train the agent model.learn(total_timesteps=int(2e5)) # Save the agent m..

AI | ML/Reinforcement Learning 2021.07.13

[RL] 강화학습 모델 라이브러리 stable-baselines3 사용해보기

링크 : https://github.com/DLR-RM/stable-baselines3 DLR-RM/stable-baselines3 PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms. - DLR-RM/stable-baselines3 github.com 소개 강화학습을 학습시키려고 하면 모델들마다 사용법이 각각 달라서 많은 에러를 겪곤 한다. 하지만, stable-baselines3 라이브러리를 이용하면 아래의 장점이 생긴다. 1. 다양하게 구현된 모델들을 사용할 수 있고, 2. 경우에 따라 multi-processing 까지 지원된다. 일일이 모델마다 구현할 필요가 없어진다는 의..

AI | ML/Reinforcement Learning 2021.07.13

[CUDA] 윈도우 환경에서 CUDA 버전 변경

어떤 github의 딥러닝 코드는 tensorflow 1.14 버전을 썼는데, 1.14 버전은 CUDA 11버전이 호환되지 않아서 1. python 버전 변경 2. tensorflow 재설치 3. CUDA 버전 번경 (11 -> 10) 의 절차가 필요하였다. 나의 상황은 다음과 같고, 하나씩 적용해봤다. GPU : 2060 RTX OS : Windows 10 CUDA : 11.0 Python : 3.8 1. Python 버전 변경 다행히도 Anaconda를 쓰고 있어서, conda 환경으로 3.6 을 새로 만들어서 쉽게 해결하였다. 2. tensorflow 재설치 마찬가지로, conda 환경으로 새로 설치하고 나니 pip install tensorflow-gpu==1.14.0 으로 쉽게 해결하였다. 3...

AI | ML/AI 개발 | CUDA 2021.07.09

[AWS] Sagemaker endpoint job delete

sagemaker 에서 endpoint구성까지 완료한 뒤 삭제를 눌렀더니 아래의 에러가 발생하였고, ValidationException The Endpoint currently has one or more MonitoringSchedules. Please delete the MonitoringSchedules before deleting the Endpoint. 다음과 같이 조취하였다. 1 .aws sagemaker list-monitoring-schedules --region -> monitoring--schedule-name 이 나온다 2. aws sagemaker delete-monitoring-schedule --monitoring-schedule-name Built --region 3. aws ..

DevOps/AWS 2021.07.04

애플워치로 HRV 데이터 추출하기

HRV란 Heart Rate Variability의 약자로, 심박변이도라고 말하기도 한다. 쉽게 생각하면 심장은 항상 안정적으로 뛰기 때문에 변이가 적을수록 건강한 것이 아닌가 싶겠지만, 심장은 오히려 상황에 따라 변화하면서 불규칙하게 뛰기 때문에 건강할 사람일 수록 HRV가 더 높다고 한다. 검색해보던 중에 애플워치를 착용하고 있던 나에게 애플워치로도 HRV를 추출할 수 있다고 해서 검색해보고 시도해봤다. 1. 애플워치 데이터 추출하기 애플워치 공식홈페이지에 따라 행동하면 건강 데이터를 추출해낼 수 있다. https://support.apple.com/ko-kr/guide/iphone/iph27f6325b2/ios 애플워치에서 얻을 수 있는 생체 정보는 다음과 같다. 분당 심장박동수 휴식 시 심장박동수..

Programming Language/Python 2021.07.02

[PyTorch] RuntimeError: view size is not compatible with input tensor's size and stride (at least one dimension spans across two contiguous subspaces)

원인은 torch.view 에서 생기는 에러로 나의 경우는 .cuda()를 이용해 GPU 사용시 문제가 생겼다 해결방법 x.view(-1, ) -> x.contiguous().view(-1, )로 진행하면 해결된다

AI | ML/코드 2021.06.28

[PyTorch] RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation 에러

에러명은 아래와 같았다. RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [64, 4]], which is output 0 of SoftmaxBackward, is at version 64; expected version 0 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True). 원인을 자세히 보면, inplace opera..

AI | ML/코드 2021.06.28

[Keras] 'str' object has no attribute 'decode'

model.load_weights(path) 를 했을 때 생기는 에러로 에러명은 다음과 같이 나온다. 'str' object has no attribute 'decode' 원인 : 나의 경우는 h5py 버전이 3.1.0이였다 해결방법 : 2.10.0 버전으로 변경하면 된다. pip install h5py==2.10.0

AI | ML/코드 2021.06.22

[gym] NotImplementedError 에러 해결방법

더보기 File "C:\Users\tw\anaconda3\lib\site-packages\gym\core.py", line 264, in reset observation = self.env.reset(**kwargs) File "C:\Users\tw\anaconda3\lib\site-packages\gym\core.py", line 265, in reset return self.observation(observation) File "C:\Users\tw\anaconda3\lib\site-packages\gym\core.py", line 272, in observation raise NotImplementedError NotImplementedError 에러원인 : gym 버전이 맞지 않아서 생기는 오류 ..

AI | ML/Reinforcement Learning 2021.06.10

Asynchronous Methods for Deep Reinforcement Learning (A3C 리뷰)

논문 소개 ‌ 링크 : https://arxiv.org/pdf/1602.01783.pdf‌ github : https://github.com/ikostrikov/pytorch-a3c ‌ Main Contributions‌ Experience replay 기반 RL 대신, 비동기 병렬 방식을 이용한 새로운 패러다임 제시 GPU 기반의 학습보다, 병렬 처리 기반 CPU 연산으로 학습 속도, 안정성 개선‌ 1. Intro 요약‌ Experience replay 기반의 RL은 메모리와 연산이 더 크고, off-policy여서 업데이트 과정도 필요 본 논문에서는 비동기 병렬 방식을 이용해 새로운 패러다임을 제시 프로세스에서 다양한 스펙트럼이 생기고, Q-Learning 과 같은 다양한 off-policy 알고리즘..

AI | ML/Reinforcement Learning 2021.06.10

1 ··· 6 7 8 9 10 11 12 ··· 14

공부, 그리고 기록

백준 알고리즘, Unity ML-Agent, Data Science, 테트리스 DQN, Python, 몬스테라 분갈이, efficientDet, 강화학습 DQN, Windows, 파이썬 알고리즘, 알고리즘, ONNX, Python GIL, 브루트포스, python flask, Brute-force, C++, 딥러닝, 테트리스 강화학습, matplotlib,

Today :
Yesterday :

728x90

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

깜태

분류 전체보기 138

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역

2025. 04
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30