mining tasks: running but not finishing #175

Open
ThomasBlock opened this issue Nov 23, 2024 · 0 comments
ThomasBlock commented Nov 23, 2024

The new mining task for ECP seems to be running. @Normalnoise
But the task list shows the earlier tasks as terminated with a reward of 0.0000, so I guess I do not collect rewards?
There is no "task finished" message.
(Maybe one of the errors in the log is to blame?)

Here is the log from my ECP worker:

[GIN] 2024/11/23 - 13:20:32 | 200 |    1.070928ms |   67.215.11.182 | GET      "/api/v1/computing/cp"
/home/user/go-computing-provider/internal/computing/scan_task_payment.go:167 record not found
[0.103ms] [rows:0] SELECT * FROM `t_scan_chain` WHERE `t_scan_chain`.`id` = 1 ORDER BY `t_scan_chain`.`id` LIMIT 1
time="2024-11-23 13:23:25.357" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 31.33 GiB, freeStorage: 611.86 GiB, freeGpu: map[NVIDIA A4000:1]" func=reportClusterResourceForDocker file="ubi_service.go:1084"
time="2024-11-23 13:23:25.970" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:23:26.053" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:23:26.137" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:23:26.221" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:23:26.305" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"

time="2024-11-23 13:25:45.549" level=info msg="Job received Data: {UUID:332b4c35-eb43-4ed4-89d9-11ab1636ab6f Name:332b4c35-eb43-4ed4-89d9-11ab1636ab6f Image:swanchain254/aleo_mainnet_f2pool:3.0.12 Envs:map[ACCOUNTNAME:swanubi MINER_URL:stratum+ssl://aleo-asia.f2pool.com:4420 WORKERNAME:112861732364630] Resource:{CPU:4 Memory:17179869184 Storage:21474836480 GPU:1 GPUModel:NVIDIA A4000} Price: Duration:1200}" func=DeployJob file="ecp_image_service.go:71"

docker ps -a
CONTAINER ID   IMAGE                                     COMMAND                  CREATED          STATUS                      PORTS     NAMES
b57a01c859ee   swanchain254/aleo_mainnet_f2pool:3.0.12   "/usr/bin/bash start…"   7 minutes ago    Up 7 minutes                          332b4c35-eb43-4ed4-89d9-11ab1636ab6f-41rfo

 docker logs b57a01c859ee
2024-11-23 20:25:48.851  INFO Aleo Miner start
2024-11-23 20:25:48.866  INFO connecting .
2024-11-23 20:25:48.876  INFO init gpu[0]
2024-11-23 20:25:49.460  INFO 7 new work received epoch=ab1u7geslq65vgpe8sf4l382qp07364tsffq4gdsc4cgxhca0c6dyysukn5vl proof_target 8388608
2024-11-23 20:26:40.853  INFO Found a solution with target 66901259 8388608 13061326203524808387
2024-11-23 20:26:40.952  INFO Pool response
2024-11-23 20:26:47.524  INFO Found a solution with target 8632326 8388608 13061306821571592329
2024-11-23 20:26:47.623  INFO Pool response
2024-11-23 20:26:48.876  INFO 
+--------------------------------------------------------------------------------------------------------+
|                                               Aleo Miner                                               |
+========================================================================================================+
| Start Time   2024-11-23T12:25:48                                                                       |
| Version      3.0.12                                                                                    |
| GPU(0)       405909                                                                                    |
| Speed(S/s)   405909.50                                                                                 |
+--------------------------------------------------------------------------------------------------------+
2024-11-23 20:26:49.036  INFO Pool response
2024-11-23 20:27:28.160  INFO Found a solution with target 9979425 8388608 13061343068092412128

nvidia-smi 
Sat Nov 23 13:42:33 2024       
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.171.04             Driver Version: 535.171.04   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA RTX A4000               Off | 00000000:00:10.0 Off |                  Off |
| 78%   90C    P2             139W / 140W |   4748MiB / 16376MiB |    100%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
                                                                                         
+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|    0   N/A  N/A    145166      C   aleominer                                  4742MiB |
+---------------------------------------------------------------------------------------+


computing-provider ubi list --show-failed
1506070	                                          	GPU      	fil-c2 	verified	YES      	2024-11-23 11:25:35	
1505711	                                          	GPU      	fil-c2 	verified	YES      	2024-11-23 11:55:56	
1506642	                                          	GPU      	fil-c2 	verified	YES      	2024-11-23 12:25:42	
1504509	                                          	GPU      	fil-c2 	verified	YES      	2024-11-23 12:55:59	

computing-provider task list --type ecp -v 
TASK UUID                           	TASK NAME                           	IMAGE NAME                             	CONTAINER NAME                            	CONTAINER STATUS	REWARD	CREATE TIME         
e575c95a-fd35-4763-898f-8efdc59cd1c1	e575c95a-fd35-4763-898f-8efdc59cd1c1	swanchain254/aleo_mainnet_f2pool:3.0.12	e575c95a-fd35-4763-898f-8efdc59cd1c1-g1opq	terminated      	0.0000	2024-11-22 15:26:10	
371c823d-425b-4db9-a66b-5d93005785c5	371c823d-425b-4db9-a66b-5d93005785c5	swanchain254/aleo_mainnet_f2pool:3.0.12	371c823d-425b-4db9-a66b-5d93005785c5-kdv3g	terminated      	0.0000	2024-11-22 20:55:57	
92d8a3e3-b8ba-4042-a19f-dfe80920c342	92d8a3e3-b8ba-4042-a19f-dfe80920c342	swanchain254/aleo_mainnet_f2pool:3.0.12	92d8a3e3-b8ba-4042-a19f-dfe80920c342-3d5b0	terminated      	0.0000	2024-11-23 02:26:06	
af3fd426-bc24-4233-a2c8-5c7998049d84	af3fd426-bc24-4233-a2c8-5c7998049d84	swanchain254/aleo_mainnet_f2pool:3.0.12	af3fd426-bc24-4233-a2c8-5c7998049d84-rk5c0	terminated      	0.0000	2024-11-23 06:55:59	
8cdce2b4-3d75-43fa-bc43-315d7ba88e46	8cdce2b4-3d75-43fa-bc43-315d7ba88e46	swanchain254/aleo_mainnet_f2pool:3.0.12	8cdce2b4-3d75-43fa-bc43-315d7ba88e46-r7cbp	terminated      	0.0000	2024-11-23 10:26:00	
332b4c35-eb43-4ed4-89d9-11ab1636ab6f	332b4c35-eb43-4ed4-89d9-11ab1636ab6f	swanchain254/aleo_mainnet_f2pool:3.0.12	332b4c35-eb43-4ed4-89d9-11ab1636ab6f-41rfo	running         	0.0000	2024-11-23 13:25:48	

Later in the same ECP worker log:

time="2024-11-23 13:44:26.441" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
[GIN] 2024/11/23 - 13:44:32 | 200 |    1.110317ms |   67.215.11.182 | GET      "/api/v1/computing/cp"
[GIN] 2024/11/23 - 13:46:51 | 200 |  232.671644ms |   67.215.11.182 | DELETE   "/api/v1/computing/cp/job/332b4c35-eb43-4ed4-89d9-11ab1636ab6f"
time="2024-11-23 13:47:25.357" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 31.31 GiB, freeStorage: 611.77 GiB, freeGpu: map[NVIDIA A4000:1]" func=reportClusterResourceForDocker file="ubi_service.go:1084"
/home/user/go-computing-provider/internal/computing/scan_task_payment.go:167 record not found
[3.114ms] [rows:0] SELECT * FROM `t_scan_chain` WHERE `t_scan_chain`.`id` = 1 ORDER BY `t_scan_chain`.`id` LIMIT 1
time="2024-11-23 13:47:26.037" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:47:26.121" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:47:26.205" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:47:26.289" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:47:26.374" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
time="2024-11-23 13:47:26.458" level=error msg="failed to parse resp body: {\"code\":400,\"msg\":\"record not found\"}" func=checkAgain file="scan_task_payment.go:144"
[GIN] 2024/11/23 - 13:48:45 | 200 |    1.151627ms |   38.104.153.43 | GET      "/api/v1/computing/cp"
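
A quick sanity check on the timestamps above, as a sketch rather than a diagnosis: the job was received with Duration:1200, the task list shows CREATE TIME 13:25:48 for it, and the DELETE request for the same UUID was logged at 13:46:51. Assuming Duration is in seconds (the log does not state the unit) and that both timestamps come from the same clock, the elapsed time can be compared with the requested duration:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S"

# Values copied from the output above for task 332b4c35-eb43-4ed4-89d9-11ab1636ab6f.
created = datetime.strptime("2024-11-23 13:25:48", FMT)  # CREATE TIME in the task list
deleted = datetime.strptime("2024-11-23 13:46:51", FMT)  # [GIN] DELETE .../cp/job/<uuid>
duration_s = 1200  # "Duration:1200" from the job data; the unit (seconds) is an assumption

elapsed_s = int((deleted - created).total_seconds())
overrun_s = elapsed_s - duration_s

print(elapsed_s, overrun_s)
```

This prints `1263 63`: the DELETE arrived about a minute after 1200 seconds had passed, which would be consistent with the task running for its full duration before being removed. Whether that removal is what marks a task as terminated with a 0.0000 reward is not something the logs here confirm.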



