My rig has gotten SICK
-
Aw man… everything was going great up until now!
I have been tweaking CGminer for many, many hours the past two days and I finally had it running pretty well. I’ll fill my worries into RIPPEDDRAGON’s template and hope someone comes to my rescue:
-----PC Specs-----
[b]GPU (including manufacture):[/b] XFX R7970 and ASUS HD 7990
[b]Mining program(s):[/b] CGminer 3.1.0
[b]Mining Pool:[/b] D2
[b]GPU Core Clock:[/b] 1000
[b]GPU Memory Clock:[/b] 1500
[b]Thread Concurrency:[/b] 8192
[b]Average KH/s:[/b] around 715KH/s per GPU.
[b]The rest of your miner settings:[/b]
setx GPU_MAX_ALLOC_PERCENT 100
start cgminer.exe -d 0 --scrypt -o stratum+tcp://pool2.d2.cc:3333 -u X -p X --intensity 13 --worksize 256 -g 2 --thread-concurrency 8192 --gpu-memclock 1625 --gpu-engine 1025
start cgminer.exe -d 1 --scrypt -o stratum+tcp://pool2.d2.cc:3333 -u X -p X --intensity 13 --shaders 2048 --thread-concurrency 8192 -g 2 -w 256 --gpu-memclock 1500 --gpu-engine 1000
start cgminer.exe -d 2 --scrypt -o stratum+tcp://pool2.d2.cc:3333 -u X -p X --intensity 13 --shaders 2048 --thread-concurrency 8192 -g 2 -w 256 --gpu-memclock 1500 --gpu-engine 1000
[b]Any thing else you deem important:[/b] As you can see, I am running three instances of CGminer in order to configure the GPUs in different ways. GPU 0 is the 7970, and GPU 1 and 2 are on the 7990. The gpu-memclock and gpu-engine values are taken straight from the manufacturer’s specs. The GPUs get hot, but never over around 93C.
[b]The nature of your problem:[/b] After mining for a few minutes, the two GPU’s on the 7990 stop working, and I get the following messages:
[i]GPU1: Idle for more than 60 seconds, declaring SICK!
GPU1: Attempting to restart
Thread 2 still exists, killing it off
Thread 3 still exists, killing it off
GPU2: Idle for more than 60 seconds, declaring SICK!
GPU still showing activity suggesting a hard hang.
Will not attempt to auto-restart it.[/i]
[b]Are you getting HW(hardware) errors when mining:[/b] No.
Any clues what’s going on here? I’m beginning to suspect that the 7990 is a bit heat sensitive and maybe it already got affected by the few days it’s been here. I’m going to go looking for a big fan to point at the rig tomorrow to see if that changes anything. All thoughts are welcome! :)
-
You don’t have a fan set up already/now to push the heat pockets between cards out?
93°C is way too high for my comfort. You need to find a way to get that in the 80 range.
-
Or lower, please be careful.
-
When I get the Sick message it’s because my drivers have crashed.
My guess would be that running the 7990 on two different mining programs is causing the problem… merge the configs for the 7990 and run both GPUs in the same instance. There are multiple reasons why that could cause it to crash, but they all relate to two programs potentially causing deadlocks of some kind on that dual-GPU card.
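Here is a rough sketch of what a merged launch line for the 7990 could look like. It reuses the settings posted above for devices 1 and 2, which are identical, so each option only needs one value. It assumes your CGminer build accepts a repeated -d flag to select several devices; I have not checked that against 3.1.0, so look at the output of cgminer.exe --help first.

```bat
rem Sketch only: one CGminer instance driving both GPUs on the 7990 (devices 1 and 2).
rem Assumption: this build accepts a repeated -d flag; verify with "cgminer.exe --help".
rem All other values are copied from the two separate launch lines posted above.
start cgminer.exe -d 1 -d 2 --scrypt -o stratum+tcp://pool2.d2.cc:3333 -u X -p X --intensity 13 --shaders 2048 --thread-concurrency 8192 -g 2 -w 256 --gpu-memclock 1500 --gpu-engine 1000
```

The 7970 would keep its own separate instance with -d 0, as before.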
-
You should probably give it some soup. I recommend Campbell’s Chunky Healthy Chicken Noodle soup.
[url=http://www.amazon.com/Campbells-Healthy-Request-Classic-Microwavable/dp/B000V6L2FK/ref=sr_1_1?ie=UTF8&qid=1386387370&sr=8-1&keywords=soup]http://www.amazon.com/Campbells-Healthy-Request-Classic-Microwavable/dp/B000V6L2FK/ref=sr_1_1?ie=UTF8&qid=1386387370&sr=8-1&keywords=soup[/url]
But really you need to keep your cards cool. I recommend a temp of around 73 C
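If you want CGminer itself to help hold a temperature, something along these lines might work. It is only a sketch: the 73 is the target mentioned above, while the overheat and cutoff values of 85 and 90 are illustrative assumptions, not tested numbers. It also assumes the driver exposes fan and temperature control to CGminer on your card.

```bat
rem Sketch only: ask CGminer to manage the fan toward a target temperature.
rem --temp-target 73 is the recommendation above; 85 and 90 are assumed example values.
rem Assumption: fan and temperature readings are available to CGminer on this card.
start cgminer.exe --scrypt -o stratum+tcp://pool2.d2.cc:3333 -u X -p X --intensity 13 --auto-fan --temp-target 73 --temp-overheat 85 --temp-cutoff 90
```

Software fan control will not replace actual airflow, so treat the cutoff as a safety net rather than a fix.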
-
Thanks for the recommendations, guys.
The two cards are in the top and bottom slots, so there is some space in between and the rig is standing on top of a plastic crate. I have three 80mm fans pointed right at the three fans on the 7990, which gives it much more airflow. Before I put those up, it went to 100 pretty fast and would probably have continued to rise.
The door to the balcony is open, and the beginning of the Danish winter is making the room chilly - around 14C - 16C, I would say.
It really seems to me that it should be running pretty cool, but maybe this product just wants to die! >:(
[quote name=“RIPPEDDRAGON” post=“40890” timestamp=“1386386992”]
When I get the Sick message it’s because my drivers have crashed.
My guess would be that running the 7990 on two different mining programs is causing the problem… merge the configs for the 7990 and run both GPUs in the same instance. There are multiple reasons why that could cause it to crash, but they all relate to two programs potentially causing deadlocks of some kind on that dual-GPU card.
[/quote]
That makes some sense. I took the card out and I’m going to bed now, though. Gotta try that sleep thing before I do anything else. But I’ll try out your suggestion tomorrow. Thanks!
Can I put “-d 1 -d 2” in the same line to make it run both GPUs in the same instance of CGminer? If not, I’ll try some settings that both cards should be able to handle.
If that doesn’t help, I have some Arctic Silver 5 thermal paste arriving tomorrow or Monday. But I’m not sure I want to do anything with that card. Even if I get the heatsink to work better, it just seems so eager to get those temperatures up that I’m not sure it would be enough. I’m beginning to think this thing shouldn’t be used for mining, unless maybe with water cooling. It is doing around 1,450KH/s when it’s running, so I’d like to keep it, but it sure would hurt to fry it.
Depending on what I figure out tomorrow, I might be very happy and keep it, or just return it and get my money back. At the moment, the card is working… just not for mining.
-
I read somewhere that the ASUS and XFX cards are not good for mining, and that MSI cards are recommended at that level… ???
Does anyone experienced have something to say about this?? :P
-
[quote name=“Ernesto” post=“40914” timestamp=“1386390570”]
I read somewhere that the ASUS and XFX cards are not good for mining, and that MSI cards are recommended at that level… ???
Does anyone experienced have something to say about this?? :P
[/quote]
I personally have no experience with either company, as I prefer the fans on the Gigabyte; however, you can check for yourself
-
Actually, the XFX 7970 is not too bad. It performs around 720Kh/s, and stays between 81 - 84C most of the time.
It’s the ASUS 7990 that is causing me a headache.
I went out and got a fan to put next to the rig. I have removed the 7970 and reinstalled the 7990 now. I tried placing the fan in all positions around the rig and ended up placing it to send even more air to the fans on the video card, as this worked best.
This is what it looks like: https://dl.dropboxusercontent.com/u/3267314/DSC_0371.JPG
I have closed the balcony door, and the room temperature is 19.5C now.
GPU 0 is at a steady 85C, but GPU 1 is at 88 - 90C.
But it doesn’t get “SICK” anymore. This might be because I’m running both GPUs in the same instance of CGminer now.
This friggin’ card can’t be normal, can it? Shouldn’t I just hurry up and return it while I can?
EDIT: These are the settings I’m running the 7990 at right now:
cgminer.exe --scrypt -o stratum+tcp://pool2.d2.cc:3333 -u X -p X --intensity 13 --shaders 2048 --thread-concurrency 8192 -g 2 -w 256
This is getting me an average of 1.435Mh/s, so it really is performing quite well.
-
Alright. Thanks, MrFeathers. I’m returning the ASUS on Monday.
Maybe I will get some water cooling with some of the money I get back from that. Gotta get rid of that noise to not upset my flat mates any more than they already are. That growling monster isn’t too popular and neither am I right now! ::)