[Dev] NeoScrypt GPU Miner - Public Beta Test
-
My 1st guess it runs out of private memory. It takes 512 bytes for block mixing + 800 bytes for FastKDF and BLAKE2s per kernel instance. That’s not including local variables, counters, etc. Scrypt consumes 3 times less private memory. It’s opposite for global memory requirements, so you are not going to exceed them. Although the GCN cards report about the same amounts of local and constant memory (32Kb + 64Kb), they also have 32Kb of L1 cache which may help. Maybe they also have more private space (registers). Global memory is used for V space only. Not much activity there. Everything else runs in private/local space.
Another guess there is something wrong with the miner itself related to scheduling of kernel threads. Increase intensity over 13 and hash rate reduces. Increase it even more and see HW errors. Set to 20 and it hangs up. Scrypt can do 20, but it’s different. Need to start with a clean fork and add the NeoScrypt support myself probably. Have a few other ideas, but they also need work.
-
My 1st guess it runs out of private memory. It takes 512 bytes for block mixing + 800 bytes for FastKDF and BLAKE2s per kernel instance. That’s not including local variables, counters, etc. Scrypt consumes 3 times less private memory. It’s opposite for global memory requirements, so you are not going to exceed them. Although the GCN cards report about the same amounts of local and constant memory (32Kb + 64Kb), they also have 32Kb of L1 cache which may help. Maybe they also have more private space (registers). Global memory is used for V space only. Not much activity there. Everything else runs in private/local space.
Another guess there is something wrong with the miner itself related to scheduling of kernel threads. Increase intensity over 13 and hash rate reduces. Increase it even more and see HW errors. Set to 20 and it hangs up. Scrypt can do 20, but it’s different. Need to start with a clean fork and add the NeoScrypt support myself probably. Have a few other ideas, but they also need work.
I feel stupid. For some reason, I was thinking of GCN cards while talking about 6xxx. Oops.
-
Preparing my GCN kernel for public release; cleaning code, removing stuff I tried that really sucked, like completely unrolled chacha/salsa, stuff like that. After that, I’ll package it up with SGMiner and it should be good to go. Should give results like this (NSFW): https://ottrbutt.com/miner/neoscryptwolf-11082014.png
-
Preparing my GCN kernel for public release; cleaning code, removing stuff I tried that really sucked, like completely unrolled chacha/salsa, stuff like that. After that, I’ll package it up with SGMiner and it should be good to go. Should give results like this (NSFW): https://ottrbutt.com/miner/neoscryptwolf-11082014.png
Those numbers look great, can’t wait to try this. :)
Does the version of SGMiner your building have xIntensity or have you given any thought to using cgminer 3.7.3 Kalroth that has xIntensity for a build?
More info can be found here from that page it states the new SGMIner 4.1 has xintensity and might be a better choose. Personally I like
cgminer better and had better results with it than sgminer so far.
-
Those numbers look great, can’t wait to try this. :)
Does the version of SGMiner your building have xIntensity or have you given any thought to using cgminer 3.7.3 Kalroth that has xIntensity for a build?
More info can be found here from that page it states the new SGMIner 4.1 has xintensity and might be a better choose. Personally I like
cgminer better and had better results with it than sgminer so far.
Doesn’t matter - kernel can be used with both.
EDIT: It can be used with any CGMiner/SGMiner that has Neoscrypt support, that is.
-
WHOOOOOOOOOOOA!!!
Installed 14.9 drivers and got cgminer 3.8.7
The result:
hashrate jumped from 95 to 135!!! :))) Same temps!!!
-
WHOOOOOOOOOOOA!!!
Installed 14.9 drivers and got cgminer 3.8.7
The result:
hashrate jumped from 95 to 135!!! :))) Same temps!!!
Was that your first time using my fixed kernel on 14.9?
-
Was that your first time using my fixed kernel on 14.9?
Actually yes… i guess i am being a bit slow on those updates :D Good job! Thanks for your involvement
-
Actually yes… i guess i am being a bit slow on those updates :D Good job! Thanks for your involvement
No problem; you should be getting more hash soon!
-
No problem; you should be getting more hash soon!
Let me step right up and personally thank you for the development you have done on this.
Post a btc address and I’ll send you a couple satoshi, or post a guncoin address and I’ll send you a couple thousand. ;)
-
It’s 19.5KH/s now on a HD6970. FastKDF and BLAKE2s have been cleaned up and optimised, memory requirements reduced.
Yeah, I’ve mentioned this in my white paper. Not sure if it’s of any use for mining.
It is, but that’s not what concerns me now. With FastKDF removed, the kernel gets reduced in size by ~60% and outputs 30KH/s.That’s a big overhead, but not critical and I’ve expected more out of ChaCha + Salsa. With ChaCha only enabled, it’s 58KH/s and with Salsa only = 56KH/s. Scalar Salsa isn’t supposed to be about as fast as vectorised ChaCha. It’s clearly scalar because the AMD compiler isn’t really smart and the kernel size is about double of ChaCha only size. Anyway, there is a huge bottleneck somewhere and it needs to be identified.
Don’t work for me now. Cards in rig: 6950,6870,5870, miner 3.7.7b. Screen:
-
Don’t work for me now. Cards in rig: 6950,6870,5870, miner 3.7.7b. Screen:
Post more information.
For example:
Windows version and if it is 64 or 32-bit
AMD Catalyst drivers
what worksize are you using?
have you set the following environmental vars?
GPU_MAX_ALLOC_PERCENT=100
GPU_USE_SYNC_OBJECTS=1
also, upgrade to 3.7.7c or use sgminer
-
I submitted a bug report to sgminer-dev github in regards to the apparent worksize issue in sgminer-dev binaries on Win64.
-
Post more information.
For example:
Windows version and if it is 64 or 32-bit
AMD Catalyst drivers
what worksize are you using?
have you set the following environmental vars?
GPU_MAX_ALLOC_PERCENT=100
GPU_USE_SYNC_OBJECTS=1
also, upgrade to 3.7.7c or use sgminer
Yep, i forgot this settings: GPU_MAX_ALLOC_PERCENT=100 , GPU_USE_SYNC_OBJECTS=1… on sgminer5 now works fine, thanks! 24 kh/s on radeon 6870 :))
-
Let me step right up and personally thank you for the development you have done on this.
Post a btc address and I’ll send you a couple satoshi, or post a guncoin address and I’ll send you a couple thousand. ;)
I will once the release happens; still waiting on a withdrawal.
-
Don’t work for me now. Cards in rig: 6950,6870,5870, miner 3.7.7b. Screen:
MAX_GLOBAL_THREADS equals to CONCURRENT_THREADS in SGminer and other non-customised miners. This is --thread-concurrency.
-
Great work , neoscrypt_vliw.cl v2 is very good ,my 6770 card can work. :)
-
I updated my drivers from 14.4 to 14.9 on an HD 6950 with shaders unlocked (Windows 7 64 bit). My hashrates increased a little, but I stopped getting accepteds. I tried Wolf0s’ sgminer build, and cgminer 3.7.8. I even tried redownloading sgminer to start with a clean slate.
I am using GPU_MAX_ALLOC_PERCENT=100
GPU_USE_SYNC_OBJECTS=1
I tried reducing intensity, lowering -w, and some other stuff.
I uninstalled the 14.9 and reinstalled 13.12 drivers (I didn’t really love 14.4) and I get accepteds again.
Still at 16 Kh/s when newer cards get almost 10X that. On both scrypt and groestl, HD 6950s are good for about 1/2 what a 290X gets (450 Kh/s scrypt and 7.5 Mh/s groestl.) But it is much better than it was, so thanks Wolf0 and Ghostlander. Hopefully we’ll get up to 50-75 Kh/s soon on the older hardware.
-
Hi, so im using CGMiner 3.7.8 with 14.9 7970 with R9 280X bios: cgminer --neoscrypt -I 15 -g 2 -w 48 --gpu-memclock 1400 --gpu-engine 1000
Im getting 110Kh/s, Do you guys think this is about right? Or should i be getting more?
Cheers
-
Thats about right, depends on the memory type, some of my 7970s/280x wont go above 135 at 1125/1500 while other ramp all the way up to 160-170 khs