Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <2118da9a8ecaa0474c6db8176889c858@smtp.hushmail.com>
Date: Tue, 6 Oct 2015 19:42:24 +0200
From: magnum <john.magnum@...hmail.com>
To: john-users@...ts.openwall.com
Subject: Re: nVidia Maxwell support (especially descrypt)?

On 2014-09-11 10:57, Solar Designer wrote:
> On Wed, Sep 10, 2014 at 11:02:17PM -0800, Royce Williams wrote:
>> Because of the $150 price and relatively low power requirements (~60W,
>> no extra power connector needed) of the new nvidia Maxwell (GTX 750
>> and 750 Ti) cards, and my own interest in descrypt, I'm interested in
>> seeing JtR take advantage of Maxwell if feasible.
>>
>> Steve Thomas did a Passwords14 presentation on bitslice DES with
>> LOP3.LUT (23 min video):
>>
>> http://www.irongeek.com/i.php?page=videos/passwordscon2014/bitslice-des-with-lop3lut-steve-thomas
>
> Yes, I watched a video of Steve's talk a while ago.  He brought the
> topic up fine, but other than that I was disappointed: he had no results
> yet at least as of the time of this talk.  Steve's best average gate
> count per S-box by using LOP3.LUT (36.25 gates) was still worse than
> Roman's result from 2011 for plain bitselect(), which is 32.875 gates (...)

Solar, Sayantan,

Have you seen this work by Janet Yellen? I can't recall it mentioned here.

https://github.com/DeepLearningJohnDoe/merikens-tripcode-engine/tree/master
https://devtalk.nvidia.com/default/topic/860120/cuda-programming-and-performance/bitslice-des-optimization/post/4622827/#4622827

"Gate counts: 25 24 25 18 25 24 24 23 (avg. 23.5)
  Depth: 8 7 7 6 8 10 10 8 (avg. 8)"

"With this version, I get a performance of 950 MH/s for UNIX DES 
crypt(3) (or equivalently 23750 MH/s for 1 round of DES) on my reference 
Gigabyte GTX 980 Ti (+270 MHz). Considering hashcat's implementation 
gets 165.5 MH/s on a GTX Titan X (+225 MHz), it's a great improvement. 
Even my naive implementation bounded by shared memory/synchronization 
with old SBOXes from JtR is faster (300 MH/s on 980 Ti +300 MHz)."

magnum

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.