supercomputing – Page 33 – Lifeboat News: The Blog

12.25.
2016

Jülich Installs New QPACE3 Supercomputer for Quantum Chromodynamics

qpace A new supercomputer has been deployed at the Jülich Supercomputing Center (JSC) in Germany. Called QPACE3, the new 447 Teraflop machine is named for “QCD Parallel Computing on the Cell.”

QPACE3 is being used by the University of Regensburg for a joint research project with the University of Wuppertal and the Jülich Supercomputing Center for numerical simulations of quantum chromodynamics (QCD), which is one of the fundamental theories of elementary particle physics. Such simulations serve, among other things, to understand the state of the universe shortly after the Big Bang, for which a very high computing power is required.

The demand for high performance computers to solve complex applications has risen exponentially, but unfortunately so has their consumption of power. Many supercomputers require more than a megawatt of electricity to operate and annual electricity costs can easily run into millions of Euros. The energy supply is therefore a significant part of the operating costs of a data center. According to recent analyst studies, this represents the second-largest factor in addition to personnel and maintenance costs. The upcoming boom with (3D) video streaming, augmented reality, image recognition and artificial intelligence is driving up the demand for data center capabilities, thereby placing new challenges in the power supply sector.

12.16.
2016

Does The Universe Have a Hard Drive?

Zura Kakushadze is lead author of this peer reviewed paper published by the Free University of Tbilisi. It describes an information paradox that arises in a materialist’s description of the Universe—if we assume that the Universe is 100% quantum. The observation of the paradox stems from an interdisciplinary thought process whereby the Universe can be viewed as a “quantum computer”.

The presentation is intentionally nontechnical to make it accessible to a wide a readership.

12.15.
2016

MSFT Stock: Is This Microsoft Corporation’s (MSFT) Next Big Play?

Nice write up and references the Cognitive Toolkit that was leveraged on Skype, Xbox, etc. Also, a nice plug on the QC work.

“Only Cray can bring the combination of supercomputing technologies, supercomputing best practices, and expertise in performance optimization to scale deep learning problems,” said Dr. Mark S. Staveley, Cray’s director of deep learning and machine learning. “We are working to unlock possibilities around new approaches and model sizes, turning the dreams and theories of scientists into something real that they can explore. Our collaboration with Microsoft and CSCS is a game changer for what can be accomplished using deep learning.”

Also Read: Ignore The Financials, MSFT Stock Is Headed Higher : Microsoft Corporation (NASDAQ: MSFT)

“Cray’s proficiency in performance analysis and profiling, combined with the unique architecture of the XC systems, allowed us to bring deep learning problems to our [system] and scale them in a way that nobody else has,” added Prof. Dr. Thomas C. Schulthess, director of the Swiss National Supercomputing Centre (CSCS). “What is most exciting is that our researchers and scientists will now be able to use our existing Cray XC supercomputer to take on a new class of deep learning problems that were previously infeasible.”

12.14.
2016

IBM’s Watson supercomputer discovers 5 new genes linked to ALS

IBM Watson is known for its work in identifying cancer treatments and beating contestants on Jeopardy! But now the computing system has expertise in a new area of research: neuroscience.

Watson discovered five genes linked to ALS, sometimes called Lou Gehrig’s disease, IBM announced on Wednesday. The tech company worked with researchers at the Barrow Neurological Institute in Phoenix, Arizona. The discovery is Watson’s first in any type of neuroscience, and suggests that Watson could make discoveries in research of other neurological diseases.

12.11.
2016

Proof that Moore’s Law has been replaced by a Virtual Moore’s Law that is Accelerating and Bringing the Singularity With It

Introduction

Moore’s Law says that the number of transistors per square inch will double approximately every 18 months. This article will show how many technologies are providing us with a new Virtual Moore’s Law that proves computer performance will at least double every 18 months for the foreseeable future thanks to many new technological developments.

This Virtual Moore’s Law is propelling us towards the Singularity where the invention of artificial superintelligence will abruptly trigger runaway technological growth, resulting in unfathomable changes to human civilization.

Going Vertical

In the first of my “proof” articles two years ago, I described how it has become harder to miniaturize transistors, causing computing to go vertical instead. 2 years ago, Samsung was mass producing 24-layer 3D NAND chips and had announced 32-layer chips. As I write this, Samsung is mass producing 48-layer 3D NAND chips with 64-layer chips rumored to appear within a month or so. Even more importantly, it is expected that by the end of 2017, the majority of NAND chips produced by all companies will be 3D. Currently Samsung and its competitors are working 24/7 to transform their 2D factories to 3D factories causing a dramatic change in how NAND flash chips are created.

Cross section of 48-layer 3D NAND chip (Learn more!)

Going Massively Parallel

Moore’s law only talks about the number of transistors per square inch. It doesn’t directly mean that a chip will run any faster. Unfortunately, since 2006 Intel’s CPUs have dramatically slowed their increase in performance, averaging about 10% a year. The cause of this problem is that the average Intel CPU only has 2 to 4 cores and it has become difficult to speed up these cores.

Nvidia has been promoting a different architecture, which averages thousands of cores. In 2016, they had a HUUGE success with this idea causing their company to soar in value with the market capitalization of Nvidia reaching about one third of Intel’s value. (This is impressive as Intel is a very profitable company with profits exceeding $15 billion in 2015.)

titan-x-pascal

Titan X Pascal is 566% faster at AI than last year’s model!

Exactly what did Nvidia achieve? Their new Titan X runs AI instructions 566% faster than the old Titan X card that was only released a year earlier. Also, Nvidia got a half-size version of their Drive PX 2 put in all Tesla cars and this chip is about 4000% faster than the chip it replaced. Finally, Nvidia is working on a successor to the Drive PX 2 chip called Xavier that is rumored to come out in about a year and to be at least 400% more energy efficient than the current chip.

These numbers of 566%, 4000%, and 400% are much bigger than Intel’s 10% and are causing a fundamental change in how computing is done. It is worth noting that Nvidia’s main competitor AMD has also been blowing past Intel’s 10% annual performance gains so the idea of many cores has been proven by multiple companies. In fact, even the graphical part of Intel’s chips has been blowing past this 10% performance gain per year.

Thousands of programs have been designed to take advantage of this parallel computing performance. For example, the program BlazingDB runs over 100 times as fast as MySQL which was only designed to run on CPUs. As the performance gap increases between a standard CPU with a handful of cores and a GPU with thousands of cores, more and more programs are being written to take advantage of GPUs. And the growing market for massively parallel chips means that Nvidia can now afford to spend lots of money in making their chips better. For example, their latest generation of chips called Pascal cost over $2 billion to develop. (All this money goes into making a better chip design, Nvidia doesn’t actually build chips. They currently use TSMC and Samsung for that.)

Lightning-Fast Data

For a long time, data for programs was stored on slow hard drives. Then it was moved to SATA SSDs which rapidly sped up each year until they finally hit the bandwidth limits of the SATA standard. Now data is moving to PCIe SSDs that currently have 6 times the bandwidth of SATA drives with even faster PCIe drives planned. (A PCIe drive that used 16 lanes like a graphics card would have 4 times the bandwidth of current PCIe drives.) Both Intel’s coming Optane 3D XPoint SSDs and Samsung’s Z-NAND SSDs are examples of such faster PCIe drives and a handful of enterprise SSDs already exist that use 16 lanes.

Even faster than all these drives is the idea of storing everything in memory which is becoming more and more common. When Watson won at Jeopardy in 2011, it used the trick of using its 16 terabytes of RAM to store everything in memory instead of using its drives during the competition. Today Samsung sells 2.5D memory cards that hold 128GB each.

Intel’s Xeons with the highest memory capacity can handle 3 terabytes of memory per chip and motherboards are being sold that can hold 3 terabytes of Samsung’s 2.5D memory. 2.5D means that four layers of chips are “soldered” on top of each other. (Cheaper non-Xeon systems now hold as much as 128GB 2D memory which is pretty good for a home computer.)

Computers Programming Computers

computer-programming

Nvidia’s CEO Jen-Hsun Huang said, “AI is going to increase in capability faster than Moore’s Law. I believe it’s a kind of a hyper Moore’s Law phenomenon because it has the benefit of continuous learning. It has the benefit of large-scale networked continuous learning. Today, we roll out a new software package, fix bugs, update it once a year. That rhythm is going to change. Software will learn from experience much more quickly. Once one smart piece of software on one device learns something, then you can over-the-air (OTA) it across the board. All of a sudden, everything gets smarter.”

Computers Designing Chips

Since the mid-1970s, programs have been used to design chips as chips have become too complicated for any team of humans to handle. (Nvidia’s Tesla P100 GPUs have 150 billion transistors when you include the memory “soldered” to the top of them!)

A quantum leap in chip design may happen in the near future as Nvidia recently built a supercomputer for internal research out of mainly Nvidia Tesla P100 GPUs. This supercomputer was ranked 28 out of all computers in the world. What will this computer be used for?

Nvidia said, “We’re also training neural networks to understand chipset design and very-large-scale-integration, so our engineers can work more quickly and efficiently. Yes, we’re using GPUs to help us design GPUs.”

This is a very interesting area to watch as today’s chips are so complicated that they are likely very inefficient with massive speedups being available if we could find a better way to optimize them. An example of the gains possible is that Nvidia got about a 50% performance increase between its Kepler and Maxwell generations despite both microarchitectures using the same 28nm technology.

Conclusion

The new Virtual Moore’s Law is already having a massive effect. Jen-Hsun said, “By collaborating with AI developers, we continued to improve our GPU designs, system architecture, compilers, and algorithms, and sped up training deep neural networks by 50x in just three years — a much faster pace than Moore’s Law.”

With chips going vertical, chip architectures going massively parallel, lightning-fast data, computers programming computers, and computers designing chips, the Singularity is closer than you think!

Technical Note for Geeks

Here is what I actually meant by “soldered”:

Conventional chip packages interconnect die stacks using wire bonding, whereas in TSV packages, the chip dies are ground down to a few dozen micrometers, pierced with hundreds of fine holes and vertically connected by electrodes passing through the holes, allowing for a significant boost in signal transmission. TSV stands for Through-Silicon Vias.

11.28.
2016

MIT’s new method of radio transmission could one day make wireless VR a reality

If you want to use one of today’s major VR headsets, whether the Oculus Rift, the HTC Vive, or the PS VR, you have to accept the fact that there will be an illusion-shattering cable that tethers you to the small supercomputer that’s powering your virtual world.

But researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) may have a solution in MoVr, a wireless virtual reality system. Instead of using Wi-Fi or Bluetooth to transmit data, the research team’s MoVR system uses high-frequency millimeter wave radio to stream data from a computer to a headset wirelessly at dramatically faster speeds than traditional technology.

There have been a variety of approaches to solving this problem already. Smartphone-based headsets such as Google’s Daydream View and Samsung’s Gear VR allow for untethered VR by simply offloading the computational work directly to a phone inside the headset. Or the entire idea of VR backpacks, which allow for a more mobile VR experience by building a computer that’s more easily carried. But there are still a lot of limitations to either of these solutions.

11.25.
2016

Why the US Is Losing Ground on the Next Generation of Powerful Supercomputers

“I feel the need — the need for speed.”

The tagline from the 1980s movie Top Gun could be seen as the mantra for the high-performance computing system world these days. The next milestone in the endless race to build faster and faster machines has become embodied in standing up the first exascale supercomputer.

Exascale might sound like an alternative universe in a science fiction movie, and judging by all the hype, one could be forgiven for thinking that an exascale supercomputer might be capable of opening up wormholes in the multiverse (if you subscribe to that particular cosmological theory). In reality, exascale computing is at once more prosaic — a really, really fast computer — and packs the potential to change how we simulate, model and predict life, the universe and pretty much everything.

11.19.
2016

Liquid silicon: Computer chips could bridge gap between computation and storage

Computer chips in development at the University of Wisconsin–Madison could make future computers more efficient and powerful by combining tasks usually kept separate by design.

Jing Li, an assistant professor of electrical and computer engineering at UW–Madison, is creating computer chips that can be configured to perform complex calculations and store massive amounts of information within the same integrated unit — and communicate efficiently with other chips. She calls them “liquid silicon.”

“Liquid means software and silicon means hardware. It is a collaborative software/hardware technique,” says Li. “You can have a supercomputer in a box if you want. We want to target a lot of very interesting and data-intensive applications, including facial or voice recognition, natural language processing, and graph analytics.”

11.07.
2016

Optical laser computing Could Power Up Genomics and AI and Optalysys targets one petaflop next year

https://youtube.com/watch?v=KPFnmGRZ8GQ

Optalysys’s technology performs a mathematical function called the Fourier transform by encoding data, say a genome sequence, into a laser beam. The data can be manipulated by making light waves in the beam interfere with one another, performing the calculation by exploiting the physics of light, and generating a pattern that encodes the result. The pattern is read by a camera sensor and fed back into a conventional computer’s electronic circuits. The optical approach is faster because it achieves in a single step what would take many operations of an electronic computer.

The technology was enabled by the consumer electronics industry driving down the cost of components called spatial light modulators, which are used to control light inside projectors. The company plans to release its first product next year, aimed at high-performance computers used for processing genomic data. It will take the form of a PCI express card, a standard component used to upgrade PCs or servers usually used for graphics processors. Optalysys is also working on a Pentagon research project investigating technologies that might shrink supercomputers to desktop size, and a European project on improving weather simulations.

In 2015, Optalysis built a prototype that achieves a processing speed equivalent to 320 Gflops and it is incredibly energy efficient as it uses low-powered, cost effective components.

11.03.
2016

Scientists May Have Identified the Particles That Make Up Dark Matter

In Brief:

Using an advanced supercomputer, scientists came up with a profile for dark matter, concluding that it may be made of axions of a specific type.
With this new information, the race is on to be the first to prove the existence of dark matter particles.

Understanding what dark matter is has proven to be amazingly difficult. Of course, one might expect this from a thing that is, for all intents and purposes, entirely invisible. Scientists have come to the conclusion that dark matter exists by observing the way gravity behaves—either our model of gravity is in need of an update, or dark matter exists. The latter is the most likely conclusion.