AI helps Pixeldrive cut photo file sizes without substantially affecting quality | Industry
Typical compression schemes strip data away from files, and when it comes to photos, the results are often splotches, color banding, and other unpleasant artifacts. Try this experiment: Take a snapshot with your smartphone and upload it to Facebook or Twitter. Download the uploaded (and now compressed) image from the web and compare it to the original. You’ll spot the differences pretty quickly.
But there’s a way to shrink pics without compromising their quality — or so claim Migel Tisserra, the former machine learning lead at oil and gas giant Santos, and Francis Doumet, a Stanford graduate with two successful businesses under his belt. They’re the cofounders of Pixeldrive, a web-based artificially intelligent (AI) tool that cuts photos down to as little as 10 percent of their original size.
Pixeldrive — development around which kicked off eight months ago ahead of a planned launch in October — expands on research by Google and others investigating the use of neural networks in image compression.
“We set out to solve this general problem that we thought wasn’t really being addressed,” Doumet told VentureBeat in a phone interview. “Everyone is interested in this space. With the demand for bandwidth and cloud storage increasing, the more we can compress files intelligently, the better.”
He’s right about the demand. From 2012 to 2020, the amount of data produced is forecast to exceed 40 zettabytes — the equivalent of 5,200GB of data for every person on earth, according to Digital Universe. That’s one reason the cloud storage market is forecast to grow from $30.70 billion in 2017 to $88.91 billion by 2022.
To put Pixeldrive’s compression technique in context, it’s worth breaking down the difference between lossless and lossy compression. At a high level, lossless algorithms work by splitting files into bite-sized “chunks” and arranging those chunks in a highly efficient manner. Lossy algorithms get rid of data — often color information and pixels. The former respects file integrity — when the files get decompressed, they’re recreated exactly.
Pixeldrive takes the lossy approach. Its model can achieve 20 times compression that bests the likes of JPEG, JPEG-2000, WebP, and BPG, Tissera and Doumet claim, with the superior peak signal-to-noise ratio (PSNR) and mean square error values (two error metrics used to compare image compression quality), with the added benefit of automatic noise reduction.
Photos make their way to Pixeldrive via one of two flows: an upload tool within a web-based dashboard built on Nextcloud, and integrations with third-party services like Dropbox. Plugins for Google Drive and game engine Unity are on the way, Doumet said.
“We’ve gotten a lot of requests from game developers looking to optimize their assets and textures,” he told me. “Many of them use to reduce [games’] initial install size.”
Under the hood lies a convolutional neural network built using Google’s open source TensorFlow framework that converts images to MLVX (short for “machine learning visual extension”), a custom file type created by Pixeldrive’s engineering team.
I’ve been using Pixeldrive for the past week, and it appeared to work as advertised in my tests. Most of the pics I uploaded took a second or two to process and ended up at about half the size — 47KB to 21KB in one instance; 163KB to 82KB in another; and 131KB to 67KB. (Doumet said that compression rates tend to increase as the size and resolution of the original image increases.) As far as image quality is concerned, my (admittedly untrained) eye had a tough time distinguishing between the original and compressed versions.
The results weren’t always perfect, though. Note the faded columns in the compressed image:
I stuck with Pixeldrive’s web app for the most part, which boasts a few basic sorting and filtering tools. By default, pictures appear in a list view indicating their size and the date they were uploaded. There’s built-in commenting and sharing — you can attach a note to a photo, generate a link to it, or quickly attach it to an email — and a manual tagging system to help keep things organized.
Also on tap is a profile builder with fields for social media accounts, phone numbers, and addresses. The idea is that eventually, once Pixeldrive launches publicly, users within groups — a family or a large enterprise, for example — will be able to share photos with each other via a contacts list.
A gallery view lets you view photos in a grid format, which I found handy. Other nice-to-haves include per-file version history (which shows how often a picture has been modified), a deleted photos folder, and a recent tab that surfaces pics uploaded since the last session.
Pixeldrive is strictly a web-based affair for now, but the team’s hard at work on desktop clients for Windows 10 and macOS. In the future, Tisserra and Doumet hope to decouple the platform from the Google Cloud — Pixeldrive’s current machine learning backend — and bring it to mobile devices. In fact, they said they’re already in talks with an unnamed phone manufacturer to integrate Pixeldrive’s AI into a photo-sharing app.
But hardware is the stumbling block. Even on cutting-edge handsets with dedicated inference chips, Pixeldrive’s AI model takes three times as long to compress images, Doumet said.
Here’s the dollars and cents of it: The first 100MB of storage on Pixeldrive is free, but you’ll have to pony up for any more than that. A Basic plan with 10GB of storage starts at $4.99 a month, the Advanced plan with 25GB is $7.99, and the Pro plan with 100GB is $9.99.
The economics won’t make sense for everyone — depending on your current storage solution, picking up a few physical hard drives might be the smarter move. But if you’re hellbent on a cloud solution, Pixeldrive might be worth a few of your hard-earned dollars.