mirror of synced 2024-05-07 22:42:19 +12:00

Go to file

nagadomi d818923f33 Revert anime_style_art_rgb/scale2.0x_model.t7		2016-03-27 20:20:17 +09:00
appendix	Add benchmark results	2016-03-17 17:40:41 +09:00
assets	Add support for level 3 noise reduction on web	2016-03-27 17:35:34 +09:00
cache	first commit	2015-05-16 14:48:05 +09:00
data	Fix .gitignore	2015-11-15 12:34:37 +09:00
images	Update supplementary material	2015-11-15 09:58:49 +09:00
lib	Change default validation_crops(160)	2016-03-22 10:19:52 +09:00
models	Revert anime_style_art_rgb/scale2.0x_model.t7	2016-03-27 20:20:17 +09:00
tools	Remove unused attributes in json	2016-03-27 17:37:38 +09:00
webgen	Add support for level 3 noise reduction on web	2016-03-27 17:35:34 +09:00
.gitattributes	Add .gitattributes	2015-11-12 03:39:40 +09:00
.gitignore	Update .gitignore	2015-11-22 13:10:26 +09:00
convert_data.lua	Fix undefined variable in convert_data.lua	2015-11-16 13:44:49 +09:00
LICENSE	add LICENSE and NOTICE	2015-05-17 17:26:53 +09:00
NOTICE	Update NOTICE	2016-02-07 09:56:02 +09:00
README.md	Update README	2016-03-27 19:08:58 +09:00
train.lua	Optionalize downsampling filters	2016-03-17 17:58:37 +09:00
train.sh	Update training script	2016-03-27 19:05:57 +09:00
train_photo.sh	Update training script	2016-03-27 19:05:57 +09:00
train_ukbench.sh	Change the jpeg config for the photo model	2015-11-15 09:36:40 +09:00
waifu2x.lua	Merge branch 'master' of github.com:nagadomi/waifu2x into dev	2016-03-21 03:59:17 +09:00
web.lua	Add support for level 3 noise reduction on web	2016-03-27 17:35:34 +09:00

README.md

waifu2x

Image Super-Resolution for Anime-style art using Deep Convolutional Neural Networks. And it supports photo.

Demo-Application can be found at http://waifu2x.udp.jp/ .

Summary

Click to see the slide show.

References

waifu2x is inspired by SRCNN [1]. 2D character picture (HatsuneMiku) is licensed under CC BY-NC by piapro [2].

[1] Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, "Image Super-Resolution Using Deep Convolutional Networks", http://arxiv.org/abs/1501.00092
[2] "For Creators", http://piapro.net/en_for_creators.html

Public AMI

Region: us-east-1 (N.Virginia)
AMI ID: ami-568f823c
AMI NAME: waifu2x-server
Instance Type: g2.2xlarge
OS: Ubuntu 14.04
User: ubuntu
Created at: 2016-03-22

See ~/README.md

Please update the git repo first.

git pull

Third Party Software

Third-Party

If you are a windows user, I recommend you to use waifu2x-caffe(Just download from releases tab) or waifu2x-conver-cpp.

Dependencies

Hardware

NVIDIA GPU

Platform

LuaRocks packages (excludes torch7's default packages)

lua-csnappy
md5
uuid
turbo

Installation

Setting Up the Command Line Tool Environment

(on Ubuntu 14.04)

Install CUDA

See: NVIDIA CUDA Getting Started Guide for Linux

Download CUDA

sudo dpkg -i cuda-repo-ubuntu1404_7.5-18_amd64.deb
sudo apt-get update
sudo apt-get install cuda

Install Package

sudo apt-get install libsnappy-dev
sudo apt-get install libgraphicsmagick-dev

Install Torch7

See: Getting started with Torch

And install luarocks packages.

luarocks install graphicsmagick # upgrade
luarocks install lua-csnappy
luarocks install md5
luarocks install uuid
PREFIX=$HOME/torch/install luarocks install turbo # if you need to use web application

Getting waifu2x

git clone --depth 1 https://github.com/nagadomi/waifu2x.git

Validation

Testing the waifu2x command line tool.

th waifu2x.lua

Web Application

th web.lua

View at: http://localhost:8812/

Command line tools

Noise Reduction

th waifu2x.lua -m noise -noise_level 1 -i input_image.png -o output_image.png

th waifu2x.lua -m noise -noise_level 2 -i input_image.png -o output_image.png
th waifu2x.lua -m noise -noise_level 3 -i input_image.png -o output_image.png

2x Upscaling

th waifu2x.lua -m scale -i input_image.png -o output_image.png

Noise Reduction + 2x Upscaling

th waifu2x.lua -m noise_scale -noise_level 1 -i input_image.png -o output_image.png

th waifu2x.lua -m noise_scale -noise_level 2 -i input_image.png -o output_image.png
th waifu2x.lua -m noise_scale -noise_level 3 -i input_image.png -o output_image.png

Batch conversion

find /path/to/imagedir -name "*.png" -o -name "*.jpg" > image_list.txt
th waifu2x.lua -m scale -l ./image_list.txt -o /path/to/outputdir/prefix_%d.png

Using photo model

Please add -model_dir models/photo to command line option, if you want to use photo model. For example,

th waifu2x.lua -model_dir models/photo -m scale -i input_image.png -o output_image.png

Video Encoding

* avconv is alias of ffmpeg on Ubuntu 14.04.

Extracting images and audio from a video. (range: 00:09:00 ~ 00:12:00)

mkdir frames
avconv -i data/raw.avi -ss 00:09:00 -t 00:03:00 -r 24 -f image2 frames/%06d.png
avconv -i data/raw.avi -ss 00:09:00 -t 00:03:00 audio.mp3

Generating a image list.

find ./frames -name "*.png" |sort > data/frame.txt

waifu2x (for example, noise reduction)

mkdir new_frames
th waifu2x.lua -m noise -noise_level 1 -resume 1 -l data/frame.txt -o new_frames/%d.png

Generating a video from waifu2xed images and audio.

avconv -f image2 -r 24 -i new_frames/%d.png -i audio.mp3 -r 24 -vcodec libx264 -crf 16 video.mp4

Train Your Own Model

Notes: If you have cuDNN library, you can use cudnn kernel with -backend cudnn option. And you can convert trained cudnn model to cunn model with tools/cudnn2cunn.lua.

Data Preparation

Genrating a file list.

find /path/to/image/dir -name "*.png" > data/image_list.txt

You should use noise free images. In my case, waifu2x is trained with 6000 high-resolution-noise-free-PNG images.

Converting training data.

th convert_data.lua

Train a Noise Reduction(level1) model

mkdir models/my_model
th train.lua -model_dir models/my_model -method noise -noise_level 1 -test images/miku_noisy.png
th cleanup_model.lua -model models/my_model/noise1_model.t7 -oformat ascii
# usage
th waifu2x.lua -model_dir models/my_model -m noise -noise_level 1 -i images/miku_noisy.png -o output.png

You can check the performance of model with models/my_model/noise1_best.png.

Train a Noise Reduction(level2) model

th train.lua -model_dir models/my_model -method noise -noise_level 2 -test images/miku_noisy.png
th cleanup_model.lua -model models/my_model/noise2_model.t7 -oformat ascii
# usage
th waifu2x.lua -model_dir models/my_model -m noise -noise_level 2 -i images/miku_noisy.png -o output.png

You can check the performance of model with models/my_model/noise2_best.png.

Train a 2x UpScaling model

th train.lua -model_dir models/my_model -method scale -scale 2 -test images/miku_small.png
th cleanup_model.lua -model models/my_model/scale2.0x_model.t7 -oformat ascii
# usage
th waifu2x.lua -model_dir models/my_model -m scale -scale 2 -i images/miku_small.png -o output.png

You can check the performance of model with models/my_model/scale2.0x_best.png.