All Projects → faisalthaheem → tensorflow-windows

faisalthaheem / tensorflow-windows

Licence: MIT License
TensorFlow builds compiled on windows with avx and avx2 extensions

Projects that are alternatives of or similar to tensorflow-windows

ClothTOP
GPU-accelerated Cloth TOP node for TouchDesigner using the NVIDIA Flex physics solver.
Stars: ✭ 33 (+65%)
Mutual labels:  cuda
cresset
Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
Stars: ✭ 573 (+2765%)
Mutual labels:  cuda
lbvh
an implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (+235%)
Mutual labels:  cuda
cuda-toolkit
GitHub Action to install CUDA
Stars: ✭ 34 (+70%)
Mutual labels:  cuda
PbfVs
Implementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (+400%)
Mutual labels:  cuda
MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+1990%)
Mutual labels:  cuda
Fat-Clouds
GPU Fluid Simulation with Volumetric Rendering
Stars: ✭ 81 (+305%)
Mutual labels:  cuda
QPT
[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (+1440%)
Mutual labels:  cuda
warp
continuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (+35%)
Mutual labels:  cuda
ThrustRTC
CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.
Stars: ✭ 41 (+105%)
Mutual labels:  cuda
Jampack
Experimental parallel compression algorithm
Stars: ✭ 21 (+5%)
Mutual labels:  cuda
bazel.cmake
bazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (+90%)
Mutual labels:  cuda
mini-nbody
A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Stars: ✭ 73 (+265%)
Mutual labels:  cuda
SoliditySHA3Miner
All-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (+40%)
Mutual labels:  cuda
octotiger
Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
Stars: ✭ 30 (+50%)
Mutual labels:  cuda
Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (+360%)
Mutual labels:  cuda
dynamic-occupancy-grid-map
Implementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
Stars: ✭ 89 (+345%)
Mutual labels:  cuda
CudaSHA256
Simple tool to calculate sha256 on GPU using Cuda
Stars: ✭ 38 (+90%)
Mutual labels:  cuda
revisiting-sepconv
an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Stars: ✭ 43 (+115%)
Mutual labels:  cuda
disptools
Generate displacement fields with known volume changes
Stars: ✭ 17 (-15%)
Mutual labels:  cuda

TensorFlow windows builds with AVX/AVX2 extensions

Checkout the releases section for the python wheels.

TF Version Python Version Instruction set GPU Enabled Link
1.8.0 3.5.3 AVX2 tensorflow-1.8.0-cp35-cp35m-win_amd64.whl
1.7.1 3.5.3 AVX2 Yes tensorflow_gpu-1.7.1-cp35-cp35m-win_amd64-gpu.whl
1.7.1 3.5.3 AVX2 tensorflow-1.7.1-cp35-cp35m-win_amd64.whl
1.7.0 3.5.3 AVX2 tensorflow-1.7.0-cp35-cp35m-win_amd64.whl
1.7.0 3.5.3 AVX2 Yes tensorflow_gpu-1.7.0-cp35-cp35m-win_amd64-avx2-cuda9.whl
1.6.0 3.5.3 AVX2 tensorflow-1.6.0-cp35-cp35m-win_amd64-avx2.whl
1.5.1 3.5.3 AVX2 tensorflow-1.5.1-cp35-cp35m-win_amd64-avx2.whl
1.4.0 3.5.3 AVX2 tensorflow-1.4.0-cp35-cp35m-win_amd64.whl
1.4.0 3.5.3 AVX tensorflow-1.4.0-cp35-cp35m-win_amd64-avx.whl

Build Environment

Environment Version
OS Windows 10 x64
Visual Sutdio 2017 Community Edition for CPU only builds & 2015 Community Edition for CUDA enabled builds
CMAKE 3.9.0
SWIG swigwin-3.0.12
Python 3.5.3

Followed official guide at https://github.com/tensorflow/tensorflow/blob/master/tensorflow/contrib/cmake/README.md

Build Commands

CMAKE Configuration Command for AVX/AVX2 non CUDA Builds using VS 2017

cmake.exe .. ^
-A x64 ^
-G "Visual Studio 15 2017" ^
-DSWIG_EXECUTABLE="c:/swigwin-3.0.12/swig.exe" ^
-DPYTHON_EXECUTABLE="c:/py/35/python.exe" ^
-DCMAKE_BUILD_TYPE=Release ^
-DPYTHON_LIBRARIES="c:/py/35/libs/python35.lib" ^
-Dtensorflow_BUILD_PYTHON_TESTS=OFF ^
-Dtensorflow_BUILD_CC_TESTS=OFF ^
-Dtensorflow_TF_NIGHTLY=OFF ^
-Dtensorflow_WIN_CPU_SIMD_OPTIONS=/arch:AVX2

Or when using CUDA, use the following command to build using toolchain 140 with VS 2015

cmake.exe .. ^
-A x64 ^
-G "Visual Studio 14 2015" -T v140 ^
-DSWIG_EXECUTABLE="C:/tools/swigwin-3.0.12/swig.exe" ^
-DPYTHON_EXECUTABLE="C:/Program Files/Python35/python.exe" ^
-DCMAKE_BUILD_TYPE=Release ^
-DPYTHON_LIBRARIES="C:/Program Files/Python35/libs/python35.lib" ^
-Dtensorflow_BUILD_PYTHON_TESTS=OFF ^
-Dtensorflow_BUILD_CC_TESTS=OFF ^
-Dtensorflow_TF_NIGHTLY=OFF ^
-Dtensorflow_WIN_CPU_SIMD_OPTIONS=/arch:AVX2 ^
-Dtensorflow_ENABLE_GPU=ON ^
-DCUDNN_HOME="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v9.0"

Build with following command using "Developer Command Prompt for VS 2017" or "Developer Command Prompt for VS 2015"

msbuild.exe /p:Configuration=Release /maxcpucount:1 /verbosity:minimal tf_python_build_pip_package.vcxproj

Make sure 64bit build tools are used, rename "Hostx64" to "Hostx86" at following default install location for VS 2017

C:\Program Files (x86)\Microsoft Visual Studio\2017\Community\VC\Tools\MSVC\14.11.25503\bin

and for VS 2015 rename "amd64" to "x86_amd64" at the following default install location

C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\bin
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].