All Projects → douban → PyCharlockHolmes

douban / PyCharlockHolmes

Licence: BSD-3-Clause license
Character encoding detecting library for Python using ICU and libmagic.

Programming Languages

common lisp
692 projects
python
139335 projects - #7 most used programming language
c
50402 projects - #5 most used programming language

Charlock Holmes

Build Status

Character encoding detecting library for Python using ICU and libmagic. Inspired by Charlock Holmes

Dependency

  1. icu
  2. file(libmagic)

Gentoo

emerge -av dev-libs/icu
emerge -av sys-apps/file

Ubuntu

apt-get install libicu-dev
apt-get install libmagic-dev

Brew

brew install icu4c
brew install libmagic
export ICUI18N="/usr/local/Cellar/icu4c/xx" # Replace "xx" as the version of your icu
export MAGIC="/usr/local/Cellar/libmagic/xx" # Replace "xx" as the version of your libmagic

Install

python setup build
python setup install

Usage

from charlockholmes import detect
file = open('test.txt')
content = file.read()
print detect(content)

CHANGELOG

  • 0.0.3
    • Add support for python 3.X

License

Modified BSD License

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].