gensim 4.4.0


pip install gensim

  Latest version

Released: Oct 18, 2025

Project Links

Meta
Author: Radim Rehurek
Requires Python: >=3.9

Classifiers

Development Status
  • 5 - Production/Stable

Environment
  • Console

Intended Audience
  • Science/Research

Operating System
  • OS Independent

Programming Language
  • Python :: 3.9
  • Python :: 3.10
  • Python :: 3.11
  • Python :: 3.12
  • Python :: 3.13
  • Python :: 3 :: Only

Topic
  • Scientific/Engineering :: Artificial Intelligence
  • Scientific/Engineering :: Information Analysis
  • Text Processing :: Linguistic

GA Wheel

Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.

Features

  • All algorithms are memory-independent w.r.t. the corpus size (can process input larger than RAM, streamed, out-of-core)

  • Intuitive interfaces

    • easy to plug in your own input corpus/datastream (simple streaming API)

    • easy to extend with other Vector Space algorithms (simple transformation API)

  • Efficient multicore implementations of popular algorithms, such as online Latent Semantic Analysis (LSA/LSI/SVD), Latent Dirichlet Allocation (LDA), Random Projections (RP), Hierarchical Dirichlet Process (HDP) or word2vec deep learning.

  • Distributed computing: can run Latent Semantic Analysis and Latent Dirichlet Allocation on a cluster of computers.

  • Extensive documentation and Jupyter Notebook tutorials.

If this feature list left you scratching your head, you can first read more about the Vector Space Model and unsupervised document analysis on Wikipedia.

Installation

This software depends on NumPy and Scipy, two Python packages for scientific computing. You must have them installed prior to installing gensim.

It is also recommended you install a fast BLAS library before installing NumPy. This is optional, but using an optimized BLAS such as MKL, ATLAS or OpenBLAS is known to improve performance by as much as an order of magnitude. On OSX, NumPy picks up its vecLib BLAS automatically, so you don’t need to do anything special.

Install the latest version of gensim:

pip install --upgrade gensim

Or, if you have instead downloaded and unzipped the source tar.gz package:

python setup.py install

For alternative modes of installation, see the documentation.

Gensim is being continuously tested under all supported Python versions. Support for Python 2.7 was dropped in gensim 4.0.0 – install gensim 3.8.3 if you must use Python 2.7.

How come gensim is so fast and memory efficient? Isn’t it pure Python, and isn’t Python slow and greedy?

Many scientific algorithms can be expressed in terms of large matrix operations (see the BLAS note above). Gensim taps into these low-level BLAS libraries, by means of its dependency on NumPy. So while gensim-the-top-level-code is pure Python, it actually executes highly optimized Fortran/C under the hood, including multithreading (if your BLAS is so configured).

Memory-wise, gensim makes heavy use of Python’s built-in generators and iterators for streamed data processing. Memory efficiency was one of gensim’s design goals, and is a central feature of gensim, rather than something bolted on as an afterthought.

Documentation

Citing gensim

When citing gensim in academic papers and theses, please use this BibTeX entry:

@inproceedings{rehurek_lrec,
      title = {{Software Framework for Topic Modelling with Large Corpora}},
      author = {Radim {\v R}eh{\r u}{\v r}ek and Petr Sojka},
      booktitle = {{Proceedings of the LREC 2010 Workshop on New
           Challenges for NLP Frameworks}},
      pages = {45--50},
      year = 2010,
      month = May,
      day = 22,
      publisher = {ELRA},
      address = {Valletta, Malta},
      language={English}
}

Gensim is open source software released under the GNU LGPLv2.1 license. Copyright (c) 2009-now Radim Rehurek

4.4.0 Oct 18, 2025
4.3.3 Jul 19, 2024
4.3.2 Aug 24, 2023
4.3.1 Mar 10, 2023
4.3.0 Dec 21, 2022
4.2.0 May 01, 2022
4.1.2 Sep 17, 2021
4.1.1 Sep 14, 2021
4.1.0 Aug 29, 2021
4.0.1 Apr 01, 2021
4.0.0 Mar 25, 2021
3.8.3 May 04, 2020
3.8.2 Apr 12, 2020
3.8.1 Sep 26, 2019
3.8.0 Jul 09, 2019
3.7.3 May 08, 2019
3.7.2 Apr 10, 2019
3.7.1 Jan 31, 2019
3.7.0 Jan 18, 2019
3.6.0 Sep 20, 2018
3.5.0 Jul 06, 2018
3.4.0 Mar 01, 2018
3.3.0 Feb 02, 2018
3.2.0 Dec 09, 2017
3.1.0 Nov 06, 2017
3.0.0 Sep 27, 2017
2.3.0 Jul 25, 2017
2.2.0 Jun 21, 2017
2.1.0 May 12, 2017
2.0.0 Apr 10, 2017
1.0.1 Mar 03, 2017
1.0.0 Feb 25, 2017
0.13.4 Dec 25, 2016
0.13.3 Oct 21, 2016
0.13.2 Aug 26, 2016
0.13.1 Jun 24, 2016
0.13.0 Jun 22, 2016
0.12.4 Jan 31, 2016
0.12.3 Nov 06, 2015
0.12.2 Sep 19, 2015
0.12.1 Jul 20, 2015
0.12.0 Jul 06, 2015
0.11.1 Apr 11, 2015
0.10.3 Nov 19, 2014
0.10.2 Sep 18, 2014
0.10.1 Jul 22, 2014
0.10.0 Jun 04, 2014
0.9.1 Apr 12, 2014
0.9.0 Mar 15, 2014
0.8.9 Dec 26, 2013
0.8.8 Nov 03, 2013
0.8.7 Sep 18, 2013
0.8.6 Sep 15, 2012
0.8.5 Jul 22, 2012
0.8.4 Mar 09, 2012
0.8.3 Dec 02, 2011
0.8.2 Oct 31, 2011
0.8.1 Oct 10, 2011
0.8.0 Jun 28, 2011
0.7.8 Mar 26, 2011
0.7.7 Feb 13, 2011
0.7.6 Jan 10, 2011
0.7.5 Nov 03, 2010
0.7.4 Sep 13, 2010
0.7.3 Sep 07, 2010
0.7.2 Sep 01, 2010
0.7.1 Aug 28, 2010
0.7.0 Aug 28, 2010
0.6.0 Jun 19, 2010
0.5.0 Apr 28, 2010
0.4.7 Apr 27, 2010
0.4.6 Apr 17, 2010
0.4.5 Apr 05, 2010
0.4.4 Mar 30, 2010
0.4.3 Mar 29, 2010
0.4.2 Mar 28, 2010
0.4.1 Mar 19, 2010
0.4 Mar 19, 2010
0.3.0 Mar 18, 2010
0.2 Mar 18, 2010
3.8.2.win32 Apr 12, 2020
3.8.2.win Apr 12, 2020
3.8.1.win32 Sep 26, 2019
3.8.1.win Sep 26, 2019
3.7.1.win32 Jan 31, 2019
3.7.1.win Jan 31, 2019
3.7.0.win32 Jan 18, 2019
3.7.0.win Jan 18, 2019
3.6.0.win32 Sep 20, 2018
3.6.0.win Sep 20, 2018
3.5.0.win32 Jul 06, 2018
3.5.0.win Jul 06, 2018
3.4.0.win32 Mar 01, 2018
3.4.0.win Mar 01, 2018
3.3.0.win32 Feb 02, 2018
3.3.0.win Feb 02, 2018
3.2.0.win32 Dec 09, 2017
3.2.0.win Dec 09, 2017
3.0.0.win32 Sep 27, 2017
3.0.0.win Sep 27, 2017
2.3.0.win32 Jul 25, 2017
2.3.0.win Jul 25, 2017
2.2.0.win32 Jun 21, 2017
2.2.0.win Jun 21, 2017
2.1.0.win32 May 12, 2017
2.1.0.win May 12, 2017
2.0.0.win32 Apr 11, 2017
2.0.0.win Apr 11, 2017
1.0.1.win32 Mar 03, 2017
1.0.1.win Mar 03, 2017
1.0.0.win32 Feb 25, 2017
1.0.0.win Feb 25, 2017
0.13.4.win32 Dec 25, 2016
0.13.4.win Dec 25, 2016
0.13.3.win32 Oct 21, 2016
0.13.3.win Oct 21, 2016
0.13.2.win32 Aug 26, 2016
0.13.2.win Aug 26, 2016
0.13.1.win32 Jun 24, 2016
0.13.1.win Jun 24, 2016
0.12.4.win32 Jan 31, 2016
0.12.4.win Jan 31, 2016

Wheel compatibility matrix

Platform CPython 3.9 CPython 3.10 CPython 3.11 CPython 3.12 CPython 3.13
macosx_10_13_x86_64
macosx_10_9_x86_64
macosx_11_0_arm64
manylinux_2_24_aarch64
manylinux_2_24_x86_64
manylinux_2_28_aarch64
manylinux_2_28_x86_64
win_amd64

Files in release

gensim-4.4.0-cp310-cp310-macosx_10_9_x86_64.whl (23.3MiB)
gensim-4.4.0-cp310-cp310-macosx_11_0_arm64.whl (23.3MiB)
gensim-4.4.0-cp310-cp310-manylinux_2_24_aarch64.manylinux_2_28_aarch64.whl (26.3MiB)
gensim-4.4.0-cp310-cp310-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (26.4MiB)
gensim-4.4.0-cp310-cp310-win_amd64.whl (23.3MiB)
gensim-4.4.0-cp311-cp311-macosx_10_9_x86_64.whl (23.3MiB)
gensim-4.4.0-cp311-cp311-macosx_11_0_arm64.whl (23.3MiB)
gensim-4.4.0-cp311-cp311-manylinux_2_24_aarch64.manylinux_2_28_aarch64.whl (26.5MiB)
gensim-4.4.0-cp311-cp311-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (26.5MiB)
gensim-4.4.0-cp311-cp311-win_amd64.whl (23.3MiB)
gensim-4.4.0-cp312-cp312-macosx_10_13_x86_64.whl (23.3MiB)
gensim-4.4.0-cp312-cp312-macosx_11_0_arm64.whl (23.3MiB)
gensim-4.4.0-cp312-cp312-manylinux_2_24_aarch64.manylinux_2_28_aarch64.whl (26.5MiB)
gensim-4.4.0-cp312-cp312-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (26.6MiB)
gensim-4.4.0-cp312-cp312-win_amd64.whl (23.3MiB)
gensim-4.4.0-cp313-cp313-macosx_10_13_x86_64.whl (23.3MiB)
gensim-4.4.0-cp313-cp313-macosx_11_0_arm64.whl (23.3MiB)
gensim-4.4.0-cp313-cp313-manylinux_2_24_aarch64.manylinux_2_28_aarch64.whl (26.5MiB)
gensim-4.4.0-cp313-cp313-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (26.5MiB)
gensim-4.4.0-cp313-cp313-win_amd64.whl (23.3MiB)
gensim-4.4.0-cp39-cp39-macosx_10_9_x86_64.whl (23.3MiB)
gensim-4.4.0-cp39-cp39-macosx_11_0_arm64.whl (23.3MiB)
gensim-4.4.0-cp39-cp39-manylinux_2_24_aarch64.manylinux_2_28_aarch64.whl (26.3MiB)
gensim-4.4.0-cp39-cp39-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (26.3MiB)
gensim-4.4.0-cp39-cp39-win_amd64.whl (23.3MiB)
gensim-4.4.0.tar.gz (22.2MiB)