Space complexity

From HandWiki
Short description: Computer memory needed by an algorithm

The space complexity of an algorithm or a data structure is the amount of memory space required to solve an instance of the computational problem as a function of characteristics of the input. It is the memory required by an algorithm until it executes completely.[1] This includes the memory space used by its inputs, called input space, and any other (auxiliary) memory it uses during execution, which is called auxiliary space.
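The difference between input space and auxiliary space can be illustrated with a minimal sketch. The two function names below are hypothetical and chosen only for this example. Both functions receive the same n-element input, but the first allocates a second list of the same length, while the second needs only two index variables. This assumes a word-RAM view in which an index fits in one machine word; counted in bits, each index takes O(log n) space.

```python
def reversed_copy(items):
    """Return a reversed copy: Theta(n) auxiliary space for the new list."""
    return [items[i] for i in range(len(items) - 1, -1, -1)]


def reverse_in_place(items):
    """Reverse the list itself: O(1) auxiliary words (two index variables)."""
    lo, hi = 0, len(items) - 1
    while lo < hi:
        # Swap the outermost pair that has not been processed yet.
        items[lo], items[hi] = items[hi], items[lo]
        lo += 1
        hi -= 1
    return items
```

In both cases the input space is the same; only the auxiliary space differs.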

Similar to time complexity, space complexity is often expressed asymptotically in big O notation, such as [math]\displaystyle{ O(n), }[/math] [math]\displaystyle{ O(n\log n), }[/math] [math]\displaystyle{ O(n^\alpha), }[/math] [math]\displaystyle{ O(2^n), }[/math] etc., where n is a characteristic of the input influencing space complexity.

Space complexity classes

Analogously to the time complexity classes DTIME(f(n)) and NTIME(f(n)), the complexity classes DSPACE(f(n)) and NSPACE(f(n)) are the sets of languages that are decidable by deterministic (respectively, non-deterministic) Turing machines that use [math]\displaystyle{ O(f(n)) }[/math] space. The complexity classes PSPACE and NPSPACE allow [math]\displaystyle{ f }[/math] to be any polynomial, analogously to P and NP. That is, [math]\displaystyle{ \mathsf{PSPACE} = \bigcup_{c \in \Z^+} \mathsf{DSPACE}(n^c) }[/math] and [math]\displaystyle{ \mathsf{NPSPACE} = \bigcup_{c \in \Z^+} \mathsf{NSPACE}(n^c). }[/math]

Relationships between classes

The space hierarchy theorem states that, for all space-constructible functions [math]\displaystyle{ f(n), }[/math] there exists a problem that can be solved by a machine with [math]\displaystyle{ f(n) }[/math] memory space, but cannot be solved by a machine with asymptotically less than [math]\displaystyle{ f(n) }[/math] space.

The following containments between complexity classes hold.[2] [math]\displaystyle{ \mathsf{DTIME}(f(n)) \subseteq \mathsf{DSPACE}(f(n)) \subseteq \mathsf{NSPACE}(f(n)) \subseteq \mathsf{DTIME}\left(2^{O(f(n))}\right) }[/math]

Furthermore, Savitch's theorem gives a containment in the reverse direction: if [math]\displaystyle{ f \in \Omega(\log(n)), }[/math] then [math]\displaystyle{ \mathsf{NSPACE}(f(n)) \subseteq \mathsf{DSPACE}\left((f(n))^2\right). }[/math]
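The idea usually used to prove Savitch's theorem is a divide-and-conquer reachability test on the configuration graph of the nondeterministic machine. The following is a minimal sketch of that recursion on an explicit directed graph, not a simulation of a Turing machine. The name `reachable` and the adjacency representation `adj` (a list of sets of successor indices) are assumptions made for this example. The recursion depth is logarithmic in `steps`, and each frame stores a constant number of vertex indices, which is where the squared space bound comes from. The running time, by contrast, is far from polynomial.

```python
def reachable(adj, u, v, steps):
    """Return True if v can be reached from u using at most `steps` edges."""
    if steps <= 1:
        # Base case: zero edges (u == v) or, if allowed, one direct edge.
        return u == v or (steps == 1 and v in adj[u])
    half = (steps + 1) // 2
    # Try every vertex as the midpoint of the path; the two recursive
    # calls run one after the other, so their space can be reused.
    return any(
        reachable(adj, u, mid, half) and reachable(adj, mid, v, steps - half)
        for mid in range(len(adj))
    )
```

For a graph with n vertices, calling `reachable(adj, s, t, n)` decides whether t is reachable from s at all, since a shortest path never needs more than n - 1 edges.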

As a direct corollary, [math]\displaystyle{ \mathsf{PSPACE} = \mathsf{NPSPACE}. }[/math] This result is surprising because it suggests that non-determinism can reduce the space necessary to solve a problem only by a small amount. In contrast, the exponential time hypothesis conjectures that for time complexity, there can be an exponential gap between deterministic and non-deterministic complexity.

The Immerman–Szelepcsényi theorem states that, again for [math]\displaystyle{ f\in\Omega(\log(n)), }[/math] [math]\displaystyle{ \mathsf{NSPACE}(f(n)) }[/math] is closed under complementation. This shows another qualitative difference between time and space complexity classes, as nondeterministic time complexity classes are not believed to be closed under complementation; for instance, it is conjectured that NP ≠ co-NP.[3][4]

LOGSPACE

Main page: L (complexity)

L or LOGSPACE is the set of problems that can be solved by a deterministic Turing machine using only [math]\displaystyle{ O(\log n) }[/math] memory space with regard to input size. Even a single counter that can index the entire [math]\displaystyle{ n }[/math]-bit input requires [math]\displaystyle{ \log n }[/math] space, so LOGSPACE algorithms can maintain only a constant number of counters or other variables of similar bit complexity.
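As an illustration of an algorithm that maintains only a constant number of counters, the sketch below checks whether a string of parentheses is balanced. The function name `is_balanced` is hypothetical. The input is only read, never modified, and the working memory consists of one counter whose value never exceeds n, so it fits in O(log n) bits. The loop position is a second, implicit counter of the same size.

```python
def is_balanced(parens):
    """Check a string over '(' and ')' using a single bounded counter."""
    depth = 0  # number of currently open parentheses, at most len(parens)
    for ch in parens:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                # A closing parenthesis appeared before its opening one.
                return False
    return depth == 0
```

This works for a single bracket type only. With several bracket types the usual stack-based check uses linear auxiliary space, so the sketch does not carry over directly.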

LOGSPACE and other sub-linear space complexity classes are useful when processing large data that cannot fit into a computer's RAM. Sub-linear space algorithms are related to streaming algorithms, but they restrict only how much memory can be used, while streaming algorithms have further constraints on how the input is fed into the algorithm. LOGSPACE also sees use in the field of pseudorandomness and derandomization, where researchers consider the open problem of whether L = RL.[5][6]

The corresponding nondeterministic space complexity class is NL.

Auxiliary space complexity

The term auxiliary space refers to space other than that consumed by the input. Auxiliary space complexity could be formally defined in terms of a Turing machine with a separate input tape which cannot be written to, only read, and a conventional working tape which can be written to. The auxiliary space complexity is then defined (and analyzed) via the working tape. For example, consider the depth-first search of a balanced binary tree with [math]\displaystyle{ n }[/math] nodes: its auxiliary space complexity is [math]\displaystyle{ \Theta(\log n). }[/math]
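The depth-first search example can be checked empirically with a small sketch. The names `Node`, `build_balanced`, and `dfs_peak_stack` are hypothetical helpers introduced only for this illustration. The tree itself plays the role of the input and takes linear space; the explicit stack plays the role of the working tape, and its peak size grows with the height of the tree rather than with the number of nodes.

```python
class Node:
    def __init__(self, left=None, right=None):
        self.left = left
        self.right = right


def build_balanced(n):
    """Build a balanced binary tree with n nodes (this is the input)."""
    if n == 0:
        return None
    left_size = (n - 1) // 2
    return Node(build_balanced(left_size), build_balanced(n - 1 - left_size))


def dfs_peak_stack(root):
    """Preorder DFS; return (nodes visited, peak size of the explicit stack)."""
    stack = [root] if root is not None else []
    visited = 0
    peak = 0
    while stack:
        peak = max(peak, len(stack))
        node = stack.pop()
        visited += 1
        # Push the right child first so the left subtree is explored first.
        for child in (node.right, node.left):
            if child is not None:
                stack.append(child)
    return visited, peak
```

For a perfect tree with 1023 nodes, the stack in this sketch never holds more than 10 nodes, one per level.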

References

  1. Kuo, Way; Zuo, Ming J. (2003), Optimal Reliability Modeling: Principles and Applications, John Wiley & Sons, p. 62, ISBN 9780471275459, https://books.google.com/books?id=vdZ4Bm-LnHMC&pg=PA62 
  2. Arora, Sanjeev; Barak, Boaz (2007), Computational Complexity : A Modern Approach (draft ed.), p. 76, ISBN 9780511804090, https://theory.cs.princeton.edu/complexity/book.pdf 
  3. Immerman, Neil (1988), "Nondeterministic space is closed under complementation", SIAM Journal on Computing 17 (5): 935–938, doi:10.1137/0217058, http://www.cs.umass.edu/~immerman/pub/space.pdf
  4. Szelepcsényi, Róbert (1987), "The method of forcing for nondeterministic automata", Bulletin of the EATCS 33: 96–100
  5. Nisan, Noam (1992), "RL ⊆ SC", Proceedings of the 24th ACM Symposium on Theory of computing (STOC '92), Victoria, British Columbia, Canada, pp. 619–623, doi:10.1145/129712.129772
  6. Reingold, Omer; Trevisan, Luca; Vadhan, Salil (2006), "Pseudorandom walks on regular digraphs and the RL vs. L problem", STOC'06: Proceedings of the 38th Annual ACM Symposium on Theory of Computing, New York: ACM, pp. 457–466, doi:10.1145/1132516.1132583, http://people.seas.harvard.edu/~salil/research/regular.pdf