Low Level Virtual Machine

From Wikipedia, the free encyclopedia

Jump to: navigation, search
LLVM
Developed by LLVM Developer Group & Apple, Inc
Initial release 2000
Latest release 2.4 / 9 November 2008
OS Cross-platform
Type Compiler
License University of Illinois/NCSA Open Source License
Website http://llvm.org/

The Low Level Virtual Machine, generally known as LLVM, is a compiler infrastructure, written in C++, which is designed for compile-time, link-time, run-time, and "idle-time" optimization of programs written in arbitrary imperative programming languages. The LLVM project started in 2000 at the University of Illinois at Urbana-Champaign.

LLVM currently supports the compilation of C, C++, Objective-C, Ada, Brainfuck, and Fortran programs, using front-ends derived from version 4.0.1 and 4.2 of the GNU Compiler Collection (GCC). An alternative front-end for C-like languages is being developed under the name clang.

Using LLVM, one can create compilers for languages like Java, code generators for specific machine architectures, and optimizers independent from particular platforms or languages. The LLVM intermediate representation (IR) is language and architecture independent; it lies between a language-specific module and a code generator for a specific machine. LLVM includes aggressive inter-procedural optimization support, static and JIT compilers.

It has many components in various stages of development; including, but not limited to, a Java bytecode front-end, a CIL front-end, a CPython front-end, and a new graph coloring register allocator. The LLVM JIT compiler is capable of optimizing unnecessary static branches out of a program at runtime, and is therefore useful for partial evaluation in cases where a program has many options—most of which can easily be determined unnecessary in a specific environment. Because of this, it is used in the OpenGL pipeline of Mac OS X v10.5 (Leopard) to provide support for missing hardware features.[1]

Contents

[edit] Code representation

LLVM supports a language-independent instruction set and type system. Most of the instructions have a form similar to three address code. Each instruction is in static single assignment form (SSA), meaning that each variable (called a typed register) is assigned once and is frozen. This helps simplify the analysis of dependencies among variables.

Any form of type conversion, from coercion to the downcasting of an object, is performed explicitly using the cast instruction. LLVM has basic types, like integers of fixed sizes, and exactly five derived types: pointers, arrays, vectors, structures, and functions. A type construct in a concrete language can be represented by combining these basic types in LLVM. For example, a class in C++ can be represented by a combination of structures, functions and arrays of function pointers.

[edit] Licensing

LLVM and clang are publicly available under the University of Illinois/NCSA Open Source License, a GPL-compatible,[2] OSI-approved license that is very similar to the BSD license.

[edit] References

[edit] See also

[edit] External links

Personal tools