Warning
THIS PROJECT IS NOT MAINTAINED ANYMORE.
Android Java bindings for tree-sitter. Currently used
in AndroidIDE
and sora-editor.
// main library
implementation 'com.itsaky.androidide.treesitter:android-tree-sitter:<version>'
// grammar libraries
// <language> is the name of the language grammar. e.g. 'java', 'python', 'xml', etc.
implementation 'com.itsaky.androidide.treesitter:tree-sitter-<language>:<version>'Before using tree sitter, load the native library when appropriate (e.g. in the onCreate() method of your Application class, or in your activity) :
TreeSitter.loadLibrary();The following grammars have been published to Maven central :
- Java
- JSON
- Kotlin
- Logs (for highlighting logs in AndroidIDE)
- Python
- XML
- 
Android NDK 
- 
JDK 11or newer.- JAVA_HOMEmust be set to a JDK installation containing header files in the- JAVA_HOME/includedirectory. The JDK bundled with Android Studio does not contain these headers.
 
- 
build-essential,nodejs,cmakeandninja-buildpackages are required for builds. Install them with :sudo apt install build-essential nodejs cmake ninja-build
- 
You'll also need to install rustandcargo(>= v1.74.0). Follow the instructions here to get started with the installation.
- 
tree-sitter-cli- To build grammars.
As tree-sitter is already included in the source (as a submodule), the tree-sitter-cli is built
from source and then used to build the grammars.
Read the documentation on how to
install pre-built versions of tree-sitter.
If you want to use a prebuilt tree-sitter binary (either manually installed or installed
with npm or cargo), make sure it is accessible in the PATH, then set
the TS_CLI_BUILD_FROM_SOURCE environment variable :
export TS_CLI_BUILD_FROM_SOURCE=falseIMPORTANT: Building on a Linux machine is recommended.
Clone this repo with :
git clone --recurse-submodules https://github.com/AndroidIDEOfficial/android-tree-sitterYou might need to install the Node packages required by the grammars. For example, the cpp grammar
requires you to install the packages:
# from the root directory of this project
cd grammars/cpp && npm install && cd -A normal Gradle build (./gradlew build) can be executed in order to build everything for Android
and the host OS. To build android-tree-sitter and the grammars only for the host OS, you can
execute buildForHost task on appropriate subprojects.
The Gradle modules for the grammars are almost identical, with only minor differences in the CMakeLists file and the Java binding class.
These Gradle modules are automatically generated by
the DynamicModulePlugin.
The generation process relies on
the grammars.json
file. More information about the structure of this JSON file can be found in
the README
under the grammars directory.
Apart from the DynamicModulePlugin, there
are other Gradle plugins
which are used to configure and build the grammars effectively.
The common configuration for the grammars can be found in
the build.gradle.kts
file. This is where you can make changes or adjustments to the module configuration that applies to
all grammars.
The generated modules are located in the rootDir/grammar-modules directory. This is where you can
find the output of the module generation process.
To add a new grammar to the project, follow these steps:
- 
Begin by adding the grammar source code to the grammarsdirectory. To accomplish this, you can add a submodule using the following command:git submodule add <remote_url> grammars/<language_name> 
- 
The language_nameshould be the simple name of the language, without thetree-sitter-prefix. This name is used to generate both the shared library and the Gradle module. For example, if thelanguage_nameisabc:- The module tree-sitter-abcwill be automatically generated.
- The name of the resulting shared library will be libtree-sitter-abc.so.
 
- The module 
- 
After adding the grammar source, update the grammars.jsonfile to include the newly added grammar in the project.
- 
Finally, sync the project to trigger the generation of the module for the newly added grammar. 
You have two ways to load grammars that are not published along with this project :
- Package the grammar with your application.
- Load the grammar at runtime using TSLanguage.loadLanguage.
TSLanguage.loadLanguage uses dlopen to load the library and must be CAREFULLY used. Also,
grammars that are loaded using this method must be closed when they are not used.
Prefer using the first method whenever possible.
You can package the grammar in your Android application as you would package any other shared library :
- Include the libtree-sitter-myLang.sofile in thejniLibsdirectory of your project.
- Create a native method in a Java class which will return the pointer to the language :
package com.my.app;
public class MyClass {
  static {
    System.loadLibrary("tree-sitter-myLang");
  }
  public static native long myLang();
}- Write the C/C++ implementation for the method :
extern "C" TSLanguage *tree_sitter_myLang();
extern "C"
JNIEXPORT jlong JNICALL
Java_com_my_app_MyClass_myLang(JNIEnv *env, jclass clazz) {
  // simply cast the language pointer to jlong
  return (jlong) tree_sitter_myLang();
}- Create and use the TSLanguageinstance :
final TSLanguage myLang=TSLanguage.create("myLang",MyClass.myLang());
// use it with TSParser
  try(final var parser=TSParser.create()){
  parser.setLanguage(myLang);
  ...
  }android-tree-sitter
v3.1.0or newer is required for this method.
TSLanguage provides loadLanguage(String, String) method which can be used to load the grammars
at runtime. This method uses dlopen to load the shared library, get the language instance and
return its pointer. Use this method CAREFULLY.
The language instances created using this method MUST be closed using TSLanguage.close().
Calling the close method ensures that the underlying dlopen'ed library handle is closed
using dlclose.
Usage :
// provide the path to the shared library and the name of the language
// the name is used to cache the language instance
// further invocations of this method with the same lang name returns the
// cached language instance
final TSLanguage myLang=TSLanguage.loadLanguage("/path/to/libtree-sitter-myLang.so","myLang");
  if(myLang!=null){
  // loaded successfully
  }else{
  // failed to load the language
  // see logcat for details
  }Use this language :
try(final var parser=TSParser.create()){
  parser.setLanguage(myLang);
  ...
  }You don't have to keep a reference to myLang. Once loaded, the language can be accessed
using TSLanguageCache :
// returns the 'myLang' instance i.e. both are same
final TSLanguage cachedMyLang=TSLanguageCache.get("myLang");DO NOT FORGET to close the language :
// this closes the underlying library handle
myLang.close();For examples, see the tests.
android-tree-sitter library is free software; you can redistribute it and/or
modify it under the terms of the GNU Lesser General Public
License as published by the Free Software Foundation; either
version 2.1 of the License, or (at your option) any later version.
android-tree-sitter library is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
Lesser General Public License for more details.
You should have received a copy of the GNU General Public License
along with android-tree-sitter.  If not, see <https://www.gnu.org/licenses/>.