At Nubificus, we explore systems software optimizations for deploying lightweight applications in the Cloud and at the Edge. Building on existing open-source tools and frameworks, we mix and match application dependencies and tailor the operating system layer to each application’s requirements. We are a fully distributed company working from the UK, Greece & Spain.
Does hacking the OS/application stack sound like your kind of project? Do you see yourself changing the way users around the world deploy their applications? If the answer is yes, we would love to have a chat and welcome you to our team.
Please send an email to jobs@nubificus.com including:
Make sure to include the job ID in the subject.
In all positions below, we offer:
Nubificus LTD is an equal opportunity employer. We welcome applicants from diverse backgrounds and hire without regard to age, gender, color, religion, national origin, or any other individual characteristic.
We are currently looking for candidates for the following positions:
Responsibilities:
Requirements:
Location:
Responsibilities:
Requirements:
Location:
Responsibilities:
Requirements:
Location:
Internship title: Explore TF internal operations to be offloaded to a generic transport mechanism
Short description:
TensorFlow supports offloading computations to hardware accelerators (GPUs, TPUs). To use an accelerator, a TensorFlow instance currently needs direct access to the relevant hardware. Although this requirement doesn’t prevent distributed deployments on heterogeneous hardware (either by TensorFlow itself or by higher-level frameworks like pycharm), it limits the flexibility of integrating hardware accelerators into virtualized setups where TensorFlow applications are executed, especially when there are latency constraints (e.g. AWS Lambda/Firecracker). Adding a generic transport mechanism to TensorFlow’s internal operations could provide the abstraction needed to use accelerators efficiently in all of these environments.
Responsibilities:
Essential skills:
Desirable skills and experience:
Location:
Internship title: BLAS bindings to the vAccel framework
Short description:
The BLAS (Basic Linear Algebra Subprograms) are routines that provide standard building blocks for performing basic vector and matrix operations. The Level 1 BLAS perform scalar, vector and vector-vector operations, the Level 2 BLAS perform matrix-vector operations, and the Level 3 BLAS perform matrix-matrix operations.
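To make the three levels concrete, here is a minimal pure-Python sketch of one representative routine per level. The function names follow the conventional BLAS routine names (axpy, gemv, gemm), but the signatures are simplified for illustration and are not those of any particular BLAS library or of vAccel.

```python
def axpy(alpha, x, y):
    """Level 1 (vector-vector): return alpha * x + y."""
    return [alpha * xi + yi for xi, yi in zip(x, y)]


def gemv(alpha, a, x, beta, y):
    """Level 2 (matrix-vector): return alpha * A x + beta * y.

    `a` is a list of rows; each row has len(x) entries.
    """
    return [
        alpha * sum(aij * xj for aij, xj in zip(row, x)) + beta * yi
        for row, yi in zip(a, y)
    ]


def gemm(alpha, a, b, beta, c):
    """Level 3 (matrix-matrix): return alpha * A B + beta * C.

    `a`, `b` and `c` are lists of rows with compatible shapes.
    """
    b_columns = list(zip(*b))  # transpose B so columns can be iterated directly
    return [
        [
            alpha * sum(aik * bkj for aik, bkj in zip(a_row, b_col)) + beta * cij
            for b_col, cij in zip(b_columns, c_row)
        ]
        for a_row, c_row in zip(a, c)
    ]
```

The work grows from O(n) at Level 1 to O(n²) at Level 2 and O(n³) at Level 3, which is why the higher levels benefit most from hardware acceleration.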
vAccel is a framework offering hardware acceleration primitives with a focus on portability. It exposes “accelerate-able” functions to programmers and abstracts away hardware complexity by means of a pluggable design.
Adding BLAS support to vAccel lets users run applications efficiently without direct access to hardware accelerators, making the framework well suited to multi-tenant cloud environments.
Responsibilities:
Essential skills:
Desirable skills and experience:
Location: