Skip to content
View zhangisland's full-sized avatar
🥰
🥰

Block or report zhangisland

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,897 838 Updated Sep 13, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,258 651 Updated Aug 12, 2024

The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and describe the UI elements present on the screen: their type, loca…

46 7 Updated Mar 7, 2024

An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]

Python 365 97 Updated Nov 8, 2023

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 29,931 2,739 Updated Sep 17, 2024

[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

Python 182 31 Updated May 5, 2024

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 955 91 Updated Aug 23, 2024

The official Meta Llama 3 GitHub site

Python 26,127 2,933 Updated Aug 12, 2024

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

TypeScript 65,676 3,575 Updated Sep 17, 2024

A UI-Focused Agent for Windows OS Interaction.

Python 7,566 1,037 Updated Sep 13, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 2,676 245 Updated Sep 3, 2024

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.

Python 542 39 Updated Sep 10, 2024

Harness LLMs with Multi-Agent Programming

Python 2,294 216 Updated Sep 13, 2024

A programming framework for agentic AI 🤖

Jupyter Notebook 30,848 4,500 Updated Sep 14, 2024

Official repo for MM-REACT

Python 927 69 Updated Jan 31, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 27,558 2,742 Updated Sep 17, 2024

The official Python library for the OpenAI API

Python 22,033 3,037 Updated Sep 16, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,327 461 Updated Aug 19, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,296 1,080 Updated Sep 2, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,872 401 Updated May 29, 2024

Generative Agents: Interactive Simulacra of Human Behavior

16,226 2,082 Updated Aug 5, 2024

[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning

Python 162 10 Updated Apr 15, 2024

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Python 1,368 139 Updated Apr 3, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,649 1,512 Updated Sep 16, 2024

A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)

Python 67 9 Updated Jul 11, 2023

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Jupyter Notebook 654 94 Updated Jul 30, 2024

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Python 234 32 Updated Mar 22, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,784 361 Updated Aug 7, 2024

A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

Python 10,162 1,232 Updated Aug 20, 2024

Windows GUI Automation with Python (based on text properties)

Python 4,878 689 Updated Aug 21, 2024
Nächste