Project structure of: OthersideAI/self-operating-computer
evaluate.pyEvaluate image adherence with GPT-4 Vision and color-coded messages.actions.pyGenerate content with AI prompts and OpenAI API.dialog.pyDialog system with error handling and input checksexceptions.pyException class for unrecognized modelmain.pySelf-Operating Computer Main Entry Pointprompts.pyAI-assisted Chrome, Docs, Sheets prompts with contextual options.settings.pyManages settings, loads .env, initializes OpenAI client.label.pyImage data handling and encoding utilities.misc.pyConverts percentages, extracts JSON and classifies responses.os.pyOS Utilities: Search, Input, Click, Move. Last Assistant Message.screenshot.pyScreenshot capture and grid addition with PIL.style.pyStyle utility with PromptStyle and ANSI check
README.mdSelf-Operating Computer Framework: Enhancing multimodal models for mouse click predictions and API access.requirements-audio.txtCapture quiet or whispered audio with Whisper-Mic.requirements.txtPython packages: aiohttp 3.9.1, ultralytics 8.0.227run.shInstall Linux, configure .env with OpenAI key.setup.pySetup script for self-operating-computer package using setuptools.