coolify

Author	SHA1	Message	Date
Andras Bacsai	236745ede1	chore: prepare for PR	2026-03-01 18:49:40 +01:00
Andras Bacsai	31555f9e8a	fix(jobs): prevent non-due jobs firing on restart and enrich skip logs with resource links - Refactor shouldRunNow() to only fire on first run (empty cache) if actually due by cron schedule, preventing spurious executions after cache loss or service restart - Add enrichSkipLogsWithLinks() method to fetch and populate resource names and links for tasks, backups, and docker cleanup jobs in skip logs - Update skip logs UI to display resource column with links to related resources, improving navigation and context - Add fallback display when linked resources are deleted - Expand tests to cover both restart scenarios: non-due jobs (should not fire) and due jobs (should fire)	2026-02-28 18:03:29 +01:00
Andras Bacsai	63be5928ab	feat(scheduler): add pagination to skipped jobs and filter manager start events - Implement pagination for skipped jobs display with 20 items per page - Add pagination controls (previous/next buttons) to the scheduled jobs view - Exclude ScheduledJobManager "started" events from run logs, keeping only "completed" events - Add ShouldBeEncrypted interface to ScheduledTaskJob for secure queue handling - Update log filtering to fetch 500 recent skips and slice for pagination - Use Log facade instead of fully qualified class name	2026-02-28 16:23:58 +01:00
Andras Bacsai	a0c177f6f2	feat(jobs): add queue delay resilience to scheduled job execution Implement dedup key-based cron tracking to make scheduled jobs resilient to queue delays. Even if a job is delayed by minutes, it will catch the missed cron window by tracking previousRunDate in cache instead of relying on isDue() alone. - Add dedupKey parameter to shouldRunNow() in ScheduledJobManager - When provided, uses getPreviousRunDate() + cache tracking for resilience - Falls back to isDue() for docker cleanups without dedup key - Prevents double-dispatch within same cron window - Optimize ServerConnectionCheckJob dispatch - Skip SSH checks if Sentinel is healthy (enabled and live) - Reduces redundant checks when Sentinel heartbeat proves connectivity - Remove hourly Sentinel update checks - Consolidate to daily CheckAndStartSentinelJob dispatch - Crash recovery handled by sentinelOutOfSync → ServerCheckJob flow - Add logging for skipped database backups with context (backup_id, database_id, status) - Refactor skip reason methods to accept server parameter, avoiding redundant queries - Add comprehensive test suite for scheduling with various delay scenarios and timezones	2026-02-28 15:06:25 +01:00
Andras Bacsai	f68793ed69	feat(jobs): optimize async job dispatches and enhance Stripe subscription sync Reduce unnecessary job queue pressure and improve subscription sync reliability: - Cache ServerStorageCheckJob dispatch to only trigger on disk percentage changes - Rate-limit ConnectProxyToNetworksJob to maximum once per 10 minutes - Add progress callback support to SyncStripeSubscriptionsJob for UI feedback - Implement bulk fetching of valid Stripe subscription IDs for efficiency - Detect and report resubscribed users (same email, different customer ID) - Fix CleanupUnreachableServers query operator (>= 3 instead of = 3) - Improve empty subId validation in PushServerUpdateJob - Optimize relationship access by using properties instead of query methods - Add comprehensive test coverage for all optimizations	2026-02-28 13:18:44 +01:00
Andras Bacsai	c93296e9a6	feat(healthcheck): add command-based health check support (#8612 )	2026-02-25 12:09:59 +01:00
Andras Bacsai	b88f9fca67	chore: prepare for PR	2026-02-25 12:07:29 +01:00
Andras Bacsai	fe36b70680	chore: prepare for PR	2026-02-25 12:00:24 +01:00
Andras Bacsai	521d995ea1	Merge remote-tracking branch 'origin/next' into 7765-healthcheck-investigation	2026-02-25 11:57:58 +01:00
Andras Bacsai	8e2f0836da	chore: prepare for PR	2026-02-25 11:52:18 +01:00
Andras Bacsai	0580af0d34	feat(healthchecks): add command health checks with input validation Add support for command-based health checks in addition to HTTP-based checks: - New health_check_type field supporting 'http' and 'cmd' values - New health_check_command field with strict regex validation - Updated allowedFields in create_application and update_by_uuid endpoints - Validation rules include max 1000 characters and safe character whitelist - Added feature tests for health check API endpoints - Added unit tests for GithubAppPolicy and SharedEnvironmentVariablePolicy	2026-02-25 11:38:09 +01:00
Andras Bacsai	609cb4190e	fix(health-checks): sanitize and validate CMD healthcheck commands - Add regex validation to restrict allowed characters (alphanumeric, spaces, and specific safe symbols) - Enforce maximum 1000 character limit on healthcheck commands - Strip newlines and carriage returns to prevent command injection - Change input field from textarea to text input in UI - Add warning callout about prohibited shell operators - Add comprehensive validation tests for both valid and malicious command patterns	2026-02-25 11:28:33 +01:00
Andras Bacsai	1759a1631c	chore: prepare for PR	2026-02-25 11:18:46 +01:00
Andras Bacsai	d8419fad93	chore: prepare for PR	2026-02-24 14:57:32 +01:00
Andras Bacsai	2986d7604e	chore: prepare for PR	2026-02-24 10:17:16 +01:00
Andras Bacsai	ec14b55f0a	chore: prepare for PR	2026-02-23 14:28:28 +01:00
Andras Bacsai	133241bac1	fix(service): resolve team lookup via service relationship (#8559 )	2026-02-23 13:24:01 +01:00
Andras Bacsai	61a54afe2b	fix(service): resolve team lookup via service relationship Update service application/database team accessors to traverse the service relation chain and add coverage to prevent null team regressions.	2026-02-23 13:23:12 +01:00
Andras Bacsai	bf51ed905f	chore: prepare for PR	2026-02-23 13:02:06 +01:00
Andras Bacsai	73170fdd33	chore: prepare for PR	2026-02-23 12:12:10 +01:00
Andras Bacsai	fd24a54304	feat(monitoring): add scheduled job monitoring dashboard (#8433 )	2026-02-18 16:16:56 +01:00
Andras Bacsai	664b31212f	chore: prepare for PR	2026-02-18 15:42:42 +01:00
Andras Bacsai	ab79a51e29	fix(api): improve scheduled tasks API with auth, validation, and execution endpoints - Add authorization checks ($this->authorize) for all read/write operations - Use customApiValidator() instead of Validator::make() to match codebase patterns - Add extra field rejection to prevent mass assignment - Use Application::ownedByCurrentTeamAPI() for consistent query patterns - Remove non-existent standalone_postgresql_id from hidden fields - Add execution listing endpoints for both applications and services - Add ScheduledTaskExecution OpenAPI schema - Use $request->only() instead of $request->all() for safe updates - Add ScheduledTaskFactory and feature tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 11:53:58 +01:00
Andras Bacsai	b9e6c12e8d	fix(database): disable proxy on port allocation failure (#8362 )	2026-02-15 13:47:37 +01:00
Andras Bacsai	b7480fbe38	chore: prepare for PR	2026-02-15 13:46:08 +01:00
Andras Bacsai	e9323e3550	chore: prepare for PR	2026-02-15 13:43:08 +01:00
Andras Bacsai	4ec32290cf	fix(server): improve IP uniqueness validation with team-specific error messages - Refactor server IP duplicate detection to use `first()` instead of `get()->count()` - Add team-scoped validation to distinguish between same-team and cross-team IP conflicts - Update error messages to clarify ownership: "already exists in your team" vs "in use by another team" - Apply consistent validation logic across API, boarding, and server management flows - Add comprehensive test suite for IP uniqueness enforcement across teams	2026-02-12 08:10:59 +01:00
Andras Bacsai	95e93ad899	chore: prepare for PR	2026-02-09 14:48:16 +01:00
Andras Bacsai	5d38147899	feat(api): Improve OpenAPI spec and add rate limit handling for Hetzner - Add 429 response with Retry-After header for Hetzner server creation - Create RateLimitException for proper rate limit error handling - Rename cloud_provider_token_id to cloud_provider_token_uuid with deprecation - Fix prices array schema in server-types endpoint with proper items definition - Add explicit default: true to autogenerate_domain properties - Add timeout and retry options to Docker install curl commands - Fix race condition in deployment status update using atomic query	2025-12-11 12:12:43 +01:00
Andras Bacsai	a5331db179	Fix: Correctly set session for team before creating user token	2025-12-11 11:59:59 +01:00
Andras Bacsai	56394ba093	fix: return actual error message from token validation endpoint - Return the specific error from validateProviderToken() instead of generic "Failed to validate token." message - Update test to expect the actual error message 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-10 13:22:53 +01:00
Andras Bacsai	596b1cb76e	refactor: extract token validation into reusable method - Add validateProviderToken() helper method to reduce code duplication - Use request body only ($request->json()->all()) to avoid route parameter conflicts - Add proper logging for token validation failures - Add missing DB import to migration file - Minor test formatting fix 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-10 12:56:57 +01:00
Andras Bacsai	62c394d3a1	feat: add Hetzner server provisioning API endpoints Add complete API support for Hetzner server provisioning, matching UI functionality: Cloud Provider Token Management: - POST /api/v1/cloud-tokens - Create and validate tokens - GET /api/v1/cloud-tokens - List all tokens - GET /api/v1/cloud-tokens/{uuid} - Get specific token - PATCH /api/v1/cloud-tokens/{uuid} - Update token name - DELETE /api/v1/cloud-tokens/{uuid} - Delete token - POST /api/v1/cloud-tokens/{uuid}/validate - Validate token Hetzner Resource Discovery: - GET /api/v1/hetzner/locations - List datacenters - GET /api/v1/hetzner/server-types - List server types - GET /api/v1/hetzner/images - List OS images - GET /api/v1/hetzner/ssh-keys - List SSH keys Server Provisioning: - POST /api/v1/servers/hetzner - Create server with full options Features: - Token validation against provider APIs before storage - Smart SSH key management with MD5 fingerprint deduplication - IPv4/IPv6 network configuration with preference logic - Cloud-init script support with YAML validation - Team-based isolation and security - Comprehensive test coverage (40+ test cases) - Complete documentation with curl examples and Yaak collection 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-10 08:38:09 +01:00
Andras Bacsai	70ff73e954	Merge branch 'next' into macau-v1 Resolved conflicts in ServerManagerJob.php by: - Keeping sentinel update check code from macau-v1 - Preserving sentinel restart code from next branch - Ensuring no duplicate code blocks	2025-12-04 15:07:36 +01:00
Andras Bacsai	4002044877	Refactor: Move sentinel update checks to ServerManagerJob and add tests for hourly dispatch	2025-12-04 14:58:18 +01:00
Andras Bacsai	e4810a28d2	Make proxy restart run as background job to prevent localhost lockout When restarting the proxy on localhost (where Coolify is running), the UI becomes inaccessible because the connection is lost. This change makes all proxy restarts run as background jobs with WebSocket notifications, allowing the operation to complete even after connection loss. Changes: - Enhanced ProxyStatusChangedUI event to carry activityId for log monitoring - Updated RestartProxyJob to dispatch status events and track activity - Simplified Navbar restart() to always dispatch job for all servers - Enhanced showNotification() to handle activity monitoring and new statuses - Added comprehensive unit and feature tests Benefits: - Prevents localhost lockout during proxy restarts - Consistent behavior across all server types - Non-blocking UI with real-time progress updates - Automatic activity log monitoring - Proper error handling and recovery 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-03 10:30:12 +01:00
Andras Bacsai	74bb8f49ce	Fix: Correct time inconsistency in ServerStorageCheckIndependenceTest Move Carbon::setTestNow() to the beginning of each test before creating test data. Previously, tests created servers using now() (real current time) and only afterwards called Carbon::setTestNow(), making sentinel_updated_at inconsistent with the test clock. This caused staleness calculations to use different timelines: - sentinel_updated_at was based on real time (e.g., Dec 2024) - Test execution time was frozen at 2025-01-15 Now all timestamps use the same frozen test time, making staleness checks predictable and tests reliable regardless of when they run. Affected tests (all 7 test cases in the file): - does not dispatch storage check when sentinel is in sync - dispatches storage check when sentinel is out of sync - dispatches storage check when sentinel is disabled - respects custom hourly storage check frequency when sentinel is out of sync - handles VALID_CRON_STRINGS mapping correctly when sentinel is out of sync - respects server timezone for storage checks when sentinel is out of sync - does not dispatch storage check outside schedule 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-03 10:22:09 +01:00
Andras Bacsai	56a0143a25	Fix: Prevent ServerStorageCheckJob duplication when Sentinel is active When Sentinel is enabled and in sync, ServerStorageCheckJob was being dispatched from two locations causing unnecessary duplication: 1. PushServerUpdateJob (every ~30s with real-time filesystem data) 2. ServerManagerJob (scheduled cron check via SSH) This commit modifies ServerManagerJob to only dispatch ServerStorageCheckJob when Sentinel is out of sync or disabled. When Sentinel is active and in sync, PushServerUpdateJob provides real-time storage data, making the scheduled SSH check redundant. Benefits: - Eliminates duplicate storage checks when Sentinel is working - Reduces unnecessary SSH overhead - Storage checks still run as fallback when Sentinel fails - Maintains scheduled checks for servers without Sentinel Updated tests to reflect new behavior: - Storage check NOT dispatched when Sentinel is in sync - Storage check dispatched when Sentinel is out of sync or disabled - All timezone and frequency tests updated accordingly 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-03 10:05:10 +01:00
Andras Bacsai	f75bc85bc1	Merge branch 'next' into decouple-storage-from-sentinel	2025-12-03 09:19:09 +01:00
Andras Bacsai	b47181c790	Decouple ServerStorageCheckJob from Sentinel sync status Server disk usage checks now run on their configured schedule regardless of Sentinel status, eliminating monitoring blind spots when Sentinel is offline, out of sync, or disabled. Storage checks now respect server timezone settings, consistent with patch checks. Changes: - Moved server timezone calculation to top of processServerTasks() - Extracted ServerStorageCheckJob dispatch from Sentinel conditional - Fixed default frequency to '0 23 * * *' (11 PM daily) - Added timezone parameter to storage check scheduling 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-02 13:36:25 +01:00
Andras Bacsai	4b119726d9	Fix Traefik email notification with clickable server links - Add URL generation to notification class using base_url() helper - Replace config('app.url') with proper base_url() for accurate instance URL - Make server names clickable links to proxy configuration page - Use data_get() with fallback values for safer template data access - Add comprehensive tests for URL generation and email rendering 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-02 13:08:40 +01:00
Andras Bacsai	639a56be52	fix: prevent SERVICE_FQDN/SERVICE_URL path duplication (#7370 )	2025-11-27 10:59:39 +01:00
Andras Bacsai	0ecaa191db	fix: prevent SERVICE_FQDN/SERVICE_URL path duplication on FQDN updates Add endsWith() checks before appending template paths in serviceParser() to prevent duplicate paths when parse() is called after FQDN updates. This fixes the bug where services like Appwrite realtime would get `/v1/realtime/v1/realtime`. Fixes #7363 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-27 10:57:24 +01:00
Andras Bacsai	b5666da342	test: add tests for shared environment variable spacing and resolution	2025-11-27 10:45:39 +01:00
Andras Bacsai	1d277f28dd	feat: custom docker entrypoint (#7097 )	2025-11-26 09:31:02 +01:00
Andras Bacsai	4f2d39af03	refactor: send immediate Traefik version notifications instead of delayed aggregation Move notification logic from NotifyOutdatedTraefikServersJob into CheckTraefikVersionForServerJob to send immediate notifications when outdated Traefik is detected. This is more suitable for cloud environments with thousands of servers. Changes: - CheckTraefikVersionForServerJob now sends notifications immediately after detecting outdated Traefik - Remove NotifyOutdatedTraefikServersJob (no longer needed) - Remove delay calculation logic from CheckTraefikVersionJob - Update tests to reflect new immediate notification pattern Trade-offs: - Pro: Faster notifications (immediate alerts) - Pro: Simpler codebase (removed complex delay calculation) - Pro: Better scalability for thousands of servers - Con: Teams may receive multiple notifications if they have many outdated servers 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-18 12:30:50 +01:00
Andras Bacsai	36d2c02498	refactor: move buildpack cleanup logic to model lifecycle hooks Move buildpack switching cleanup from Livewire component to Application model's boot lifecycle. This improves separation of concerns and ensures cleanup happens consistently regardless of how the buildpack change is triggered. Also clears Dockerfile-specific data when switching away from dockerfile buildpack. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-18 10:29:08 +01:00
Andras Bacsai	1270136da9	merge: merge next branch into feat-traefik-version-checker Merged latest changes from the next branch to keep the feature branch up to date. No conflicts were encountered during the merge. Changes from next branch: - Updated application deployment job error logging - Updated server manager job and instance settings - Removed PullHelperImageJob in favor of updated approach - Database migration refinements - Updated versions.json with latest component versions All automatic merges were successful and no manual conflict resolution was required. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-17 14:56:24 +01:00
Andras Bacsai	6593b2a553	feat(proxy): enhance Traefik version notifications to show patch and minor upgrades - Store both patch update and newer minor version information simultaneously - Display patch update availability alongside minor version upgrades in notifications - Add newer_branch_target and newer_branch_latest fields to traefik_outdated_info - Update all notification channels (Discord, Telegram, Slack, Pushover, Email, Webhook) - Show minor version in format (e.g., v3.6) for upgrade targets instead of patch version - Enhance UI callouts with clearer messaging about available upgrades - Remove verbose logging in favor of cleaner code structure - Handle edge case where SSH command returns empty response 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-17 09:59:17 +01:00
Andras Bacsai	cc6a538fca	refactor(proxy): implement parallel processing for Traefik version checks Addresses critical performance issues identified in code review by refactoring the monolithic CheckTraefikVersionJob into a distributed architecture with parallel processing. Changes: - Split version checking into CheckTraefikVersionForServerJob for parallel execution - Extract notification logic into NotifyOutdatedTraefikServersJob - Dispatch individual server checks concurrently to handle thousands of servers - Add comprehensive unit tests for the new job architecture - Update feature tests to cover the refactored workflow Performance improvements: - Sequential SSH calls replaced with parallel queue jobs - Scales efficiently for large installations with thousands of servers - Reduces job execution time from hours to minutes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-14 11:42:58 +01:00

1 2 3

145 commits