minio

Commit Graph

Author	SHA1	Message	Date
Klaus Post	b1c849bedc	Don't send a canceled context to Unlock (#20409 ) AFAICT we send a canceled context to unlock (and thereby releaseAll). This will cause network calls to fail. Instead use background and add 30s timeout.	2024-09-09 08:49:49 -07:00
Harshavardhana	fb24bcfee0	fix: set audit/logger webhook retry interval to maximum 1m (#20404 )	2024-09-09 02:36:47 -07:00
Harshavardhana	8268c12cfb	Add support for audit/logger max retry and retry interval (#20402 ) Current implementation retries forever until our log buffer is full, and we start dropping events. This PR allows you to set a value until we give up on existing audit/logger batches to proceed to process the new ones. Bonus: - do not blow up buffers beyond batchSize value - do not leak the ticker if the worker returns	2024-09-08 05:15:09 -07:00
Sveinn	3f39da48ea	fix: retries and failed message counter (#20401 )	2024-09-07 17:13:57 -07:00
Klaus Post	9d5cdaa2e3	Limit Response Recorder memory (#20399 ) Disable body recording for... * admin inspect * admin metrics * profiling download Also, if the recorded body is > 10MB, drop it.	2024-09-07 12:16:04 -07:00
Praveen raj Mani	261111e728	Kafka notify: support batched commits for queue store (#20377 ) The items will be saved per target batch and will be committed to the queue store when the batch is full Also, periodically commit the batched items to the queue store based on configured commit_timeout; default is 30s; Bonus: compress queue store multi writes	2024-09-06 16:06:30 -07:00
Harshavardhana	0f1e8db4c5	all 2xx status codes to be success for audit (#20394 )	2024-09-06 15:53:34 -07:00
jiuker	241be9709c	fix: jwt error overwritten by nil public key (#20387 )	2024-09-05 19:46:36 -07:00
Anis Eleuch	9b79eec29e	site-repl: Fix ILM document replication in some cases (#20380 ) S3 spec does not accept an ILM XML document containing both <Filter> and <Prefix> XML tags, even if both are empty. That is why we added a 'set' field in some lifecycle structures to decide when and when not to show a tag. However, we forgot to disallow marshaling of Filter when 'set' is set to false. This will fix ILM document replication in a site replication configuration in some cases.	2024-09-04 10:01:26 -07:00
Harshavardhana	c2e318dd40	remove mincache EOS related feature from upstream (#20375 )	2024-09-03 11:23:41 -07:00
Harshavardhana	504e52b45e	protect bpool from buffer pollution by invalid buffers (#20342 )	2024-08-28 18:40:52 -07:00
Harshavardhana	c65e67c357	add more details on the payload sent to webhook audit (#20335 )	2024-08-28 08:31:56 -07:00
Mark Theunissen	9511056f44	fix: simplify error logged when logger target is unreachable (#20304 )	2024-08-22 02:43:48 -07:00
Aditya Manthramurthy	8a11282522	[fix] S3Select: Add some missing input validation (#20278 ) Prevents server panic when some CSV parameters are empty.	2024-08-20 11:31:45 -07:00
Mark Theunissen	6378ca10a4	kms.ListKeys returns CreatedBy/CreatedAt when information is available (#20223 )	2024-08-17 23:43:03 -07:00
Harshavardhana	a5702f978e	remove requests deadline, instead just reject the requests (#20272 ) Additionally set - x-ratelimit-limit - x-ratelimit-remaining To indicate the request rates.	2024-08-16 01:43:49 -07:00
Klaus Post	f1302c40fe	Fix uninitialized replication stats (#20260 ) Services are unfrozen before `initBackgroundReplication` is finished. This means that the globalReplicationStats write is racy. Switch to an atomic pointer. Provide the `ReplicationPool` with the stats, so it doesn't have to be grabbed from the atomic pointer on every use. All other loads and checks are nil, and calls return empty values when stats still haven't been initialized.	2024-08-15 05:04:40 -07:00
Harshavardhana	3b1aa40372	support relative paths for KMS_SECRET_KEY_FILE (#20264 ) fixes #20251	2024-08-15 04:46:39 -07:00
Sveinn	743ddb196a	Removing the audit log retry mechanism (#20259 )	2024-08-14 15:25:08 -07:00
Klaus Post	3ffeabdfcb	Fix govet+staticcheck issues (#20263 ) This is better: https://github.com/golang/go/issues/60529	2024-08-14 10:11:51 -07:00
Harshavardhana	e7a56f35b9	flatten out audit tags, do not send as free-form (#20256 ) move away from map[string]interface{} to map[string]string to simplify the audit, and also provide concise information. avoids large allocations under load(), reduces the amount of audit information generated, as the current implementation was a bit free-form. instead all datastructures must be flattened.	2024-08-13 15:22:04 -07:00
Harshavardhana	acdb355070	update deps and update azure WARM tier implementation (#20247 )	2024-08-13 11:21:34 -07:00
Klaus Post	d8f0e0ea6e	Simplify error logging on event send (#20246 ) Overly verbose, hard to read and can leak data. Print event as JSON and simplify target&error printing.	2024-08-12 08:55:28 -07:00
Harshavardhana	2e0fd2cba9	implement a safer completeMultipart implementation (#20227 ) - optimize writing part.N.meta by writing both part.N and its meta in sequence without network component. - remove part.N.meta, part.N which were partially successful, in quorum loss situations during renamePart() - allow for strict read quorum check arbitrated via ETag for the given part number, this makes it double safer upon final commit. - return an appropriate error when read quorum is missing, instead of returning InvalidPart{}, which is non-retryable error. This kind of situation can happen when many nodes are going offline in rotation, an example of such a restart() behavior is statefulset updates in k8s. fixes #20091	2024-08-12 01:38:15 -07:00
Andreas Auernhammer	14876a4df1	ldap: use custom TLS cipher suites (#20221 ) This commit replaces the LDAP client TLS config and adds a custom list of TLS cipher suites which support RSA key exchange (RSA kex). Some LDAP server connections experience a significant slowdown when these cipher suites are not available. The Go TLS stack disables them by default. (Can be enabled via GODEBUG=tlsrsakex=1). fixes https://github.com/minio/minio/issues/20214 With a custom list of TLS ciphers, Go can pick the TLS RSA key-exchange cipher. Ref: ``` if c.CipherSuites != nil { return c.CipherSuites } if tlsrsakex.Value() == "1" { return defaultCipherSuitesWithRSAKex } ``` Ref: https://cs.opensource.google/go/go/+/refs/tags/go1.22.5:src/crypto/tls/common.go;l=1017 Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-08-07 05:59:47 -07:00
Harshavardhana	a17f14f73a	separate lock from common grid to avoid epoll contention (#20180 ) epoll contention on TCP causes latency build-up when we have high volume ingress. This PR is an attempt to relieve this pressure. upstream issue https://github.com/golang/go/issues/65064 It seems to be a deeper problem; haven't yet tried the fix provide in this issue, but however this change without changing the compiler helps. Of course, this is a workaround for now, hoping for a more comprehensive fix from Go runtime.	2024-07-29 11:10:04 -07:00
Klaus Post	59788e25c7	Update connection deadlines less frequently (#20166 ) Only set write deadline on connections every second. Combine the 2 write locations into 1.	2024-07-26 10:40:11 -07:00
Harshavardhana	064f36ca5a	move to GET for internal stream READs instead of POST (#20160 ) the main reason is to let Go net/http perform necessary book keeping properly, and in essential from consistency point of view its GETs all the way. Deprecate sendFile() as its buggy inside Go runtime.	2024-07-26 05:55:01 -07:00
Klaus Post	15b609ecea	Expose RPC reconnections and ping time (#20157 ) - Keeps track of reconnection count. - Keeps track of connection ping roundtrip times. Sends timestamp in ping message. - Allow ping without payload.	2024-07-25 14:07:21 -07:00
Harshavardhana	3b21bb5be8	use unixNanoTime instead of time.Time in lockRequestorInfo (#20140 ) Bonus: Skip Source, Quorum fields in lockArgs that are never sent during Unlock() phase.	2024-07-24 03:24:01 -07:00
Harshavardhana	6fe2b3f901	avoid sendFile() for ranges or object lengths < 4MiB (#20141 )	2024-07-24 03:22:50 -07:00
Harshavardhana	91805bcab6	add optimizations to bring performance on unversioned READS (#20128 ) allow non-inlined on disk to be inlined via an unversioned ReadVersion() call, we only need ReadXL() to resolve objects with multiple versions only. The choice of this block makes it to be dynamic and chosen by the user via `mc admin config set` Other bonus things - Start measuring internode TTFB performance. - Set TCP_NODELAY, TCP_CORK for low latency	2024-07-23 03:53:03 -07:00
Klaus Post	c0e2886e37	Tweak grid for less writes (#20129 ) Use `runtime.Gosched()` if we have less than maxMergeMessages and the queue is empty. Up maxMergeMessages to 50 to merge more messages into a single write. Add length check for an early bailout on readAllInto when we know packet length.	2024-07-23 03:28:14 -07:00
Andreas Auernhammer	4f5dded4d4	fips: enforce FIPS-compliant TLS ciphers in FIPS mode (#20131 ) This commit enforces FIPS-compliant TLS ciphers in FIPS mode by importing the `fipsonly` module. Otherwise, MinIO still accepts non-FIPS compliant TLS connections.	2024-07-23 03:11:25 -07:00
Harshavardhana	8e618d45fc	remove unnecessary LRU for internode auth token (#20119 ) removes contentious usage of mutexes in LRU, which were never really reused in any manner; we do not need it. To trust hosts, the correct way is TLS certs; this PR completely removes this dependency, which has never been useful. ``` 0 0% 100% 25.83s 26.76% github.com/hashicorp/golang-lru/v2/expirable.(LRU[...]) 0 0% 100% 28.03s 29.04% github.com/hashicorp/golang-lru/v2/expirable.(LRU[...]) ``` Bonus: use `x-minio-time` as a nanosecond to avoid unnecessary parsing logic of time strings instead of using a more straightforward mechanism.	2024-07-22 00:04:48 -07:00
Mark Theunissen	698bb93a46	Allow a KMS Action to specify keys in the Resources of a policy (#20079 )	2024-07-16 07:03:03 -07:00
Klaus Post	ded373e600	Split handleMessages (cosmetic) (#20095 ) Split the read and write sides of handleMessages into two separate functions Cosmetic. The only non-copy-and-paste change is that `cancel(ErrDisconnected)` is moved into the defer on `readStream`.	2024-07-15 12:02:30 -07:00
Shubhendu	f944a42886	Removed user and group details from logs (#20072 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-07-14 11:12:07 -07:00
Harshavardhana	7fcb428622	do not print unexpected logs (#20083 )	2024-07-12 13:51:54 -07:00
Poorna	989c318a28	replication: make large workers configurable (#20077 ) This PR also improves throttling by reducing tokens requested from rate limiter based on available tokens to avoid exceeding throttle wait deadlines	2024-07-12 07:57:31 -07:00
Taran Pelkey	f5d2fbc84c	Add DecodeDN and QuickNormalizeDN functions to LDAP config (#20076 )	2024-07-11 18:04:53 -07:00
Harshavardhana	a8c6465f22	hide some deprecated fields from 'get' output (#20069 ) also update wording on `subnet license="" api_key=""`	2024-07-10 13:16:44 -07:00
Taran Pelkey	6c6f0987dc	Add groups to policy entities (#20052 ) * Add groups to policy entities * update comment --------- Co-authored-by: Harshavardhana <harsha@minio.io>	2024-07-10 11:41:49 -07:00
Austin Chang	5f64658faa	clarify error message for root user credential (#20043 ) Signed-off-by: Austin Chang <austin880625@gmail.com>	2024-07-10 09:57:01 -07:00
Klaus Post	0d0b0aa599	Abstract grid connections (#20038 ) Add `ConnDialer` to abstract connection creation. - `IncomingConn(ctx context.Context, conn net.Conn)` is provided as an entry point for incoming custom connections. - `ConnectWS` is provided to create web socket connections.	2024-07-08 14:44:00 -07:00
Anis Eleuch	b433bf14ba	Add typos check to Makefile (#20051 )	2024-07-08 14:39:49 -07:00
Klaus Post	2040559f71	Fix SkipReader performance with small initial read (#20030 ) If `SkipReader` is called with a small initial buffer it may be doing a huge number of Reads to skip the requested number of bytes. If a small buffer is provided grab a 32K buffer and use that. Fixes slow execution of `testAPIGetObjectWithMPHandler`. Bonuses: * Use `-short` with `-race` test. * Do all suite test types with `-short`. * Enable compressed+encrypted in `testAPIGetObjectWithMPHandler`. * Disable big file tests in `testAPIGetObjectWithMPHandler` when using `-short`.	2024-07-02 08:13:05 -07:00
Poorna	68a9f521d5	fix object lock metadata filter (#20011 )	2024-06-28 18:20:27 -07:00
Harshavardhana	f365a98029	fix: hot-reloading STS credential policy documents (#20012 ) * fix: hot-reloading STS credential policy documents * Support Role ARNs hot load policies (#28) --------- Co-authored-by: Anis Eleuch <vadmeste@users.noreply.github.com>	2024-06-28 16:17:22 -07:00
Harshavardhana	a22ce4550c	protect workers and simplify use of atomics (#19982 ) without atomic load() it is possible that for a slow receiver we would get into a hot-loop, when logCh is full and there are many incoming callers. to avoid this as a workaround enable BATCH_SIZE greater than 100 to ensure that your slow receiver receives data in bulk to avoid being throttled in some manner. this PR however fixes the unprotected access to the current workers value.	2024-06-24 18:15:27 -07:00
Taran Pelkey	168ae81b1f	Fix error when validating DN that is not under base DN (#19971 )	2024-06-21 23:35:35 -07:00
Pedro Juarez	70078eab10	Fix browser UI animation (#19966 ) Browser UI is not showing the animation because the default content-security-policy does not trust the file https://unpkg.com/detect-gpu@5.0.38/dist/benchmarks/d-apple.json that the GPU library needs to identify whether the web browser can play it.	2024-06-20 17:58:58 -07:00
Klaus Post	3415c4dd1e	Fix reconnected deadlock with full queue (#19964 ) When a reconnection happens, `handleMessages` must be able to complete and exit. This can be prevented in a full queue. Deadlock chain (May 10th release) ``` 1 @ 0x44110e 0x453125 0x109f88c 0x109f7d5 0x10a472c 0x10a3f72 0x10a34ed 0x4795e1 # 0x109f88b github.com/minio/minio/internal/grid.(Connection).send+0x3eb github.com/minio/minio/internal/grid/connection.go:548 # 0x109f7d4 github.com/minio/minio/internal/grid.(Connection).queueMsg+0x334 github.com/minio/minio/internal/grid/connection.go:586 # 0x10a472b github.com/minio/minio/internal/grid.(Connection).handleAckMux+0xab github.com/minio/minio/internal/grid/connection.go:1284 # 0x10a3f71 github.com/minio/minio/internal/grid.(Connection).handleMsg+0x231 github.com/minio/minio/internal/grid/connection.go:1211 # 0x10a34ec github.com/minio/minio/internal/grid.(Connection).handleMessages.func1+0x6cc github.com/minio/minio/internal/grid/connection.go:1019 ---> blocks ---> via (Connection).handleMsgWg 1 @ 0x44110e 0x454165 0x454134 0x475325 0x486b08 0x10a161a 0x10a1465 0x2470e67 0x7395a9 0x20e61af 0x20e5f1f 0x7395a9 0x22f781c 0x7395a9 0x22f89a5 0x7395a9 0x22f6e82 0x7395a9 0x22f49a2 0x7395a9 0x2206e45 0x7395a9 0x22f4d9c 0x7395a9 0x210ba06 0x7395a9 0x23089c2 0x7395a9 0x22f86e9 0x7395a9 0xd42582 0x2106c04 # 0x475324 sync.runtime_Semacquire+0x24 runtime/sema.go:62 # 0x486b07 sync.(WaitGroup).Wait+0x47 sync/waitgroup.go:116 # 0x10a1619 github.com/minio/minio/internal/grid.(Connection).reconnected+0xb9 github.com/minio/minio/internal/grid/connection.go:857 # 0x10a1464 github.com/minio/minio/internal/grid.(Connection).handleIncoming+0x384 github.com/minio/minio/internal/grid/connection.go:825 ``` Add a queue cleaner in reconnected that will pop old messages so `handleMessages` can send messages without blocking and exit appropriately for the connection to be re-established. 
Messages are likely dropped by the remote, but we may have some that can succeed, so we only drop when running out of space.	2024-06-20 16:11:40 -07:00
Sveinn	bce93b5cfa	Removing timeout on shutdown (#19956 )	2024-06-19 11:42:47 -07:00
Klaus Post	a6ffdf1dd4	Do not block on distributed unlocks (#19952 ) * Prevents blocking when losing quorum (standard on cluster restarts). * Time out to prevent endless buildup. Timed-out remote locks will be canceled because they miss the refresh anyway. * Reduces latency for all calls since the wall time for the roundtrip to remotes no longer adds to the requests.	2024-06-19 07:35:19 -07:00
Andreas Auernhammer	7ce28c3b1d	kms: use `GetClientCertificate` callback for KES API keys (#19921 ) This commit fixes an issue in the KES client configuration that can cause the following error when connecting to KES: ``` ERROR Failed to connect to KMS: failed to generate data key with KMS key: tls: client certificate is required ``` The Go TLS stack seems to not send a client certificate if it thinks the client certificate cannot be validated by the peer. In case of an API key, we don't care about this since we use public key pinning and the X.509 certificate is just a transport encoding. The `GetClientCertificate` seems to be honored always such that this error does not occur. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-06-12 07:31:26 -07:00
Harshavardhana	b8b956a05d	add changes to Makefile to support dev build	2024-06-10 10:41:02 -07:00
Klaus Post	d2eed44c78	Fix replication checksum transfer (#19906 ) Compression will be disabled by default if SSE-C is specified. So we can still honor SSE-C.	2024-06-10 10:40:33 -07:00
Anis Eleuch	789cbc6fb2	heal: Dangling check to evaluate object parts separately (#19797 )	2024-06-10 08:51:27 -07:00
Klaus Post	a2cab02554	Fix SSE-C checksums (#19896 ) Compression will be disabled by default if SSE-C is specified. So we can still honor SSE-C.	2024-06-10 08:31:51 -07:00
Klaus Post	f00187033d	Two way streams for upcoming locking enhancements (#19796 )	2024-06-07 08:51:52 -07:00
Anis Eleuch	3ba857dfa1	race: Fix detected test race in the internal audit code (#19865 )	2024-06-03 08:44:50 -07:00
Harshavardhana	ba54b39c02	fix: crash when audit webhook queue_dir is not writable (#19854 ) This is a regression introduced in #19275 refactor	2024-06-01 20:03:39 -07:00
Anis Eleuch	2a75225569	kafka: _MINIO_KAFKA_DEBUG to enable sarama debug messages (#19849 )	2024-06-01 08:02:59 -07:00
Klaus Post	e72429c79c	Add sizes to traces (#19851 ) added to storage and grid traces. Can provide more context for traces that aren't HTTP. Others may apply.	2024-05-31 22:17:37 -07:00
Klaus Post	c5b3f5553f	Add per connection RPC metrics (#19852 ) Provides individual and aggregate stats for each RPC connection. Example: ``` "rpc": { "collectedAt": "2024-05-31T14:33:29.1373103+02:00", "connected": 30, "disconnected": 0, "outgoingStreams": 69, "incomingStreams": 0, "outgoingBytes": 174822796, "incomingBytes": 175821566, "outgoingMessages": 768595, "incomingMessages": 768589, "outQueue": 0, "lastPongTime": "2024-05-31T12:33:28Z", "byDestination": { "http://127.0.0.1:9001": { "collectedAt": "2024-05-31T14:33:29.1373103+02:00", "connected": 5, "disconnected": 0, "outgoingStreams": 2, "incomingStreams": 0, "outgoingBytes": 38432543, "incomingBytes": 66604052, "outgoingMessages": 229496, "incomingMessages": 229575, "outQueue": 0, "lastPongTime": "2024-05-31T12:33:27Z" }, "http://127.0.0.1:9002": { "collectedAt": "2024-05-31T14:33:29.1373103+02:00", "connected": 5, "disconnected": 0, "outgoingStreams": 6, "incomingStreams": 0, "outgoingBytes": 38215680, "incomingBytes": 66121283, "outgoingMessages": 228525, "incomingMessages": 228510, "outQueue": 0, "lastPongTime": "2024-05-31T12:33:27Z" }, ... ```	2024-05-31 22:16:24 -07:00
Harshavardhana	8f93e81afb	change service account embedded policy size limit (#19840 ) Bonus: trim-off all the unnecessary spaces to allow for real 2048 characters in policies for STS handlers and re-use the code in all STS handlers.	2024-05-30 11:10:41 -07:00
Harshavardhana	aad50579ba	fix: wire up ILM sub-system properly for help (#19836 )	2024-05-30 01:14:58 -07:00
Taran Pelkey	2d53854b19	Restrict access keys for users and groups to not allow '=' or ',' (#19749 ) * initial commit * Add UTF check --------- Co-authored-by: Harshavardhana <harsha@minio.io>	2024-05-28 10:14:16 -07:00
Harshavardhana	597a785253	fix: authenticate LDAP via actual DN instead of normalized DN (#19805 ) fix: authenticate LDAP via actual DN instead of normalized DN Normalized DN is only for internal representation, not for external communication, any communication to LDAP must be based on actual user DN. LDAP servers do not understand normalized DN. fixes #19757	2024-05-25 06:43:06 -07:00
Aditya Manthramurthy	5f78691fcf	ldap: Add user DN attributes list config param (#19758 ) This change uses the updated ldap library in minio/pkg (bumped up to v3). A new config parameter is added for LDAP configuration to specify extra user attributes to load from the LDAP server and to store them as additional claims for the user. A test is added in sts_handlers.go that shows how to access the LDAP attributes as a claim. This is in preparation for adding SSH pubkey authentication to MinIO's SFTP integration.	2024-05-24 16:05:23 -07:00
Shireesh Anjal	5659cddc84	Add cluster config metrics in metrics-v3 (#19507 ) endpoint: /minio/metrics/v3/cluster/config metrics: - write_quorum - rrs_parity - standard_parity	2024-05-24 05:50:46 -07:00
Krishnan Parthasarathi	6d5bc045bc	Disallow ExpiredObjectAllVersions with object lock (#19792 ) Relaxes restrictions on Expiration and NoncurrentVersionExpiration placed by https://github.com/minio/minio/pull/19785. ref: https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lock-managing.html#object-lock-managing-lifecycle > Object lifecycle management configurations continue functioning normally on protected objects, including placing delete markers. However, a locked version of an object cannot be deleted by a S3 Lifecycle expiration policy. Object Lock is maintained regardless of the object's storage class and throughout S3 Lifecycle transitions between storage classes.	2024-05-22 18:12:48 -07:00
Shubhendu	7c7650b7c3	Add sufficient deadlines and countermeasures to handle hung node scenario (#19688 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io> Signed-off-by: Harshavardhana <harsha@minio.io>	2024-05-22 16:07:14 -07:00
Harshavardhana	ca80eced24	usage of deadline conn at Accept() breaks websocket (#19789 ) fortunately not wired up to use, however if anyone enables deadlines for conn then sporadically MinIO startups fail.	2024-05-22 10:49:27 -07:00
Anis Eleuch	d0e0b81d8e	Fix race get/set system/audit targets to avoid race errors (#19790 )	2024-05-22 09:23:03 -07:00
jiuker	391baa1c9a	test: add reject ilm rule test case (#19788 )	2024-05-22 04:26:59 -07:00
Harshavardhana	ae14681c3e	Revert "Fix two-way stream cancelation and pings (#19763 )" This reverts commit `4d698841f4`.	2024-05-22 03:00:00 -07:00
Klaus Post	4d698841f4	Fix two-way stream cancelation and pings (#19763 ) Do not log errors on oneway streams when sending ping fails. Instead, cancel the stream. This also makes sure pings are sent when blocked on sending responses.	2024-05-22 01:25:25 -07:00
jiuker	9906b3ade9	fix: reject ilm rule when bucket LockEnabled (#19785 )	2024-05-21 23:50:03 -07:00
Harshavardhana	1fd90c93ff	re-use StorageAPI while loading drive formats (#19770 ) Bonus: safe settings for deployment ID to avoid races	2024-05-19 01:06:49 -07:00
Harshavardhana	08d74819b6	handle racy updates to globalSite config (#19750 ) ``` ================== WARNING: DATA RACE Read at 0x0000082be990 by goroutine 205: github.com/minio/minio/cmd.setCommonHeaders() Previous write at 0x0000082be990 by main goroutine: github.com/minio/minio/cmd.lookupConfigs() ```	2024-05-16 16:13:47 -07:00
Harshavardhana	0b3eb7f218	add more deadlines and pass around context under most situations (#19752 )	2024-05-15 15:19:00 -07:00
Harshavardhana	d3db7d31a3	fix: add deadlines for all synchronous REST callers (#19741 ) add deadlines that can be dynamically changed via the drive max timeout values. Bonus: optimize "file not found" case and hung drives/network - circuit break the check and return right away instead of waiting.	2024-05-15 09:52:29 -07:00
Klaus Post	6d3e0c7db6	Tweak one way stream ping (#19743 ) Do not log errors on oneway streams when sending ping fails. Instead cancel the stream. This also makes sure pings are sent when blocked on sending responses. I will do a separate PR that includes this and adds pings to two-way streams as well as tests for pings.	2024-05-15 08:39:21 -07:00
Klaus Post	d4b391de1b	Add PutObject Ring Buffer (#19605 ) Replace the `io.Pipe` from streamingBitrotWriter -> CreateFile with a fixed size ring buffer. This will add an output buffer for encoded shards to be written to disk - potentially via RPC. This will remove blocking when `(*streamingBitrotWriter).Write` is called, and it writes hashes and data. With current settings, the write looks like this: ``` Outbound ┌───────────────────┐ ┌────────────────┐ ┌───────────────┐ ┌────────────────┐ │ │ Parr. │ │ (http body) │ │ │ │ │ Bitrot Hash │ Write │ Pipe │ Read │ HTTP buffer │ Write (syscall) │ TCP Buffer │ │ Erasure Shard │ ──────────► │ (unbuffered) │ ────────────► │ (64K Max) │ ───────────────────► │ (4MB) │ │ │ │ │ │ (io.Copy) │ │ │ └───────────────────┘ └────────────────┘ └───────────────┘ └────────────────┘ ``` We write a Hash (32 bytes). Since the pipe is unbuffered, it will block until the 32 bytes have been delivered to the TCP buffer, and the next Read hits the Pipe. Then we write the shard data. This will typically be bigger than 64KB, so it will block until two blocks have been read from the pipe. When we insert a ring buffer: ``` Outbound ┌───────────────────┐ ┌────────────────┐ ┌───────────────┐ ┌────────────────┐ │ │ │ │ (http body) │ │ │ │ │ Bitrot Hash │ Write │ Ring Buffer │ Read │ HTTP buffer │ Write (syscall) │ TCP Buffer │ │ Erasure Shard │ ──────────► │ (2MB) │ ────────────► │ (64K Max) │ ───────────────────► │ (4MB) │ │ │ │ │ │ (io.Copy) │ │ │ └───────────────────┘ └────────────────┘ └───────────────┘ └────────────────┘ ``` The hash+shard will fit within the ring buffer, so writes will not block - but will complete after a memcopy. Reads can fill the 64KB buffer if there is data for it. If the network is congested, the ring buffer will become filled, and all syscalls will be on full buffers. Only when the ring buffer is filled will erasure coding start blocking. 
Since there is always "space" to write output data, we remove the parallel writing since we are always writing to memory now, and the goroutine synchronization overhead is probably not worth taking. If the output were blocked in the existing, we would still wait for it to unblock in parallel write, so it would make no difference there - except now the ring buffer smooths out the load. There are some micro-optimizations we could look at later. The biggest is that, in most cases, we could encode directly to the ring buffer - if we are not at a boundary. Also, "force filling" the Read requests (i.e., blocking until a full read can be completed) could be investigated and maybe allow concurrent memory on read and write.	2024-05-14 17:11:04 -07:00
jiuker	01bfc78535	Optimization: reuse hashedSecret when LookupConfig (#19724 )	2024-05-12 22:52:27 -07:00
Harshavardhana	9a267f9270	allow caller context during reloads() to cancel (#19687 ) canceled callers might linger around longer, can potentially overwhelm the system. Instead provide a caller context and canceled callers don't hold on to them. Bonus: we have no reason to cache errors, we should never cache errors otherwise we can potentially have quorum errors creeping in unexpectedly. We should let the cache when invalidating hit the actual resources instead.	2024-05-08 17:51:34 -07:00
Anis Eleuch	67bd71b7a5	grid: Fix a window of a disconnected node not marked as offline (#19703 ) LastPong is saved as nanoseconds after a connection or reconnection but saved as seconds when receiving a pong message. The code deciding if a pong is too old can be skewed since it assumes LastPong is only in seconds.	2024-05-08 17:50:13 -07:00
Klaus Post	ec49fff583	Accept multipart checksums with part count (#19680 ) Accept multipart uploads where the combined checksum provides the expected part count. It seems this was added by AWS to make the API more consistent, even if the data is entirely superfluous on multiple levels. Improves AWS S3 compatibility.	2024-05-08 09:18:34 -07:00
Andreas Auernhammer	8b660e18f2	kms: add support for MinKMS and remove some unused/broken code (#19368 ) This commit adds support for MinKMS. Now, there are three KMS implementations in `internal/kms`: Builtin, MinIO KES and MinIO KMS. Adding another KMS integration required some cleanup. In particular: - Various KMS APIs that haven't been and are not used have been removed. A lot of the code was broken anyway. - Metrics are now monitored by the `kms.KMS` itself. For basic metrics this is simpler than collecting metrics for external servers. In particular, each KES server returns its own metrics and no cluster-level view. - The builtin KMS now uses the same en/decryption implemented by MinKMS and KES. It still supports decryption of the previous ciphertext format. It's backwards compatible. - Data encryption keys now include a master key version since MinKMS supports multiple versions (~4 billion in total and 10000 concurrent) per key name. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-05-07 16:55:37 -07:00
Harshavardhana	8ff70ea5a9	turn-off coloring if we have std{err,out} dumb terminals (#19667 )	2024-05-03 17:17:57 -07:00
Harshavardhana	1526e7ece3	extend server config.yaml to support per pool set drive count (#19663 ) This is to support deployments migrating from a multi-pooled wider stripe to lower stripe. MINIO_STORAGE_CLASS_STANDARD is still expected to be same for all pools. So you can satisfy adding custom drive count based pools by adjusting the storage class value. ``` version: v2 address: ':9000' rootUser: 'minioadmin' rootPassword: 'minioadmin' console-address: ':9001' pools: # Specify the nodes and drives with pools - args: - 'node{11...14}.example.net/data{1...4}' - args: - 'node{15...18}.example.net/data{1...4}' - args: - 'node{19...22}.example.net/data{1...4}' - args: - 'node{23...34}.example.net/data{1...10}' set-drive-count: 6 ```	2024-05-03 08:54:03 -07:00
Klaus Post	4a60a7794d	Use better gzip for log rotate (#19651 ) Should be 2x faster with same usage.	2024-05-02 04:38:40 -07:00
Harshavardhana	402a3ac719	support compression after rotation of logs (#19647 )	2024-05-01 15:38:07 -07:00
Harshavardhana	8c1bba681b	add logrotate support for MinIO logs (#19641 )	2024-05-01 10:57:52 -07:00
Harshavardhana	08ff702434	enhance ListSVCs() API to return more info to avoid InfoSvc() (#19642 ) ConsoleUI like applications rely on combination of ListServiceAccounts() and InfoServiceAccount() to populate UI elements, however individually these calls can be slow causing the entire UI to load sluggishly.	2024-05-01 05:41:13 -07:00
Krishnan Parthasarathi	7926401cbd	ilm: Handle DeleteAllVersions action differently for DEL markers (#19481 ) i.e., this rule element doesn't apply to DEL markers. This is a breaking change to how ExpiredObjectDeleteAllVersions functions today. This is necessary to avoid the following highly probable footgun scenario in the future. Scenario: The user uses tags-based filtering to select an object's time to live(TTL). The application sometimes deletes objects, too, making its latest version a DEL marker. The previous implementation skipped tag-based filters if the newest version was DEL marker, voiding the tag-based TTL. The user is surprised to find objects that have expired sooner than expected. * Add DelMarkerExpiration action This ILM action removes all versions of an object if its latest version is a DEL marker. ```xml <DelMarkerObjectExpiration> <Days> 10 </Days> </DelMarkerObjectExpiration> ``` 1. Applies only to objects whose, • The latest version is a DEL marker. • satisfies the number of days criteria 2. Deletes all versions of this object 3. Associated rule can't have tag-based filtering Includes, - New bucket event type for deletion due to DelMarkerExpiration	2024-04-30 18:11:10 -07:00
jiuker	6bb10a81a6	avoid data race for testing (#19635 )	2024-04-30 08:03:35 -07:00
Harshavardhana	a372c6a377	a bunch of fixes for error handling (#19627 ) - handle errFileCorrupt properly - micro-optimization of sending done() response quicker to close the goroutine. - fix logger.Event() usage in a couple of places - handle the rest of the client to return a different error other than lastErr() when the client is closed.	2024-04-28 10:53:50 -07:00

876 Commits