Skip to content

使用 Ray Tune 进行超参数调整

译者:片刻小哥哥

项目地址:https://pytorch.apachecn.org/2.0/tutorials/beginner/hyperparameter_tuning_tutorial

原始地址:https://pytorch.org/tutorials/beginner/hyperparameter_tuning_tutorial.html

超参数调整可以区分平均模型和 高度准确的模型。通常,简单的事情(例如选择不同的学习率或更改 网络层大小)可能会对模型性能产生巨大影响。

幸运的是,有一些工具可以帮助找到最佳参数组合。 Ray Tune 是一个行业标准工具,用于分布式超参数调整。 Ray Tune 包含最新的超参数搜索\算法,与 TensorBoard 和其他分析库集成, 通过 Ray’s 分布式机器学习引擎 原生支持分布式训练 .

在本教程中,我们将向您展示如何将 Ray Tune 集成到 PyTorch 训练工作流程中。我们将扩展 PyTorch 文档中的本教程 用于训练 CIFAR10 图像分类器。

正如您将看到的,我们只需要添加一些细微的修改。特别是,我们 需要

  1. 将数据加载和训练包装在函数中, 2.使一些网络参数可配置, 3.添加检查点(可选), 4.并定义模型调整的搜索空间

要运行本教程,请确保安装了以下软件包:

  • ray[tune] : 分布式超参数调整库
  • torchvision : 用于数据转换器

设置/导入

让’s 从导入开始:

from functools import partial
import os
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
from torch.utils.data import random_split
import torchvision
import torchvision.transforms as transforms
from ray import tune
from ray.air import Checkpoint, session
from ray.tune.schedulers import ASHAScheduler

构建 PyTorch 模型需要大部分导入。只有最后三个 导入用于 Ray Tune。

数据加载器

我们将数据加载器包装在它们自己的函数中,并传递一个全局数据目录。 这样我们就可以在不同的试验之间共享数据目录。

def load_data(data_dir="./data"):
    transform = transforms.Compose(
        [transforms.ToTensor(), transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))]
    )

    trainset = torchvision.datasets.CIFAR10(
        root=data_dir, train=True, download=True, transform=transform
    )

    testset = torchvision.datasets.CIFAR10(
        root=data_dir, train=False, download=True, transform=transform
    )

    return trainset, testset

可配置神经网络

我们只能调整那些可配置的参数。 在此示例中,我们可以指定 全连接层的层大小:

class Net(nn.Module):
    def __init__(self, l1=120, l2=84):
        super(Net, self).__init__()
        self.conv1 = nn.Conv2d(3, 6, 5)
        self.pool = nn.MaxPool2d(2, 2)
        self.conv2 = nn.Conv2d(6, 16, 5)
        self.fc1 = nn.Linear(16 * 5 * 5, l1)
        self.fc2 = nn.Linear(l1, l2)
        self.fc3 = nn.Linear(l2, 10)

    def forward(self, x):
        x = self.pool(F.relu(self.conv1(x)))
        x = self.pool(F.relu(self.conv2(x)))
        x = torch.flatten(x, 1)  # flatten all dimensions except batch
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        x = self.fc3(x)
        return x

训练函数

现在它变得有趣了,因为我们对示例进行了一些更改 来自 PyTorch 文档

我们将训练脚本包装在函数中 `train_cifar(config,

data_dir=None)` 。

config 参数将接收我们想要 训练的超参数。 data_dir 指定我们加载和存储数据的目录, 以便多次运行可以共享相同的数据源。 我们还在运行开始时加载模型和优化器状态,如果提供检查点。在本教程的后面,您将找到有关如何 保存检查点及其用途的信息。

net = Net(config["l1"], config["l2"])

checkpoint = session.get_checkpoint()

if checkpoint:
    checkpoint_state = checkpoint.to_dict()
    start_epoch = checkpoint_state["epoch"]
    net.load_state_dict(checkpoint_state["net_state_dict"])
    optimizer.load_state_dict(checkpoint_state["optimizer_state_dict"])
else:
    start_epoch = 0

优化器的学习率也是可配置的:

optimizer = optim.SGD(net.parameters(), lr=config["lr"], momentum=0.9)

我们还将训练数据分为训练和验证子集。因此,我们对 80% 的数据进行训练,并计算剩余 20% 的验证损失。我们迭代训练和测试集的批量大小 也是可配置的。

使用 DataParallel 添加(多)GPU 支持

图像分类很大程度上受益于 GPU。幸运的是,我们可以继续在 Ray Tune 中使用 PyTorch’s 抽象。因此,我们可以将模型包装在 nn.DataParallel 中以支持多个 GPU 上的数据并行训练:

device = "cpu"
if torch.cuda.is_available():
    device = "cuda:0"
    if torch.cuda.device_count() > 1:
        net = nn.DataParallel(net)
net.to(device)

通过使用 device 变量,我们确保在没有可用 GPU 的情况下训练也能正常工作。 PyTorch 要求我们显式地将数据发送到 GPU 内存, 如下所示:

for i, data in enumerate(trainloader, 0):
    inputs, labels = data
    inputs, labels = inputs.to(device), labels.to(device)

该代码现在支持在 CPU、单个 GPU 和多个 GPU 上进行训练。值得注意的是,Ray 还支持 分数 GPU ,因此我们可以在试验之间共享 GPU ,只要模型仍然适合 GPU 内存。我们’ 稍后再回来 讨论这个问题。

与 Ray Tune 通信

最有趣的部分是与 Ray Tune 的通信:

checkpoint_data = {
    "epoch": epoch,
    "net_state_dict": net.state_dict(),
    "optimizer_state_dict": optimizer.state_dict(),
}
checkpoint = Checkpoint.from_dict(checkpoint_data)

session.report(
    {"loss": val_loss / val_steps, "accuracy": correct / total},
    checkpoint=checkpoint,
)

这里我们首先保存一个检查点,然后向 Ray Tune 报告一些指标。具体来说, 我们将验证损失和准确性发送回 Ray Tune。然后,Ray Tune 可以使用这些指标 来决定哪种超参数配置可以带来最佳结果。这些指标 还可以用于尽早停止表现不佳的试验,以避免 在这些试验上浪费资源。

检查点保存是可选的,但是,如果我们想使用高级 s调度程序,例如 基于人群的训练 。 此外,通过保存检查点,我们可以稍后加载经过训练的模型并在测试集上验证它们。最后,保存检查点对于容错很有用,并且允许 中断训练并在以后继续训练。

完整训练函数

完整的代码示例如下所示:

def train_cifar(config, data_dir=None):
    net = Net(config["l1"], config["l2"])

    device = "cpu"
    if torch.cuda.is_available():
        device = "cuda:0"
        if torch.cuda.device_count() > 1:
            net = nn.DataParallel(net)
    net.to(device)

    criterion = nn.CrossEntropyLoss()
    optimizer = optim.SGD(net.parameters(), lr=config["lr"], momentum=0.9)

    checkpoint = session.get_checkpoint()

    if checkpoint:
        checkpoint_state = checkpoint.to_dict()
        start_epoch = checkpoint_state["epoch"]
        net.load_state_dict(checkpoint_state["net_state_dict"])
        optimizer.load_state_dict(checkpoint_state["optimizer_state_dict"])
    else:
        start_epoch = 0

    trainset, testset = load_data(data_dir)

    test_abs = int(len(trainset) * 0.8)
    train_subset, val_subset = random_split(
        trainset, [test_abs, len(trainset) - test_abs]
    )

    trainloader = torch.utils.data.DataLoader(
        train_subset, batch_size=int(config["batch_size"]), shuffle=True, num_workers=8
    )
    valloader = torch.utils.data.DataLoader(
        val_subset, batch_size=int(config["batch_size"]), shuffle=True, num_workers=8
    )

    for epoch in range(start_epoch, 10):  # loop over the dataset multiple times
        running_loss = 0.0
        epoch_steps = 0
        for i, data in enumerate(trainloader, 0):
            # get the inputs; data is a list of [inputs, labels]
            inputs, labels = data
            inputs, labels = inputs.to(device), labels.to(device)

            # zero the parameter gradients
            optimizer.zero_grad()

            # forward + backward + optimize
            outputs = net(inputs)
            loss = criterion(outputs, labels)
            loss.backward()
            optimizer.step()

            # print statistics
            running_loss += loss.item()
            epoch_steps += 1
            if i % 2000 == 1999:  # print every 2000 mini-batches
                print(
                    "[%d, %5d] loss: %.3f"
                    % (epoch + 1, i + 1, running_loss / epoch_steps)
                )
                running_loss = 0.0

        # Validation loss
        val_loss = 0.0
        val_steps = 0
        total = 0
        correct = 0
        for i, data in enumerate(valloader, 0):
            with torch.no_grad():
                inputs, labels = data
                inputs, labels = inputs.to(device), labels.to(device)

                outputs = net(inputs)
                _, predicted = torch.max(outputs.data, 1)
                total += labels.size(0)
                correct += (predicted == labels).sum().item()

                loss = criterion(outputs, labels)
                val_loss += loss.cpu().numpy()
                val_steps += 1

        checkpoint_data = {
            "epoch": epoch,
            "net_state_dict": net.state_dict(),
            "optimizer_state_dict": optimizer.state_dict(),
        }
        checkpoint = Checkpoint.from_dict(checkpoint_data)

        session.report(
            {"loss": val_loss / val_steps, "accuracy": correct / total},
            checkpoint=checkpoint,
        )
    print("Finished Training")

正如您所看到的,大部分代码直接改编自原始示例。

测试集精度

-

通常,机器学习模型的性能是在保留测试集上进行测试的,其中使用的数据尚未用于训练模型。我们还将其包装在 函数中:

def test_accuracy(net, device="cpu"):
    trainset, testset = load_data()

    testloader = torch.utils.data.DataLoader(
        testset, batch_size=4, shuffle=False, num_workers=2
    )

    correct = 0
    total = 0
    with torch.no_grad():
        for data in testloader:
            images, labels = data
            images, labels = images.to(device), labels.to(device)
            outputs = net(images)
            _, predicted = torch.max(outputs.data, 1)
            total += labels.size(0)
            correct += (predicted == labels).sum().item()

    return correct / total

该函数还需要一个 device 参数,因此我们可以 在 GPU 上进行测试集验证。

配置搜索空间

最后,我们需要定义 Ray Tune’s 搜索空间。这是一个示例:

config = {
    "l1": tune.choice([2 ** i for i in range(9)]),
    "l2": tune.choice([2 ** i for i in range(9)]),
    "lr": tune.loguniform(1e-4, 1e-1),
    "batch_size": tune.choice([2, 4, 8, 16])
}

tune.choice() 接受统一采样的值列表。 在此示例中, l1l2 参数\ n 应该是 4 到 256 之间的 2 的幂,因此可以是 4、8、16、32、64、128 或 256。

lr (学习率)应该在 0.0001 到 0.1 之间均匀采样。最后, 批量大小可以在 2、4、8 和 16 之间选择。

在每次试验中,Ray Tune 现在将从这些搜索空间中随机采样参数组合。然后,它将并行训练多个模型,并找到其中性能最好的一个。我们还使用 ASHAScheduler 它将提前终止 不良执行试验。

我们用 functools.partial 包装 train_cifar 函数来设置常量

data_dir 参数。我们还可以告诉 Ray Tune 每次试验 应提供哪些资源:

gpus_per_trial = 2
# ...
result = tune.run(
    partial(train_cifar, data_dir=data_dir),
    resources_per_trial={"cpu": 8, "gpu": gpus_per_trial},
    config=config,
    num_samples=num_samples,
    scheduler=scheduler,
    checkpoint_at_end=True)

您可以指定可用的 CPU 数量,例如 以增加 PyTorch DataLoader 实例的 num_workers 数量。在每次试验中,所选的 GPU 数量对 PyTorch 可见。试验无法访问 未请求’ 的 GPU - 因此您’ 不必关心 使用同一组资源的两个试验。

这里我们还可以指定分数 GPU,因此像 gpus_per_Trial=0.5 这样的内容是完全有效的。然后,这些试验将相互共享 GPU。 您只需确保模型仍然适合 GPU 内存即可。

训练模型后,我们将找到性能最好的模型并从检查点文件加载经过训练的 网络。然后我们获得测试集的准确性并通过打印报告 一切。

完整的 main 函数如下所示:

def main(num_samples=10, max_num_epochs=10, gpus_per_trial=2):
    data_dir = os.path.abspath("./data")
    load_data(data_dir)
    config = {
        "l1": tune.choice([2**i for i in range(9)]),
        "l2": tune.choice([2**i for i in range(9)]),
        "lr": tune.loguniform(1e-4, 1e-1),
        "batch_size": tune.choice([2, 4, 8, 16]),
    }
    scheduler = ASHAScheduler(
        metric="loss",
        mode="min",
        max_t=max_num_epochs,
        grace_period=1,
        reduction_factor=2,
    )
    result = tune.run(
        partial(train_cifar, data_dir=data_dir),
        resources_per_trial={"cpu": 2, "gpu": gpus_per_trial},
        config=config,
        num_samples=num_samples,
        scheduler=scheduler,
    )

    best_trial = result.get_best_trial("loss", "min", "last")
    print(f"Best trial config: {best_trial.config}")
    print(f"Best trial final validation loss: {best_trial.last_result['loss']}")
    print(f"Best trial final validation accuracy: {best_trial.last_result['accuracy']}")

    best_trained_model = Net(best_trial.config["l1"], best_trial.config["l2"])
    device = "cpu"
    if torch.cuda.is_available():
        device = "cuda:0"
        if gpus_per_trial > 1:
            best_trained_model = nn.DataParallel(best_trained_model)
    best_trained_model.to(device)

    best_checkpoint = best_trial.checkpoint.to_air_checkpoint()
    best_checkpoint_data = best_checkpoint.to_dict()

    best_trained_model.load_state_dict(best_checkpoint_data["net_state_dict"])

    test_acc = test_accuracy(best_trained_model, device)
    print("Best trial test set accuracy: {}".format(test_acc))


if __name__ == "__main__":
    # You can change the number of GPUs per trial here:
    main(num_samples=10, max_num_epochs=10, gpus_per_trial=0)
Downloading https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz to /var/lib/jenkins/workspace/beginner_source/data/cifar-10-python.tar.gz

  0% 0/170498071 [00:00<?, ?it/s]
  0% 393216/170498071 [00:00<00:45, 3746334.21it/s]
  2% 3145728/170498071 [00:00<00:09, 17436737.90it/s]
  4% 6225920/170498071 [00:00<00:07, 23367611.59it/s]
  5% 9207808/170498071 [00:00<00:06, 25833973.42it/s]
  7% 12288000/170498071 [00:00<00:05, 27559162.59it/s]
  9% 15400960/170498071 [00:00<00:05, 28644180.15it/s]
 11% 18415616/170498071 [00:00<00:05, 29008808.83it/s]
 13% 21594112/170498071 [00:00<00:04, 29856262.02it/s]
 14% 24641536/170498071 [00:00<00:04, 29973490.66it/s]
 16% 27656192/170498071 [00:01<00:04, 30019521.27it/s]
 18% 30736384/170498071 [00:01<00:04, 30170808.41it/s]
 20% 33783808/170498071 [00:01<00:04, 30190525.05it/s]
 22% 36831232/170498071 [00:01<00:04, 30167201.58it/s]
 23% 39878656/170498071 [00:01<00:04, 29260428.25it/s]
 25% 42827776/170498071 [00:01<00:04, 26982577.57it/s]
 27% 45580288/170498071 [00:01<00:04, 25636421.39it/s]
 28% 48201728/170498071 [00:01<00:04, 24915725.82it/s]
 30% 50724864/170498071 [00:01<00:05, 23913970.15it/s]
 31% 53149696/170498071 [00:02<00:04, 23532339.49it/s]
 33% 55541760/170498071 [00:02<00:04, 23383498.24it/s]
 34% 57901056/170498071 [00:02<00:04, 22903012.20it/s]
 35% 60227584/170498071 [00:02<00:04, 22780849.92it/s]
 37% 62521344/170498071 [00:02<00:04, 22779328.41it/s]
 38% 64815104/170498071 [00:02<00:04, 22556584.11it/s]
 39% 67305472/170498071 [00:02<00:04, 23175939.68it/s]
 41% 69632000/170498071 [00:02<00:04, 22891094.15it/s]
 42% 71991296/170498071 [00:02<00:04, 22940283.94it/s]
 44% 74317824/170498071 [00:02<00:04, 22988041.55it/s]
 45% 76644352/170498071 [00:03<00:04, 22776567.73it/s]
 46% 78938112/170498071 [00:03<00:04, 22793542.87it/s]
 48% 81297408/170498071 [00:03<00:03, 22959098.16it/s]
 49% 83853312/170498071 [00:03<00:03, 23617265.17it/s]
 51% 86474752/170498071 [00:03<00:03, 24258118.52it/s]
 52% 89030656/170498071 [00:03<00:03, 24568422.57it/s]
 54% 91652096/170498071 [00:03<00:03, 24970464.06it/s]
 55% 94273536/170498071 [00:03<00:03, 25296320.63it/s]
 57% 96862208/170498071 [00:03<00:02, 25420331.92it/s]
 58% 99450880/170498071 [00:03<00:02, 25452213.25it/s]
 60% 102039552/170498071 [00:04<00:02, 25515213.67it/s]
 61% 104595456/170498071 [00:04<00:02, 25447731.73it/s]
 63% 107184128/170498071 [00:04<00:02, 25529738.58it/s]
 65% 110362624/170498071 [00:04<00:02, 27354838.87it/s]
 67% 113410048/170498071 [00:04<00:02, 28003287.49it/s]
 68% 116457472/170498071 [00:04<00:01, 28584688.95it/s]
 70% 119341056/170498071 [00:04<00:01, 28467744.40it/s]
 72% 122191872/170498071 [00:04<00:01, 27357032.69it/s]
 73% 124944384/170498071 [00:04<00:01, 27076107.47it/s]
 75% 128548864/170498071 [00:04<00:01, 29572523.94it/s]
 77% 131891200/170498071 [00:05<00:01, 30681280.76it/s]
 79% 135299072/170498071 [00:05<00:01, 31556313.85it/s]
 81% 138608640/170498071 [00:05<00:00, 31907727.24it/s]
 83% 141885440/170498071 [00:05<00:00, 32121077.41it/s]
 85% 145260544/170498071 [00:05<00:00, 32430257.49it/s]
 87% 148766720/170498071 [00:05<00:00, 33141756.31it/s]
 89% 152141824/170498071 [00:05<00:00, 33279231.96it/s]
 91% 155680768/170498071 [00:05<00:00, 33877287.66it/s]
 94% 159514624/170498071 [00:05<00:00, 35200983.04it/s]
 96% 163381248/170498071 [00:05<00:00, 36120545.90it/s]
 98% 167280640/170498071 [00:06<00:00, 36818707.01it/s]
100% 170498071/170498071 [00:06<00:00, 27589781.71it/s]
Extracting /var/lib/jenkins/workspace/beginner_source/data/cifar-10-python.tar.gz to /var/lib/jenkins/workspace/beginner_source/data
Files already downloaded and verified
2023-11-15 00:41:41,211 WARNING services.py:1816 -- WARNING: The object store is using /tmp instead of /dev/shm because /dev/shm has only 2147479552 bytes available. This will harm performance! You may be able to free up space by deleting files in /dev/shm. If you are inside a Docker container, you can increase /dev/shm size by passing '--shm-size=10.24gb' to 'docker run' (or add it to the run_options list in a Ray cluster config). Make sure to set this to more than 30% of available RAM.
2023-11-15 00:41:41,352 INFO worker.py:1625 -- Started a local Ray instance.
2023-11-15 00:41:42,538 INFO tune.py:218 -- Initializing Ray automatically. For cluster usage or custom Ray initialization, call `ray.init(...)` before `tune.run(...)`.
== Status ==
Current time: 2023-11-15 00:41:47 (running for 00:00:05.36)
Using AsyncHyperBand: num_stopped=0
Bracket: Iter 8.000: None | Iter 4.000: None | Iter 2.000: None | Iter 1.000: None
Logical resource usage: 2.0/16 CPUs, 0/1 GPUs (0.0/1.0 accelerator_type:M60)
Result logdir: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number of trials: 10/10 (9 PENDING, 1 RUNNING)
+-------------------------+----------+-----------------+--------------+------+------+-------------+
| Trial name              | status   | loc             |   batch_size |   l1 |   l2 |          lr |
|-------------------------+----------+-----------------+--------------+------+------+-------------|
| train_cifar_bf064_00000 | RUNNING  | 172.17.0.2:2681 |            2 |   16 |    1 | 0.00213327  |
| train_cifar_bf064_00001 | PENDING  |                 |            4 |    1 |    2 | 0.013416    |
| train_cifar_bf064_00002 | PENDING  |                 |            2 |  256 |   64 | 0.0113784   |
| train_cifar_bf064_00003 | PENDING  |                 |            8 |   64 |  256 | 0.0274071   |
| train_cifar_bf064_00004 | PENDING  |                 |            4 |   16 |    2 | 0.056666    |
| train_cifar_bf064_00005 | PENDING  |                 |            4 |    8 |   64 | 0.000353097 |
| train_cifar_bf064_00006 | PENDING  |                 |            8 |   16 |    4 | 0.000147684 |
| train_cifar_bf064_00007 | PENDING  |                 |            8 |  256 |  256 | 0.00477469  |
| train_cifar_bf064_00008 | PENDING  |                 |            8 |  128 |  256 | 0.0306227   |
| train_cifar_bf064_00009 | PENDING  |                 |            2 |    2 |   16 | 0.0286986   |
+-------------------------+----------+-----------------+--------------+------+------+-------------+


(func pid=2681) 文件已下载并验证
(func pid=2681) 文件已下载并验证
== 状态 ==
当前时间: 2023-11-15 00:41:53 (运行时间 00: 00:10.69)
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000:无
逻辑资源使用:4.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录:/var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
编号试验次数:10/10(8 个待定,2 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |待定 | | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |待定 | | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |待定 | | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |待定 | | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |待定 | | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |待定 | | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=2752) 文件已下载并验证[跨集群重复 2 次](Ray 默认情况下会删除重复日志。设置 RAY_DEDUP_LOGS=0 以禁用日志重复删除,或参阅 https://docs.ray.io/en/master/ray -observability/ray-logging.html#log-deduplication 了解更多选项。)
== 状态 ==
当前时间:2023-11-15 00:41:58(运行时​​间 00:00:16.14)
使用 AsyncHyperBand : num_stopped=0
括号: Iter 8.000: 无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 6.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(7 个待处理,3 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |待定 | | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |待定 | | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |待定 | | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |待定 | | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |待定 | | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=2681) [1, 2000] 丢失:2.330
(func pid=3240) 文件已下载并验证[跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00 :42:04(运行 00:00:21.80)
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 8.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(6 个待定,4 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |待定 | | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |待定 | | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |待定 | | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |待定 | | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=2752) [1, 2000] 丢失:2.313
(func pid=3728) 已下载并验证的文件 [跨集群重复 2 次]
(func pid=2681) [1, 4000] 丢失:1.152\ n== 状态 ==
当前时间:2023-11-15 00:42:11(运行时间为 00:00:28.43)
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 10.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(5 个待定,5 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |待定 | | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |待定 | | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |待定 | | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=4217) 文件已下载并验证
(func pid=4217) 文件已下载并验证
(func pid=3240) 文件已下载并验证
(func pid=2752) [1, 4000]损失:1.155
== 状态 ==
当前时间:2023-11-15 00:42:18(运行时​​间为 00:00:35.41)
使用 AsyncHyperBand:num_stopped=0
Bracket:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 12.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(4 个待定,6 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |待定 | | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |待定 | | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=4708) 文件已下载并验证
(func pid=3728) [1, 2000] 丢失:2.101 [跨集群重复 2x]
(func pid=4708) 文件已下载并验证
(func pid=2752) [1, 6000] 损失: 0.769 [跨集群重复 3 次]
== 状态 ==
当前时间: 2023-11-15 00:42:26 (运行 00:00:44.31)
使用AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 14.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(3 个待定,7 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |待定 | | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=5199) 文件已下载并验证
(func pid=5199) 文件已下载并验证
(func pid=3728) [1, 4000] 损失:1.033 [跨集群重复 2 次]
==状态 ==
当前时间:2023-11-15 00:42:34(运行时间为 00:00:52.20)
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 16.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(2 个待定,8 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=5692) 文件已下载并验证
(func pid=5692) 文件已下载并验证
== 状态 ==
当前时间: 2023-11-15 00:42:39 (运行时间 00: 00:57.21)
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 16.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(2 个待定,8 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=3240) [1, 6000] 损失:0.772 [跨集群重复 5 次]
== 状态 ==
当前时间:2023-11-15 00:42:44(运行时间为 00:01:02.22) 
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: 无
逻辑资源使用: 16.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42
Number试验次数:10/10(2 个待定,8 个正在运行)
+------------------------+-----------------------------+--------------+------+------+- ------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | lr |
|------------------------+----------+---------------+--------------+------+------+---------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 |
+-------------------------+---------+------------------+--------------+------+-----+------------ -+


(func pid=4708) [1, 4000] 损失:1.121 [跨集群重复 2 倍]
train_cifar_bf064_00003 的结果:
 准确度:0.2254
 日期:2023-11-15_00-42-47
 完成:false
主机名:47e6634b378a \ n iterations_since_restore:1 \ n损失:2.028742760181427 \ n node_ip:172.17.0.2 \ n pid:3728 \ n should_checkpoint:true \ n time_since_restore:43.234081745147705 \ n time_this_it er_s: 43.234081745147705
 time_total_s: 43.234081745147705
 时间戳: 1700008967 
 Training_iteration: 1
 Trial_id: bf064_00003

== 状态 ==
当前时间: 2023-11-15 00:42:52 (运行 00:01:10.06)
使用 AsyncHyperBand: num_stopped=0\ nBracket:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.028742760181427
逻辑资源使用: 16.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(2 个待处理,8 个正在运行)
+------------------------+-----------+----------------+--------------+------+-----+ ----------+--------+--------------------+-------- -+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|-------------------------+----------+---------------+--------------+------+------+---------------+--------+--------------------+---------+-------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 | | | |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | | | | |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 | | | | |
+-------------------------+----------+------------------+--------------+------+------+---------------- +--------+------------------+---------+------------ -+


(func pid=5692) [1, 2000] 损失:1.855 [跨集群重复 4 次]
== 状态 ==
当前时间:2023-11-15 00:42:57(运行时间为 00:01:15.06) 
使用 AsyncHyperBand:num_stopped=0
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.028742760181427
逻辑资源使用: 16.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(2 个待处理,8 个正在运行)
+------------------------+-----------+----------------+--------------+------+-----+ ----------+--------+--------------------+-------- -+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|-------------------------+----------+---------------+--------------+------+------+---------------+--------+--------------------+---------+-------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00001 |跑步 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 | | | |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | | | | |
|火车_cifar_bf064_00008 |待定 | | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 | | | | |
+-------------------------+----------+------------------+--------------+------+------+---------------- +--------+------------------+---------+------------ -+


train_cifar_bf064_00001 的结果:
 准确度:0.1008
 日期:2023-11-15_00-43-00
 完成:true
 主机名:47e6634b378a
 iterations_since_restore:1
 损失:2.3081862998008726
 node_ip:1 72.17.0.2\ n pid:2752 \ n should_checkpoint:true \ n time_since_restore:67.53077411651611 \ n time_this_iter_s:67.53077411651611 \ n time_total_s:67.53077411651611 \ n时间戳:1700008980 \ n Training_iteration : 1
 Trial_id: bf064_00001

试验 train_cifar_bf064_00001 已完成。
(func pid=2681) [1, 14000] 丢失:0.329 [跨集群重复 4 次]
(func pid=2752) 文件已下载并验证
(func pid=2752) 文件已下载并验证
== 状态 = =
当前时间:2023-11-15 00:43:05(运行时间为 00:01:23.27)
使用 AsyncHyperBand:num_stopped=1
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.16846452999115
逻辑资源使用: 16.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 个待处理、8 个正在运行、1 个已终止)
+-------------------------+-------------+-----------------+--------------+------+-------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 | | | |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | | | | |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=5692) [1, 4000] 损失:0.780 [跨集群重复 3 次]
== 状态 ==
当前时间:2023-11-15 00:43:10(运行时间 00:01:28.28) 
使用 AsyncHyperBand:num_stopped=1
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.16846452999115
逻辑资源使用: 16.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 个待处理、8 个正在运行、1 个已终止)
+-------------------------+-------------+-----------------+--------------+------+-------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 | | | |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00006 |跑步 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | | | | |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |待定 | | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00006 的结果:
 准确度:0.1689
 日期:2023-11-15_00-43-11
 完成:true
 主机名:47e6634b378a
 iterations_since_restore:1
 损失:2.2780231006622316
 node_ip:1 72.17.0.2\ n pid:5199 \ n should_checkpoint:true \ n time_since_restore:44.75676083564758 \ n time_this_iter_s:44.75676083564758 \ n time_total_s:44.75676083564758 \ n时间戳:1700008991 \ n Training_iteration : 1
 Trial_id: bf064_00006

试验 train_cifar_bf064_00006 已完成。
(func pid=5199) 文件已下载并验证
(func pid=5199) 文件已下载并验证
== 状态 ==
当前时间: 2023-11-15 00:43:16 (运行时间 00:01: 34.11)
使用 AsyncHyperBand:num_stopped=2
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 16.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(8 次运行,2 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 | | | |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | | | | |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=2752) [1, 2000] 损失:2.108 [跨集群重复 5 次]
== 状态 ==
当前时间:2023-11-15 00:43:21(运行时间为 00:01:39.12) 
使用 AsyncHyperBand:num_stopped=2
括号:Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 16.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(8 次运行,2 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00004 |跑步 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 | | | |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | | | | |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=2681) [1, 18000] 损失:0.256 [跨集群重复 2 次]
train_cifar_bf064_00007 的结果:
 准确度:0.4756
 日期:2023-11-15_00-43-24
 完成:false
主机名:47e6634b378a \ n iterations_since_restore:1 \ n损失:1.451918194580078 \ n node_ip:172.17.0.2 \ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:49.35653567314148 \ n time_this_iter _s: 49.35653567314148
 time_total_s: 49.35653567314148
 时间戳: 1700009004 
 Training_iteration: 1
 Trial_id: bf064_00007

train_cifar_bf064_00004 的结果:
 准确度: 0.0986
 日期: 2023-11-15_00-43-24
 完成: true
 主机名: 47e6634b378a
 iterations_since_restore: 1 
 损失:2.3359092045783996
 node_ip:172.17.0.2
 pid:4217
 should_checkpoint:true
 time_since_restore:73.20940923690796
 time_this_iter_s:73.20940923690796
 time_total_ s: 73.20940923690796
 时间戳: 1700009004
 训练迭代: 1
 Trial_id : bf064_00004

试用 train_cifar_bf064_00004 已完成。
== 状态 ==
当前时间: 2023-11-15 00:43:29 (运行时间 00:01:46.67)
使用 AsyncHyperBand: num_stopped=3
Bracket: Iter 8.000:无 | Iter 4.000:无 | Iter 2.000:无 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 14.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(7 次运行,3 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 1 | 43.2341 | 2.02874 | 0.2254 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 | | | |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 1 | 49.3565 | 49.3565 1.45192 | 0.4756 |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3240) [1, 12000] 损失: 0.386 [跨集群重复 3 次]
train_cifar_bf064_00003 的结果:
 准确度: 0.2368
 日期: 2023-11-15_00-43-32
 完成: false
主机名:47e6634b378a \ n iterations_since_restore:2 \ n损失:2.0462085193634034 \ n node_ip:172.17.0.2 \ n pid:3728 \ n should_checkpoint:true \ n time_since_restore:87.77183723449707 \ n time_this_it er_s: 44.537755489349365
 time_total_s: 87.77183723449707
 时间戳: 1700009012 
 Training_iteration: 2
 Trial_id: bf064_00003

train_cifar_bf064_00005 的结果:
 准确度: 0.3659
 日期: 2023-11-15_00-43-33
 完成: false
 主机名: 47e6634b378a
 iterations_since_restore: 1 
 损失:1.6868873691082
 node_ip:172.17.0.2
 pid:4708
 should_checkpoint:true
 time_since_restore:75.3049852848053
 time_this_iter_s:75.3049852848053
 time_total_s:75.3049852848053
 时间戳:1700009013
 训练迭代:1
 Trial_id : bf064_00005

(func pid=5199) [1, 4000] 损失: 1.169 [跨集群重复 3 次]
== 状态 ==
当前时间: 2023-11-15 00:43:38 (运行时间00:01:55.74)
使用 AsyncHyperBand:num_stopped=3
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-2.0462085193634034 | Iter 1.000: -2.153382930421829
逻辑资源使用: 14.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(7 次运行,3 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 2 | 87.7718 | 2.04621 | 0.2368 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 1 | 49.3565 | 49.3565 1.45192 | 0.4756 |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:43:43(运行时间为 00:02:00.76)
使用 AsyncHyperBand:num_stopped=3
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-2.0462085193634034 | Iter 1.000: -2.153382930421829
逻辑资源使用: 14.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(7 次运行,3 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | | | | |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 2 | 87.7718 | 2.04621 | 0.2368 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 1 | 49.3565 | 49.3565 1.45192 | 0.4756 |
|火车_cifar_bf064_00008 |跑步 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 | | | |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [2, 2000] 损失:1.656 [跨集群重复 2 倍]
train_cifar_bf064_00008 的结果:
 准确度:0.2189
 日期:2023-11-15_00-43-48
 完成:true
主机名:47e6634b378a \ n iterations_since_restore:1 \ n损失:2.180678374862671 \ n node_ip:172.17.0.2 \ n pid:2752 \ n should_checkpoint:true \ n time_since_restore:47.31365776062012 \ n time_this_iter _s: 47.31365776062012
 time_total_s: 47.31365776062012
 时间戳: 1700009028 
 Training_iteration:1
 Trial_id:bf064_00008

试验 train_cifar_bf064_00008 已完成。
train_cifar_bf064_00000 的结果:
 准确度:0.102
 日期:2023-11-15_00-43-50
 完成:true
 主机名:47e663 4b378a 
 iterations_since_restore: 1
 损失: 2.303914258527756
 node_ip: 172.17.0.2
 pid: 2681
 should_checkpoint: true
 time_since_restore: 123.00910973548889
 time_this_iter_s: 123.0091097 3548889
 总时间:123.00910973548889
 时间戳:1700009030
 训练迭代: 1
 Trial_id: bf064_00000

试验 train_cifar_bf064_00000 已完成。
== 状态 ==
当前时间: 2023-11-15 00:43:51 (运行时间 00:02:08.39)
使用 AsyncHyperBand: num_stopped= 5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-2.0462085193634034 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 12.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(6 次运行,4 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |跑步 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 2 | 87.7718 | 2.04621 | 0.2368 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 1 | 49.3565 | 49.3565 1.45192 | 0.4756 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=5692) [2, 4000] 损失:0.692 [跨集群重复 4 次]
== 状态 ==
当前时间:2023-11-15 00:43:56(运行时间 00:02:13.41) 
使用 AsyncHyperBand:num_stopped=5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-2.0462085193634034 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 2 | 87.7718 | 2.04621 | 0.2368 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 1 | 49.3565 | 49.3565 1.45192 | 0.4756 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3728) [3, 4000] 损失:1.050 [跨集群重复 3 次]
== 状态 ==
当前时间:2023-11-15 00:44:01(运行时间为 00:02:18.41) 
使用 AsyncHyperBand:num_stopped=5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-2.0462085193634034 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 2 | 87.7718 | 2.04621 | 0.2368 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 1 | 49.3565 | 49.3565 1.45192 | 0.4756 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00007 的结果:
 准确度:0.5254
 日期:2023-11-15_00-44-05
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:2
 损失:1.3188089747428895
 node_ip:1 72.17.0.2\ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:91.16031050682068 \ n time_this_iter_s:41.8037748336792 \ n time_total_s:91.16031050682068 \ n时间戳:1700009045 \ n Training_iteration: 2
 Trial_id: bf064_00007

(func pid=5199) [ 1, 10000] 丢失:0.468 [跨集群重复 2 倍]
== 状态 ==
当前时间:2023-11-15 00:44:11(运行 00:02:28.38)
使用 AsyncHyperBand:num_stopped=5 
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.6825087470531463 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 2 | 87.7718 | 2.04621 | 0.2368 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 2 | 91.1603 | 91.1603 1.31881 | 0.5254 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00003 的结果:
 准确度:0.2211
 日期:2023-11-15_00-44-12
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:3
 损失:2.0626959712028503
 node_ip:1 72.17.0.2\ n pid:3728 \ n should_checkpoint:true \ n time_since_restore:127.77058959007263 \ n time_this_iter_s:39.99875235557556 \ n time_total_s:127.77058959007263 \ n时间戳:1700009052 \ n Training_迭代:3
 Trial_id:bf064_00003

(func pid=3240) [ 1, 18000] 丢失:0.257 [跨集群重复 2 倍]
== 状态 ==
当前时间:2023-11-15 00:44:17(运行 00:02:34.58)
使用 AsyncHyperBand:num_stopped=5 
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.6825087470531463 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 3 | 127.771 | 127.771 2.0627 | 2.0627 0.2211 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 2 | 91.1603 | 91.1603 1.31881 | 0.5254 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:44:22(运行时间为 00:02:39.59)
使用 AsyncHyperBand:num_stopped=5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.6825087470531463 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 3 | 127.771 | 127.771 2.0627 | 2.0627 0.2211 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 2 | 91.1603 | 91.1603 1.31881 | 0.5254 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=5199) [1, 14000] 损失:0.334 [跨集群重复 4 次]
== 状态 ==
当前时间:2023-11-15 00:44:27(运行时间为 00:02:44.60) 
使用 AsyncHyperBand:num_stopped=5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.6825087470531463 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 3 | 127.771 | 127.771 2.0627 | 2.0627 0.2211 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 2 | 91.1603 | 91.1603 1.31881 | 0.5254 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:44:32(运行时间为 00:02:49.61)
使用 AsyncHyperBand:num_stopped=5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.6825087470531463 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 3 | 127.771 | 127.771 2.0627 | 2.0627 0.2211 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 1 | 75.305 | 75.305 1.68689 | 0.3659 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 2 | 91.1603 | 91.1603 1.31881 | 0.5254 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=5692) [3, 4000] 损失: 0.629 [跨集群重复 4 次]
train_cifar_bf064_00005 的结果:
 准确度: 0.4406
 日期: 2023-11-15_00-44-36
 完成: false
主机名:47e6634b378a \ n iterations_since_restore:2 \ n损失:1.4916184381961823 \ n node_ip:172.17.0.2 \ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:138.08053255081177 \ n time_this_ iter_s: 62.77554726600647
 time_total_s: 138.08053255081177
 时间戳: 1700009076 
 Training_iteration: 2
 Trial_id: bf064_00005

(func pid=3728) [4, 4000] 损失: 1.155 [跨集群重复 2 次]
== 状态 ==
当前时间: 2023-11-15 00:44:41(运行 00:02:58.50)
使用 AsyncHyperBand:num_stopped=5
括号:Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.2293507377624513
逻辑资源使用: 10.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(5 次运行,5 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00002 |跑步 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | | | | |
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 3 | 127.771 | 127.771 2.0627 | 2.0627 0.2211 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 2 | 91.1603 | 91.1603 1.31881 | 0.5254 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00002 的结果:
 准确度:0.1028
 日期:2023-11-15_00-44-44
 完成:true
 主机名:47e6634b378a
 iterations_since_restore:1
 损失:2.3167312343597413
 node_ip:1 72.17.0.2\ n pid:3240 \ n should_checkpoint:true \ n time_since_restore:165.89642000198364 \ n time_this_iter_s:165.89642000198364 \ n time_total_s:165.89642000198364 \ n时间戳:1700009084 \ n训练_iteration: 1
 Trial_id: bf064_00002

试验 train_cifar_bf064_00002 已完成。
train_cifar_bf064_00007 的结果:
 准确度:0.5636
 日期:2023-11-15_00-44-44
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:3
 损失:1.246082941901684
 node_ip:172.17.0.2
 pid: 5692
 should_checkpoint: true
 time_since_restore: 129.91541862487793
 time_this_iter_s: 38.75510811805725
 time_total_s: 129.91541862487793
 时间戳: 1700009084
 Training_iteration: 3
 试验 ID: bf064_00007

(func pid=4708) [3, 2000 ] 损失:1.495 [跨集群重复 2 倍]
== 状态 ==
当前时间:2023-11-15 00:44:49(运行时间 00:03:07.13)
使用 AsyncHyperBand:num_stopped=6
Bracket: Iter 8.000:无 | Iter 4.000:无 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 8.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(4 次运行,6 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 3 | 127.771 | 127.771 2.0627 | 2.0627 0.2211 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 3 | 129.915 | 129.915 1.24608 | 0.5636 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00003 的结果:
 准确度:0.0984
 日期:2023-11-15_00-44-51
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:4
 损失:2.3100966709136963
 node_ip:1 72.17.0.2\ n pid:3728 \ n should_checkpoint:true \ n time_since_restore:166.9245798587799 \ n time_this_iter_s:39.153990268707275 \ n time_total_s:166.9245798587799 \ n时间戳:1700009091 \ n Training_it次数: 4
 Trial_id: bf064_00003

(func pid=5199) [ 1, 20000] 损失: 0.234
== 状态 ==
当前时间: 2023-11-15 00:44:56 (运行时间 00:03:13.74)
使用 AsyncHyperBand: num_stopped=6
Bracket: Iter 8.000:无 |迭代 4.000:-2.3100966709136963 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 8.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(4 次运行,6 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 4 | 166.925 | 166.925 2.3101 | 2.3101 0.0984 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 3 | 129.915 | 129.915 1.24608 | 0.5636 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [3, 4000] 损失: 0.743
== 状态 ==
当前时间: 2023-11-15 00:45:01 (运行 00:03:18.75)
使用 AsyncHyperBand: num_stopped= 6
括号:Iter 8.000:无 |迭代 4.000:-2.3100966709136963 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 8.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(4 次运行,6 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 4 | 166.925 | 166.925 2.3101 | 2.3101 0.0984 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 3 | 129.915 | 129.915 1.24608 | 0.5636 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3728) [5, 2000] 损失:2.310 [跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00:45:06(运行 00:03:23.76) 
使用 AsyncHyperBand:num_stopped=6
括号:Iter 8.000:无 |迭代 4.000:-2.3100966709136963 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.2780231006622316
逻辑资源使用: 8.0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(4 次运行,6 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 4 | 166.925 | 166.925 2.3101 | 2.3101 0.0984 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 3 | 129.915 | 129.915 1.24608 | 0.5636 |
|火车_cifar_bf064_00009 |跑步 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | | | | |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00009 的结果:
 准确度:0.0994
 日期:2023-11-15_00-45-07
 完成:true
 主机名:47e6634b378a
 iterations_since_restore:1
 损失:2.3218609105825423
 node_ip:1 72.17.0.2\ n pid:5199
 should_checkpoint:true
 time_since_restore:116.12234020233154
 time_this_iter_s:116.12234020233154
 time_total_s:116.12234020233154
时间戳:1700009107
训练_iteration: 1
 Trial_id: bf064_00009

试验 train_cifar_bf064_00009 已完成。
(func pid=5692) [4, 4000] 损失: 0.598 [跨集群重复 2 次]
== 状态 ==
当前时间: 2023-11-15 00:45:12 (运行 00:03:30.25)
使用AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-2.3100966709136963 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 4 | 166.925 | 166.925 2.3101 | 2.3101 0.0984 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 3 | 129.915 | 129.915 1.24608 | 0.5636 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [3, 8000] 损失: 0.364
(func pid=3728) [5, 4000] 损失: 1.155
== 状态 ==
当前时间: 2023-11-15 00:45: 17(运行时间为 00:03:35.26)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-2.3100966709136963 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 4 | 166.925 | 166.925 2.3101 | 2.3101 0.0984 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 3 | 129.915 | 129.915 1.24608 | 0.5636 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00007 的结果:
 准确度:0.5565
 日期:2023-11-15_00-45-19
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:4
 损失:1.2592607187390328
 node_ip:1 72.17.0.2\ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:164.99337315559387 \ n time_this_iter_s:35.07795453071594 \ n time_total_s:164.99337315559387 \ n时间戳:1700009119 \ n Training_迭代: 4
 Trial_id: bf064_00007

== 状态 ==
当前时间:2023-11-15 00:45:24(运行时间为 00:03:42.21)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 4 | 166.925 | 166.925 2.3101 | 2.3101 0.0984 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 4 | 164.993 | 164.993 1.25926 | 1.25926 0.5565 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [3, 10000] 损失: 0.289
train_cifar_bf064_00003 的结果:
 准确度: 0.0994
 日期: 2023-11-15_00-45-27
 完成: false
 主机名: 47e6634b378a
 iterations_since_restore : 5
 丢失: 2.3078094497680666
 node_ip: 172.17.0.2
 pid: 3728
 should_checkpoint: true
 time_since_restore: 202.74810004234314
 time_this_iter_s: 35.82352018356323
 time_total_s: 202.74810004234314
 时间戳: 1700009127
 训练迭代: 5\ n Trial_id: bf064_00003

(func pid=5692) [5, 2000] 损失: 1.068
== 状态 ==
当前时间: 2023-11-15 00:45:32 (运行时间 00:03: 49.56)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 5 | 202.748 | 2.30781 | 2.30781 0.0994 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 2 | 138.081 | 1.49162 | 0.4406 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 4 | 164.993 | 164.993 1.25926 | 1.25926 0.5565 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.4593
 日期:2023-11-15_00-45-32
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:3
 损失:1.4651956122279166
 node_ip:1 72.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:194.22449851036072 \ n time_this_iter_s:56.14396595954895 \ n time_total_s:194.22449851036072 \ n时间戳:1700009132 \ n Training_迭代: 3
 Trial_id: bf064_00005

== 状态 ==
当前时间:2023-11-15 00:45:37(运行时间为 00:03:54.65)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 5 | 202.748 | 2.30781 | 2.30781 0.0994 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 4 | 164.993 | 164.993 1.25926 | 1.25926 0.5565 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3728) [6, 2000] 损失: 2.310
(func pid=4708) [4, 2000] 损失: 1.408
== 状态 ==
当前时间: 2023-11-15 00:45: 42(运行 00:03:59.66)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 5 | 202.748 | 2.30781 | 2.30781 0.0994 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 4 | 164.993 | 164.993 1.25926 | 1.25926 0.5565 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:45:47(运行时间为 00:04:04.67)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 5 | 202.748 | 2.30781 | 2.30781 0.0994 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 4 | 164.993 | 164.993 1.25926 | 1.25926 0.5565 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [4, 4000] 损失:0.696 [跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00:45:52(运行时间 00:04:09.68) 
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 5 | 202.748 | 2.30781 | 2.30781 0.0994 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 4 | 164.993 | 164.993 1.25926 | 1.25926 0.5565 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00007 的结果:
 准确度:0.575
 日期:2023-11-15_00-45-52
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:5
 损失:1.2706222375750542
 node_ip:17 2.17.0.2\ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:198.0124270915985 \ n time_this_iter_s:33.01905393600464 \ n time_total_s:198.0124270915985 \ n时间戳:1700009152 \ n Training_iteration : 5
 Trial_id: bf064_00007

== 状态 ==
当前时间:2023-11-15 00:45:57(运行时间为 00:04:15.22)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 5 | 202.748 | 2.30781 | 2.30781 0.0994 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 5 | 198.012 | 1.27062 | 0.575 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [4, 6000] 损失: 0.462 [跨集群重复 2 次]
train_cifar_bf064_00003 的结果:
 准确度: 0.0999
 日期: 2023-11-15_00-46-01
 完成: false
主机名:47e6634b378a \ n iterations_since_restore:6 \ n损失:2.3086511194229127 \ n node_ip:172.17.0.2 \ n pid:3728 \ n should_checkpoint:true \ n time_since_restore:236.95160698890686 \ n time_this_ iter_s: 34.20350694656372
 time_total_s: 236.95160698890686
 时间戳: 1700009161 
 Training_iteration: 6
 Trial_id: bf064_00003

(func pid=5692) [6, 2000] 损失: 1.030
== 状态 ==
当前时间: 2023-11-15 00:46:06 (运行时间为 00:04:23.76)
使用 AsyncHyperBand: num_stopped=7
Bracket: Iter 8.000: None |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 6 | 236.952 | 236.952 2.30865 | 2.30865 0.0999 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 5 | 198.012 | 1.27062 | 0.575 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [4, 8000] 损失: 0.340
== 状态 ==
当前时间: 2023-11-15 00:46:11 (运行 00:04:28.77)
使用 AsyncHyperBand: num_stopped= 7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 6 | 236.952 | 236.952 2.30865 | 2.30865 0.0999 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 5 | 198.012 | 1.27062 | 0.575 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3728) [7, 2000] 损失: 2.310
(func pid=5692) [6, 4000] 损失: 0.543
== 状态 ==
当前时间: 2023-11-15 00:46: 16(运行 00:04:33.78)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 6 | 236.952 | 236.952 2.30865 | 2.30865 0.0999 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 5 | 198.012 | 1.27062 | 0.575 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:46:21(运行时间为 00:04:38.79)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.7846786948263644 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 6 | 236.952 | 236.952 2.30865 | 2.30865 0.0999 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 3 | 194.224 | 194.224 1.4652 | 0.4593 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 5 | 198.012 | 1.27062 | 0.575 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.4865
 日期:2023-11-15_00-46-24
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:4
 损失:1.4287443323016167
 node_ip:1 72.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:246.26441478729248 \ n time_this_iter_s:52.03991627693176 \ n time_total_s:246.26441478729248 \ n时间戳:1700009184 \ n Training_迭代:4
 Trial_id:bf064_00005

(func pid=3728) [ 7, 4000] 损失:1.155 [跨集群重复 2 倍]
train_cifar_bf064_00007 的结果:
 准确度:0.572
 日期:2023-11-15_00-46-26
 完成:false
 主机名:47e6634b378a
 iterations_since_restore: 6
 丢失:1.291559043586254
 node_ip:172.17.0.2
 pid:5692
 should_checkpoint:true
 time_since_restore:231.19213557243347
 time_this_iter_s:33.17970848083496
 time_to tal_s: 231.19213557243347
 时间戳: 1700009186
 训练迭代: 6
 Trial_id: bf064_00007

== 状态 ==
当前时间: 2023-11-15 00:46:31 (运行时间 00:04:48.41)
使用 AsyncHyperBand: num_stopped=7
Bracket: Iter 8.000: None |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 6 | 236.952 | 236.952 2.30865 | 2.30865 0.0999 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 6 | 231.192 | 231.192 1.29156 | 0.572 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [5, 2000] 损失: 1.306
train_cifar_bf064_00003 的结果:
 准确度: 0.0993
 日期: 2023-11-15_00-46-35
 完成: false
 主机名: 47e6634b378a
 iterations_since_restore : 7
 丢失: 2.306323892593384
 node_ip: 172.17.0.2
 pid: 3728
 should_checkpoint: true
 time_since_restore: 271.1397247314453
 time_this_iter_s: 34.18811774253845
 time_ Total_s: 271.1397247314453
 时间戳: 1700009195
 训练迭代: 7\ n Trial_id: bf064_00003

(func pid=5692) [7, 2000] 损失: 0.984
== 状态 ==
当前时间: 2023-11-15 00:46:40 (运行时间 00:04: 57.95)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 7 | 271.14 | 271.14 2.30632 | 2.30632 0.0993 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 6 | 231.192 | 231.192 1.29156 | 0.572 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:46:45(运行时间为 00:05:02.96)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 7 | 271.14 | 271.14 2.30632 | 2.30632 0.0993 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 6 | 231.192 | 231.192 1.29156 | 0.572 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3728) [8, 2000] 损失:2.310 [跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00:46:50(运行时间 00:05:07.97) 
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 7 | 271.14 | 271.14 2.30632 | 2.30632 0.0993 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 6 | 231.192 | 231.192 1.29156 | 0.572 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:46:55(运行时间为 00:05:12.97)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 7 | 271.14 | 271.14 2.30632 | 2.30632 0.0993 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 6 | 231.192 | 231.192 1.29156 | 0.572 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00007 的结果:
 准确度:0.5584
 日期:2023-11-15_00-46-59
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:7
 损失:1.3122931097507478
 node_ip:1 72.17.0.2\ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:264.2365355491638 \ n time_this_iter_s:33.04439997673035 \ n time_total_s:264.2365355491638 \ n时间戳:1700009219 \ n Training_iteration : 7
 Trial_id: bf064_00007

(func pid=3728) [ 8, 4000] 丢失:1.155 [跨集群重复 3 倍]
== 状态 ==
当前时间:2023-11-15 00:47:04(运行 00:05:21.45)
使用 AsyncHyperBand:num_stopped=7 
括号:Iter 8.000:无 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 7 | 271.14 | 271.14 2.30632 | 2.30632 0.0993 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 7 | 264.237 | 264.237 1.31229 | 1.31229 0.5584 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:47:09(运行时间为 00:05:26.46)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:无 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 7 | 271.14 | 271.14 2.30632 | 2.30632 0.0993 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 7 | 264.237 | 264.237 1.31229 | 1.31229 0.5584 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00003 的结果:
 准确度:0.0983
 日期:2023-11-15_00-47-09
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:8
 损失:2.3119888971328737
 node_ip:1 72.17.0.2\ n pid:3728
 should_checkpoint:true
 time_since_restore:305.2927396297455
 time_this_iter_s:34.15301489830017
 time_total_s:305.2927396297455
时间戳:1700009229
training_iteration : 8
 Trial_id: bf064_00003

(func pid=4708) [ 5, 10000] 丢失:0.257 [跨集群重复 2 倍]
== 状态 ==
当前时间:2023-11-15 00:47:14(运行 00:05:32.10)
使用 AsyncHyperBand:num_stopped=7 
括号:Iter 8.000:-2.3119888971328737 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 8 | 305.293 | 305.293 2.31199 | 2.31199 0.0983 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 4 | 246.264 | 246.264 1.42874 | 0.4865 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 7 | 264.237 | 264.237 1.31229 | 1.31229 0.5584 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.5344
 日期:2023-11-15_00-47-16
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:5
 损失:1.2956238314509392
 node_ip:1 72.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:298.829829454422 \ n time_this_iter_s:52.56541466712952 \ n time_total_s:298.829829454422 \ n时间戳:1700009236 \ n Training_iteration: 5
 Trial_id: bf064_00005

== 状态 ==
当前时间:2023-11-15 00:47:21(运行时间为 00:05:39.26)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-2.3119888971328737 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 8 | 305.293 | 305.293 2.31199 | 2.31199 0.0983 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 7 | 264.237 | 264.237 1.31229 | 1.31229 0.5584 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=3728) [9, 2000] 损失:2.309 [跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00:47:26(运行时间为 00:05:44.27) 
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-2.3119888971328737 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 8 | 305.293 | 305.293 2.31199 | 2.31199 0.0983 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 7 | 264.237 | 264.237 1.31229 | 1.31229 0.5584 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:47:31(运行时间为 00:05:49.28)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-2.3119888971328737 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 8 | 305.293 | 305.293 2.31199 | 2.31199 0.0983 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 7 | 264.237 | 264.237 1.31229 | 1.31229 0.5584 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00007 的结果:
 准确度:0.5579
 日期:2023-11-15_00-47-32
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:8
 损失:1.345710159623623
 node_ip:17 2.17.0.2\ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:297.4871952533722 \ n time_this_iter_s:33.250659704208374 \ n time_total_s:297.4871952533722 \ n时间戳:1700009252 \ n Training_it次数: 8
 Trial_id: bf064_00007

(func pid=3728) [ 9, 4000] 丢失:1.155 [跨集群重复 3 倍]
== 状态 ==
当前时间:2023-11-15 00:47:37(运行 00:05:54.70)
使用 AsyncHyperBand:num_stopped=7 
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 8 | 305.293 | 305.293 2.31199 | 2.31199 0.0983 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 8 | 297.487 | 297.487 1.34571 | 0.5579 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:47:42(运行时间为 00:05:59.71)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 8 | 305.293 | 305.293 2.31199 | 2.31199 0.0983 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 8 | 297.487 | 297.487 1.34571 | 0.5579 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [6, 6000] 损失:0.422 [跨集群重复 2 次]
train_cifar_bf064_00003 的结果:
 准确度:0.0958
 日期:2023-11-15_00-47-44
 完成:false
主机名:47e6634b378a
 iterations_since_restore:9
 损失:2.312095869445801
 node_ip:172.17.0.2
 pid:3728
 should_checkpoint:true
 time_since_restore:339.6230595111847
 time_this_iter _s: 34.33031988143921
 time_total_s: 339.6230595111847
 时间戳: 1700009264 
 Training_iteration: 9
 Trial_id: bf064_00003

== 状态 ==
当前时间: 2023-11-15 00:47:49 (运行 00:06:06.43)
使用 AsyncHyperBand: num_stopped=7\ nBracket:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 9 | 339.623 | 339.623 2.3121 | 2.3121 0.0958 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 8 | 297.487 | 297.487 1.34571 | 0.5579 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [6, 8000] 损失:0.315 [跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00:47:54(运行时间 00:06:11.44) 
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 9 | 339.623 | 339.623 2.3121 | 2.3121 0.0958 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 8 | 297.487 | 297.487 1.34571 | 0.5579 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:47:59(运行时间为 00:06:16.45)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 9 | 339.623 | 339.623 2.3121 | 2.3121 0.0958 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 8 | 297.487 | 297.487 1.34571 | 0.5579 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [6, 10000] 损失:0.251 [跨集群重复 3 次]
== 状态 ==
当前时间:2023-11-15 00:48:04(运行时间为 00:06:21.46) 
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 9 | 339.623 | 339.623 2.3121 | 2.3121 0.0958 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 5 | 298.83 | 298.83 1.29562 | 0.5344 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 8 | 297.487 | 297.487 1.34571 | 0.5579 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00007 的结果:
 准确度:0.5614
 日期:2023-11-15_00-48-05
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:9
 损失:1.3895927495241165
 node_ip:1 72.17.0.2\ n pid:5692 \ n should_checkpoint:true \ n time_since_restore:330.4981036186218 \ n time_this_iter_s:33.010908365249634 \ n time_total_s:330.4981036186218 \ n时间戳:1700009285 \ n Training_it次数: 9
 Trial_id: bf064_00007

(func pid=3728) [ 10, 4000] 损失: 1.155
train_cifar_bf064_00005 的结果:
 准确度: 0.5396
 日期: 2023-11-15_00-48-08
 完成: false
 主机名: 47e6634b378a
 iterations_since_restore: 6
 损失: 1.27774 6639201045 
 node_ip: 172.17.0.2
 pid: 4708
 should_checkpoint: true
 time_since_restore: 350.8566966056824
 time_this_iter_s: 52.026867151260376
 time_total_s: 350.8566966056824
时间戳:1700009288
 训练迭代:6
 试验 ID:bf064_00005
\ n== 状态 ==
当前时间:2023-11-15 00:48:13(运行时间为 00:06:31.28)
使用 AsyncHyperBand:num_stopped=7
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 6.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(3 次运行,7 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00003 |跑步 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 9 | 339.623 | 339.623 2.3121 | 2.3121 0.0958 |
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 9 | 330.498 | 1.38959 | 1.38959 0.5614 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=5692) [10, 2000] 损失:0.909
train_cifar_bf064_00003 的结果:
 准确度:0.1031
 日期:2023-11-15_00-48-18
 完成:true
 主机名:47e6634b378a
 iterations_since_restore : 10
 丢失: 2.3079761926651
 node_ip: 172.17.0.2
 pid: 3728
 should_checkpoint: true
 time_since_restore: 373.88145756721497
 time_this_iter_s: 34.25839805603027
 time_ Total_s: 373.88145756721497
 时间戳: 1700009298
 训练迭代: 10\ n Trial_id: bf064_00003

试用 train_cifar_bf064_00003 已完成。
== 状态 ==
当前时间: 2023-11-15 00:48:23 (运行时间 00:06:40.70)
使用 AsyncHyperBand: num_stopped=8
Bracket :迭代 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 4.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(2 次运行,8 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 9 | 330.498 | 1.38959 | 1.38959 0.5614 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [7, 4000] 损失:0.619 [跨集群重复 2 次]
== 状态 ==
当前时间:2023-11-15 00:48:28(运行时​​间为 00:06:45.71) 
使用 AsyncHyperBand:num_stopped=8
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 4.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(2 次运行,8 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 9 | 330.498 | 1.38959 | 1.38959 0.5614 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:48:33(运行时间为 00:06:50.72)
使用 AsyncHyperBand:num_stopped=8
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 4.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(2 次运行,8 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+--------------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00007 |跑步 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 9 | 330.498 | 1.38959 | 1.38959 0.5614 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [7, 6000] 损失:0.403 [跨集群重复 2 次]
train_cifar_bf064_00007 的结果:
 准确度:0.5389
 日期:2023-11-15_00-48-37
 完成:true
主机名:47e6634b378a
 iterations_since_restore:10
 损失:1.4658892618656159
 node_ip:172.17.0.2
 pid:5692
 should_checkpoint:true
 time_since_restore:362.4083983898163
 time_this_ iter_s: 31.910294771194458
 time_total_s: 362.4083983898163
 时间戳: 1700009317 
 Training_iteration: 10
 Trial_id: bf064_00007

Trial train_cifar_bf064_00007 已完成。
== 状态 ==
当前时间:2023-11-15 00:48:42(运行时间为 00:06:59.63)
使用 AsyncHyperBand : num_stopped=9
括号: Iter 8.000: -1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [7, 8000] 损失: 0.307
== 状态 ==
当前时间: 2023-11-15 00:48:47 (运行 00:07:04.64)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [7, 10000] 损失: 0.243
== 状态 ==
当前时间: 2023-11-15 00:48:52 (运行 00:07:09.65)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:48:57(运行时间为 00:07:14.66)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 6 | 350.857 | 350.857 1.27775 | 0.5396 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.5466
 日期:2023-11-15_00-48-58
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:7
 损失:1.2611191002845765
 node_ip:1 72.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:400.0100951194763 \ n time_this_iter_s:49.153398513793945 \ n time_total_s:400.0100951194763 \ n时间戳:1700009338 \ n Training_it版本: 7
 Trial_id: bf064_00005

== 状态 ==
当前时间:2023-11-15 00:49:03(运行时间为 00:07:20.43)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [8, 2000] 损失: 1.206
== 状态 ==
当前时间: 2023-11-15 00:49:08 (运行 00:07:25.44)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:49:13(运行时间为 00:07:30.45)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [8, 4000] 损失: 0.590
== 状态 ==
当前时间: 2023-11-15 00:49:18 (运行 00:07:35.46)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [8, 6000] 损失: 0.403
== 状态 ==
当前时间: 2023-11-15 00:49:23 (运行 00:07:40.47)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:49:28(运行时​​间为 00:07:45.47)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [8, 8000] 损失: 0.297
== 状态 ==
当前时间: 2023-11-15 00:49:33 (运行 00:07:50.48)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:49:38(运行时​​间为 00:07:55.49)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [8, 10000] 损失: 0.238
== 状态 ==
当前时间: 2023-11-15 00:49:43 (运行 00:08:00.50)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.8288495283782482 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 7 | 400.01 | 1.26112 | 0.5466 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.5557
 日期:2023-11-15_00-49-45
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:8
 损失:1.2735305621564388
 node_ip:1 72.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:447.28948068618774 \ n time_this_iter_s:47.279385566711426 \ n time_total_s:447.28948068618774 \ n时间戳:1700009385 \ n训练_iteration: 8
 Trial_id: bf064_00005

== 状态 ==
当前时间:2023-11-15 00:49:50(运行时间为 00:08:07.71)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [9, 2000] 损失: 1.175
== 状态 ==
当前时间: 2023-11-15 00:49:55 (运行 00:08:12.72)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:50:00(运行时间为 00:08:17.73)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [9, 4000] 损失: 0.585
== 状态 ==
当前时间: 2023-11-15 00:50:05 (运行 00:08:22.74)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [9, 6000] 损失: 0.386
== 状态 ==
当前时间: 2023-11-15 00:50:10 (运行 00:08:27.74)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:50:15(运行时间为 00:08:32.75)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [9, 8000] 损失: 0.285
== 状态 ==
当前时间: 2023-11-15 00:50:20 (运行 00:08:37.76)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:50:25(运行时间为 00:08:42.77)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [9, 10000] 损失: 0.234
== 状态 ==
当前时间: 2023-11-15 00:50:30 (运行 00:08:47.78)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 8 | 447.289 | 447.289 1.27353 | 0.5557 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.5823
 日期:2023-11-15_00-50-32
 完成:false
 主机名:47e6634b378a
 iterations_since_restore:9
 损失:1.176440757805109
 node_ip:17 2.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:494.55520391464233 \ n time_this_iter_s:47.26572322845459 \ n time_total_s:494.55520391464233 \ n时间戳:1700009432 \ n Training_迭代: 9
 Trial_id: bf064_00005

== 状态 ==
当前时间:2023-11-15 00:50:37(运行时间为 00:08:54.98)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [10, 2000] 损失: 1.121
== 状态 ==
当前时间: 2023-11-15 00:50:42 (运行 00:08:59.99)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:50:47(运行时间为 00:09:05.00)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [10, 4000] 损失: 0.564
== 状态 ==
当前时间: 2023-11-15 00:50:52 (运行 00:09:10.00)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [10, 6000] 损失: 0.376
== 状态 ==
当前时间: 2023-11-15 00:50:57 (运行 00:09:15.02)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:51:02(运行时间为 00:09:20.02)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [10, 8000] 损失: 0.283
== 状态 ==
当前时间: 2023-11-15 00:51:07 (运行 00:09:25.03)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


== 状态 ==
当前时间:2023-11-15 00:51:12(运行时间为 00:09:30.04)
使用 AsyncHyperBand:num_stopped=9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


(func pid=4708) [10, 10000] 损失: 0.228
== 状态 ==
当前时间: 2023-11-15 00:51:17 (运行 00:09:35.05)
使用 AsyncHyperBand: num_stopped= 9
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 2.0/16 个 CPU, 0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(1 次运行,9 次终止)
+------------------------+------------+-----------------+--------------+------+--------+-------------+--------+-----------------+----------+------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00005 |跑步 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 9 | 494.555 | 494.555 1.17644 | 0.5823 |
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


train_cifar_bf064_00005 的结果:
 准确度:0.5684
 日期:2023-11-15_00-51-19
 完成:true
 主机名:47e6634b378a
 iterations_since_restore:10
 损失:1.2000473774269222
 node_ip: 172.17.0.2\ n pid:4708 \ n should_checkpoint:true \ n time_since_restore:541.2027287483215 \ n time_this_iter_s:46.6475248336792 \ n time_total_s:541.2027287483215 \ n时间戳:1700009479 \ n Training_iteration: 10
 Trial_id: bf064_00005

试用 train_cifar_bf064_00005 已完成。
==状态==
当前时间:2023-11-15 00:51:19(运行时间为00:09:36.63)
使用AsyncHyperBand:num_stopped=10
括号:Iter 8.000:-1.345710159623623 |迭代 4.000:-1.4287443323016167 |迭代 2.000:-1.4916184381961823 | Iter 1.000: -2.290968679594994
逻辑资源使用: 0/16 个 CPU、0/1 个 GPU (0.0/1.0 Accelerator_type:M60)
结果日志目录: /var/lib/jenkins/ray_results/train_cifar_2023-11-15_00-41-42\ n试验次数:10/10(10 次终止)
+------------------------+--------------------------------+--------------+------+------+- ----------+--------+--------------------+--------- +------------+
|试用名称 |状态 |洛克|批量大小 | l1 | l2 | LR |迭代器 |总时间(秒)|损失|准确度 |
|------------------------+------------+-----------------+--------------+------+-----+-------------+--------+--------------------+---------+--------------|
|火车_cifar_bf064_00000 |终止 | 172.17.0.2:2681 | 2 | 16 | 16 1 | 0.00213327 | 1 | 123.009 | 123.009 2.30391 | 2.30391 0.102 |
|火车_cifar_bf064_00001 |终止 | 172.17.0.2:2752 | 4 | 1 | 2 | 0.013416 | 1 | 67.5308 | 2.30819 | 0.1008 |
|火车_cifar_bf064_00002 |终止 | 172.17.0.2:3240 | 2 | 256 | 256 64 | 64 0.0113784 | 1 | 165.896 | 165.896 2.31673 | 0.1028 |
|火车_cifar_bf064_00003 |终止 | 172.17.0.2:3728 | 8 | 64 | 64 256 | 256 0.0274071 | 10 | 10 373.881 | 2.30798 | 0.1031 |
|火车_cifar_bf064_00004 |终止 | 172.17.0.2:4217 | 4 | 16 | 16 2 | 0.056666 | 0.056666 1 | 73.2094 | 2.33591 | 2.33591 0.0986 |
|火车_cifar_bf064_00005 |终止 | 172.17.0.2:4708 | 4 | 8 | 64 | 64 0.000353097 | 0.000353097 10 | 10 541.203 | 541.203 1.20005 | 0.5684 |
|火车_cifar_bf064_00006 |终止 | 172.17.0.2:5199 | 8 | 16 | 16 4 | 0.000147684 | 1 | 44.7568 | 2.27802 | 2.27802 0.1689 |
|火车_cifar_bf064_00007 |终止 | 172.17.0.2:5692 | 8 | 256 | 256 256 | 256 0.00477469 | 10 | 10 362.408 | 1.46589 | 1.46589 0.5389 |
|火车_cifar_bf064_00008 |终止 | 172.17.0.2:2752 | 8 | 128 | 128 256 | 256 0.0306227 | 0.0306227 1 | 47.3137 | 47.3137 2.18068 | 2.18068 0.2189 |
|火车_cifar_bf064_00009 |终止 | 172.17.0.2:5199 | 2 | 2 | 16 | 16 0.0286986 | 1 | 116.122 | 116.122 2.32186 | 2.32186 0.0994 |
+------------------------+------------+-------------------+--------------+------+-----+----------------+--------+--------------------+--------------------+-------------+


2023-11-15 00:51:19,257 INFO tune.py:945 -- Total run time: 576.72 seconds (576.62 seconds for the tuning loop).
Best trial config: {'l1': 8, 'l2': 64, 'lr': 0.0003530972286268149, 'batch_size': 4}
Best trial final validation loss: 1.2000473774269222
Best trial final validation accuracy: 0.5684
Files already downloaded and verified
Files already downloaded and verified
Best trial test set accuracy: 0.5715

如果运行代码,示例输出可能如下所示:

Number of trials: 10/10 (10 TERMINATED)
+-----+--------------+------+------+-------------+--------+---------+------------+
| ... |   batch_size |   l1 |   l2 |          lr |   iter |    loss |   accuracy |
|-----+--------------+------+------+-------------+--------+---------+------------|
| ... |            2 |    1 |  256 | 0.000668163 |      1 | 2.31479 |     0.0977 |
| ... |            4 |   64 |    8 | 0.0331514   |      1 | 2.31605 |     0.0983 |
| ... |            4 |    2 |    1 | 0.000150295 |      1 | 2.30755 |     0.1023 |
| ... |           16 |   32 |   32 | 0.0128248   |     10 | 1.66912 |     0.4391 |
| ... |            4 |    8 |  128 | 0.00464561  |      2 | 1.7316  |     0.3463 |
| ... |            8 |  256 |    8 | 0.00031556  |      1 | 2.19409 |     0.1736 |
| ... |            4 |   16 |  256 | 0.00574329  |      2 | 1.85679 |     0.3368 |
| ... |            8 |    2 |    2 | 0.00325652  |      1 | 2.30272 |     0.0984 |
| ... |            2 |    2 |    2 | 0.000342987 |      2 | 1.76044 |     0.292  |
| ... |            4 |   64 |   32 | 0.003734    |      8 | 1.53101 |     0.4761 |
+-----+--------------+------+------+-------------+--------+---------+------------+

Best trial config: {'l1': 64, 'l2': 32, 'lr': 0.0037339984519545164, 'batch_size': 4}
Best trial final validation loss: 1.5310075663924216
Best trial final validation accuracy: 0.4761
Best trial test set accuracy: 0.4737

为了避免浪费资源,大多数试验已提前停止。 表现最好的试验的验证准确度达到了约 47%, 这可以在测试集上得到证实。

这样’ 就可以了!您现在可以调整 PyTorch 模型的参数。

脚本的总运行时间: ( 10 分钟 0.422 秒)


我们一直在努力

apachecn/AiLearning

【布客】中文翻译组