String

约 12218 字大约 41 分钟

2025-06-21

1.string 的简介

string 其实不算 STL 的一部分（STL 的诞生的比 string 晚），但是和 STL 一起学习会更加容易融会贯通。

补充：由于 string 较早诞生，因此在查阅文档的时候，我们很容易发现一些冗余的接口，一些比较不通用的接口我放在文章的末尾，您若需要了解，直接前往 C++官网或 cplusplus 查询即可...

实际上 string 是类模板 basic_string 实例化出来的一个具体类，使用字符的顺序容器实现（String are objects represent sequences of characters. 字符串是表示字符序列的对象，也就是关于字符的顺序表），string 整个系列支持 char 的动态增长。

typedef basic_string<char> string;

补充：basic_string 的实例化除了 string（单字节），还有 wstring(宽字符、双字节)、u16string(16位字符，双字节)、u32string(32位字符，四字节)，它们各自维护的类型是 char、wchar_t、char16_t、char32_t，这些是属于编码的问题，不过最常用的还是 string 类。

2.string 的使用

这里我给 string 类大致的成员变量、成员函数、非成员重载，我将在后面详细介绍使用方法和一些细节：

//string 类的大概成员
class string
{
public:
    //1.构造函数
    string();
    //2.析构函数
    ~string();
    //3.赋值重载
    operator= ();
    //4.修改器
    push_back();
    pop_back();
    insert();
    erase();
    operator+=();
    swap();
    //5.元素访问
    operator[] (); at();
    front(); back();
    //6.迭代器
    begin(); end();
    rbegin(); rend();
    //7.容积
    size(); length(); capacity();
    //8.运算符
    operator> (); operator< (); operator== (); operator!= ();
    
private:
    char* _str; //指向由 new 开辟出来的动态空间，该空间存储字符串
    size_t _size; //string 对象对应字符串的大小/长度（不包含 '\0'）
    size_t _capacity; //string 对象的容量，在存储字符串的时候，末尾会多存储一个空字符，方便转化为 C 风格的字符串，在不同的环境扩容机制有些许不同，有的直接 2 倍，有的 1.5 倍
    
    const size_t npos; //静态成员
};
static const size_t npos = -1;

//9.非成员函数重载
operator<< (); 
operator>> ();

补充：上述类只是伪代码，更加详细的说明和描述您可以去 cplusplus-string 或 C++官网上查询...

2.1.成员函数

2.1.1.构造函数

作用	声明
default	`string();`
from c-string	`string(const char* s);`
copy	`string(const string& str);`
substring	`string(const string& str, size_t pos, size_t len = npos);`
fill	`string(size_t n, char c);`
range	`template <class InputIterator>` `string(InputIterator first, InputIterator last);`

//构造函数的使用
#include <iostream>
#include <string>
using std::cout;
using std::string;

void Test1()
{
    //string();
    /* 默认的构造函数，调用后会构造一个长度为 0 的空字符串（但是有一定的初始容量） */
    cout << "-- Test1():default --" << '\n';

    string str;
    cout << "大小 = " << str.size() << '\n';
    cout << "容量 = " << str.capacity() << '\n';
    cout << str << "\n\n";
}

void Test2()
{
    //string(const char* s); 
    /* 调用构造函数后，会使用 C 风格的字符串来构造 string 对象 str */
    cout << "-- Test2():from c-string --" << '\n';

    string str("limou3434"); 
    //std::string str = "limou3434"; //或者使用这种方法构造 string 的 str 对象
    cout << "大小 = " << str.size() << '\n';
    cout << "容量 = " << str.capacity() << '\n';
    cout << str << "\n\n";
}

void Test3()
{
    //string(const string& str);
    /* 对现有的 string 对象进行拷贝 */
    cout << "-- Test3():copy --" << '\n';

    string src = "Hello";
    string copy = src; //这里发生了拷贝构造，也就是复制 str 构造一个新的 string 对象。
    cout << "大小 = " << copy.size() << '\n';
    cout << "容量 = " << copy.capacity() << '\n';
    cout << src << "\n\n";
}

void Test4()
{
    //string(const string& str, size_t pos, size_t len = npos);
    /* 对现有的 string 对象 str 的部分的字符（字符串中从 pos 开始长度为 len 的子串部分）
    来拷贝构造一个新的 string 对象，如果没有指定 len 的值，或者 str 较短，则一直拷贝到 str 的结尾 */
    cout << "-- Test4():substring --" << '\n';

    string str = "limou3434";

    string sub1(str, 2, 4); //拷贝源字符串从 2 开始的长度为 4 的子串
    cout << sub1 << '\n';

    string sub2(str, 2); //拷贝源字符串从索引 2 开始往后的所有字符
    cout << sub2 << '\n';

    string sub3(str, 2, 1000); //源字符串太小，实际只取得了 2 开始完后的所有字符
    cout << sub3 << "\n\n";
}

void Test5()
{
    //string(size_t n, char c);
    /* 用 n 个相同的 c 字符连续填充字符串。 */
    cout << "-- Test5():fill --" << '\n';

    string str(5, 'c');
    cout << str << "\n\n";
}

void Test6()
{
    //template <class InputIterator>
    //string(InputIterator first, InputIterator last);
    /* 和源 string 对象相同的顺序，来复制源 string 对象中
    迭代器范围为 [first,last) 的字符序列，构造出新的 string 对象
    由于我们还没有学过迭代器，因此我们直接上代码观看迭代器的使用即可*/

    //直接初始化一个字符串
    cout << "-- Test6():range --" << '\n';
    string str1("Hello World");
    cout << "str1(\"Hello World\") = "
        << str1 << std::endl;

    //使用迭代器范围构造一个新的字符串
    string str2(str1.begin() + 1, str1.begin() + 7); //这里 begin() 就是迭代器的使用，您可以简单理解为有一个指针，指向了这个 string 对象的字符串开头位置，迭代器的出现就是为了能是使得一些对象可以类似指针一样使用

    //输出子串
    cout << "str2(str1.begin() + 1, str1.begin() + 7) = "
        << str2 << std::endl;
}

int main()
{
    Test1();
    Test2();
    Test3();
    Test4();
    Test5();
    Test6();
    return 0;
}

补充：这里摘去了构造函数 string (const char* s, size_t n);...

2.1.2.析构函数

作用	声明
destructor	`~string();`

~string() 会释放 string 对象所使用的堆空间（这些堆空间是由分配器来分配的，了解一下就可以），从这也可以侧面看出 string 类实际是一个顺序表，析构函数会在 string 对象声明周期结束时自动调用。

2.1.3.赋值重载

作用	声明
string (1)	`string& operator= (const string& str);`
c-string (2)	`string& operator= (const char* s);`
character (3)	`string& operator= (char c);`

上述的作用基本都是将一个新值赋给已定义后的 string 对象，用新值来替换该 string 对象的内容。

//赋值重载的使用
#include <iostream>
#include <string>
using std::cout;
using std::string;

int main()
{
    string str1, str2, str3, str4; //创建多个 string 对象

    //string& operator= (const char* s);
    str1 = "hello"; //赋值 C 风格的字符串
    cout << str1 << '\n';

    //string& operator= (char c);
    str2 = 'x'; //赋值单个字符
    cout << str2 << '\n';

    //string& operator= (const string& str);
    str3 = str1 + str2; //赋值 string 对象
    cout << str3 << '\n';

    return 0;
}

2.1.4.修改器

作用	声明
Append character to string	`void push_back(char c);`
Delete last character	`void pop_back();`
Insert into string	`string& insert(size_t pos, const string& str);`
Erase characters from string	`string& erase(size_t pos, size_t len = npos);`
Append to string	`string& operator+=(const string& str);`
Swap string values	`void swap(string& str);`

2.1.4.1.operator+=()

用得比较多。

//重载声明
string& operator+= (const string& str);
string& operator+= (const char* s);
string& operator+= (char c);

2.1.4.2.append()

尾插，即便有很多重载，但是用的比较少，这里只给出声明，您可以自己玩玩看。

//重载声明
string& append (const string& str);

string& append (const string& str, size_t subpos, size_t sublen);

string& append (const char* s);

string& append (const char* s, size_t n);

string& append (size_t n, char c);

template <class InputIterator>  
string& append (InputIterator first, InputIterator last);

string& append (initializer_list<char> il);

2.1.4.3.push_back ()

尾插单字符，爷用的比较少，您也可以自己玩玩。

//函数声明
void push_back (char c);

添加字符 c 到 string 对象的末尾，并且 size() 长度 +1。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdef");
    cout << "str = " << str << endl;
    cout << "str.size() = " << str.size() << endl;
    str.push_back('x');
    cout << "str = " << str << endl;
    cout << "str.size() = " << str.size() << endl;
    return 0;
}

可以拿尾插类的函数来测试环境的扩容机制：

#include <iostream>
#include <string>
using namespace std;
#define NUMBER 100
void test()
{
    string str;
    size_t old = str.capacity();
    cout << "Init:" << old << endl;
    size_t newValue = 0;
    for (size_t i = 0; i < NUMBER; i++)
    {
        str.push_back('x');//推送
        newValue = str.capacity();//记录新值
        if (newValue != old)//新值和旧值对比
        {
            cout << "dilatation = " << newValue << " ";
            old = newValue;
            cout << endl;
        }
    }
    cout << endl;
}
int main()
{
    test();
    return 0;
}

2.1.4.4.instert()

//重载声明
string& insert (size_t pos, const string& str);//插入 string 对象内的字符串

string& insert (size_t pos, const string& str, size_t subpos, size_t sublen);//插入子串，注意这里后两个参数不是输入子串的首尾索引，而是输入子串的起始索引和子串长度

string& insert (size_t pos, const char* s);//插入 C 风格的字符串

string& insert (size_t pos, const char* s, size_t n);////插入 C 风格的字符串，并且限定长度

string& insert (size_t pos, size_t n, char c);//重复插入 n 个 c 字符

iterator insert (const_iterator p, size_t n, char c);//使用迭代器来，插入重复的字符，并返回指向第一个插入元素的迭代器

iterator insert (const_iterator p, char c);//使用迭代定位，插入一个字符，并返回指向第一个插入元素的迭代器

template <class InputIterator>
iterator insert (iterator p, InputIterator first, InputIterator last);//将迭代器[first, last)的字符插入到目标 string 对象的迭代器定位处

string& insert (const_iterator p, initializer_list<char> il);//在迭代器定位处插入字符序列

可以看到这个函数有很多重载版本，让我们来用一个测试例子研究一下，不过这个函数能不用就最好别用，效率不高。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    cout << "1.string& insert (size_t pos, const string& str);" << endl;
    string str1("abcdef");
    cout << str1 << endl;
    string sstr1 = "xxxxx";
    str1.insert(0, sstr1);
    cout << str1 << endl << endl;
    cout << "2.string& insert (size_t pos, const string& str, size_t subpos, size_t sublen);" << endl;
    string str2("abcdef");
    cout << str2 << endl;
    string sstr2 = "ABCDEEEEFGHIJK";
    str2.insert(str2.size(), sstr2, 4, 4);
    cout << str2 << endl << endl;

    cout << "3.string& insert (size_t pos, const char* s);" << endl;
    string str3 = "abcdef";//这是一个 C 风格的字符串
    cout << str3 << endl;
    const char* sstr3 = "__________";
    str3.insert(str3.size(), sstr3);
    cout << str3 << endl << endl;;

    cout << "4.string& insert (size_t pos, const char* s, size_t n);" << endl;
    string str4("abcdef");
    cout << str4 << endl;
    const char* sstr4 = "bbbbbxxxxxxxxxxxxxx";//只想截取长度为前 5 个字符的字符
    str4.insert(1, sstr4, 5);
    cout << str4 << endl << endl;;

    cout << "5.string& insert (size_t pos, size_t n, char c);" << endl;
    string str5("ABCDEFG");
    cout << str5 << endl;
    str5.insert(1, 20, '+');
    cout << str5 << endl << endl;

    cout << "6.iterator insert(const_iterator p, size_t n, char c);" << endl;
    string str6  = "QWERT";
    str6.insert(str6.begin() + 3, 10, 'c');
    cout << str6 << endl << endl;

    cout << "7.iterator insert (const_iterator p, char c);" << endl;
    string str7 = "AAAAAAAAA";
    str7.insert(str7.end(), 'c');
    cout << str7 << endl << endl;

    cout << "8.template <class InputIterator> " << endl;
    cout << "  iterator insert(iterator p, InputIterator first, InputIterator last);" << endl;
    string str8 = "XXXX";
    string sstr8 = "|bcde|";
    str8.insert(str8.begin(), sstr8.begin() + 1, sstr8.end() - 1);
    cout << str8 << endl << endl;

    cout << "9. string& insert (const_iterator p, initializer_list<char> il);" << endl;
    string str9 = "XXXX";
    str9.insert(str9.begin() + 2, { 'a', 'b', 'c' });
    cout << str9 << endl << endl;

    return 0;
}

2.1.4.5.erase()

string& erase (size_t pos = 0, size_t len = npos);//全部删除（默认）

iterator erase (const_iterator p);//使用迭代器删除指定位置

iterator erase (const_iterator first, const_iterator last);//

这个函数主要是用来擦除字符串的一部分，并减小 size() 的值，直接使用则默认为全部删除，这个函数能不用就最好别用，也是效率不太高。

string str = "abcdef";
cout << str << endl;

str.erase(str.begin() + 3);
cout << str << endl;

str.erase(str.begin(), str.end() - 2);
cout << str << endl;

str.erase();
cout << str << endl;

2.1.4.6.replace()

//string (1)    
string& replace (size_t pos,        size_t len,        const string& str);
string& replace (const_iterator i1, const_iterator i2, const string& str);

//substring (2)    
string& replace (size_t pos, size_t len,        const string& str, size_t subpos, size_t sublen);

//c-string (3)    
string& replace (size_t pos, size_t len, const char* s);
string& replace (const_iterator i1, const_iterator i2, const char* s);

//buffer (4)    
string& replace (size_t pos, size_t len,        const char* s, size_t n);
string& replace (const_iterator i1, const_iterator i2, const char* s, size_t n);

//fill (5)    
string& replace (size_t pos, size_t len,        size_t n, char c);
string& replace (const_iterator i1, const_iterator i2, size_t n, char c);

//range (6)    
template <class InputIterator>
  string& replace (const_iterator i1, const_iterator i2, InputIterator first, InputIterator last);

//initializer list (7)    
string& replace (const_iterator i1, const_iterator i2, initializer_list<char> il);

该函数主要用于替换，参数也很多，您可以稍微玩一下：

string str = "abc def";
cout << str << endl;

str.replace(3, 2, "&%#");//将 str 索引为 3 起始的长度为 2 的子串替换成 "&%#"
cout << str << endl;

也不太建议使用，效率比较低下，需要的时候可以使用一下。当然我们也可以自己设计一个算法解决：

int main()
{
    //设计一个算法让下面 string 对象的空格改为 20%
    string str = "abc def";
    string strx;
    cout << str << endl;
    //空间换时间
    for (auto ch : str)
    {
        if (ch != ' ')
        {
            strx += ch;
        }
        else
        {
            strx += "20%";
        }
    }
    //str = strx;
    //或者使用全局的 swap()，但是使用这个会效率很低，内部有一次拷贝构造两次赋值，很糟糕
    str.swap(strx);//string 的 swap()效率就会高很多，这个成员函数只是交换了指针指向，效率很高
    cout << str << endl;
    //这样的效率都会比较高

    //不过需要注意的是，就算我们直接使用全局的 swap()也不会真的调用到最麻烦的哪一个，因为库已经针对全局的 swap()进行了重载，内部调用的也是成员的 swap()
    return 0;
}

2.1.4.7.swap()

void swap (string& str);

成员交换函数，效率比全局的要高，就是前面代码使用的那一个函数。后面介绍的全局 swap() 调用的就是这个成员函数。

补充：这里摘去了 assign()...

2.1.5.元素访问

2.1.5.1.operator []

//重载形式
char& operator[] (size_t pos);
const char& operator[] (size_t pos) const;

pos 的起始位置为 0，这意味这和数组的使用形式类似。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdef");
    for (int i = 0; str[i]; i++)
    {
        cout << str[i] << " ";
    }
    return 0;
}

如果 pos 等于 string 对象的 size 值，这个符号重载函数就会返回一个指向空字符的引用，该字符位于 string 对象最后一个字符之后（这个字符不可被修改，只所以可以被允许访问的其中一个原因就是方便写循环终止条件，就像上面的循环例子）。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdef");
    cout << "空字符->[" << str[str.size()] << "]" << endl;//可以看到空字符没有办法打印出来
    return 0;
}

2.1.5.2.at()

//重载声明
char& at (size_t pos);
const char& at (size_t pos) const;

该函数可以获取字符串中的字符，返回一个 pos 指向字符的引用，该函数自动检查 pos 是否是 string 对象中合法的字符索引，非法则抛出异常。

那么 at() 和 [] 有什么区别呢？答案在于自动检查出现非法时，例如下面的代码：

#include <iostream>
#include <string>
using namespace std;
int main()
{
    try
    {
        string str("abcdef");
        cout << "str = " << str << endl;
        cout << "str.size() = " << str.size() << endl;
        //cout << " str[7] = " << str[7] << endl;//第一情况
        //cout << " at(7) = " << str.at(7) << endl;//第二情况
    }
    catch (const std::out_of_range& e)//出现异常时执行下面代码
    {
        cout << "error" << endl;
    }
    return 0;
}

使用 [] 的报错和使用 at 的报错不一样。总的来说，抛异常会更加安全（并且抛异常是面向对象语言的一个显著特征）具体的抛异常细节我们以后再来学习……

2.1.5.3.back()

//重载声明
char& back();
const char& back() const;

该函数返回最后一个有效字符的引用（不是指空字符），此函数不能再空 string 对象上调用。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str1("abcdef");
    cout << "str1 = " << str1 << endl;
    cout << "str1.size() = " << str1.size() << endl;
    cout << "str1.back() = " << str1.back() << endl;
    string str2("");
    cout << "str2 = " << str2 << endl;
    cout << "str2.size() = " << str2.size() << endl;
    //cout << " str2.back() = " << str2.back() << endl;//会出现报错
    return 0;
}

2.1.5.4.front()

char& front();
const char& front() const;

类似 end() 的使用，不过获取到字符是开头的字符。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str1("abcdef");
    cout << "str1 = " << str1 << endl;
    cout << "str1.size() = " << str1.size() << endl;
    cout << "str1.front() = " << str1.front() << endl;
    string str2("");
    cout << "str2 = " << str2 << endl;
    cout << "str2.size() = " << str2.size() << endl;
    //cout << " str2.front() = " << str2.front() << endl;//直接会出现报错
    return 0;
}

2.1.6.迭代器

这里应该是我们第一次使用迭代器，迭代器实际上是方便遍历容器的一种方案，让程序员不再关注底层细节就可以编译一个容器（实际上迭代器就是 C++ 封装的一种体现）。

在没有学习迭代器之前，如果我们需要遍历一个 string 对象需要怎么做呢？目前我们最常用的方法就是使用循环语句。

2.1.6.1.for 遍历

//使用 for 语句遍历字符串对象
#include <iostream>
#include <string>
int main()
{
    std::string str("abcd");
    for (int i = 0; i < str.size(); i++)
    {
        std::cout << str[i] << " "; //[] 运算符重载
    }
    return 0;
}

补充：这里使用了 string 类的成员函数 size()，这个函数专门计算 string 对象的大小，这在后面讲解 string 类的成员函数时，还会进行讲解。

2.1.6.2.迭代器遍历

迭代器的行为十分类似指针（但其底层实现不一定是指针）使得 string 对象可以像数组一样被使用：

//使用迭代器遍历字符串对象
#include <iostream>
#include <string>
int main()
{
    std::string str("abcd");
    std::string::iterator it = str.begin();//这里如果类型名字太长，可以使用 auto 来自动推导
    while (it != str.end())//这里不推荐使用'<'符号
    {
        std::cout << *it << " ";
        (*it)++;
        it++;
    }
    std::cout << std::endl << str << std::endl;
    return 0;
}

补充：在 while (it != str.end()) 这里不推荐使用 < 符号，我们之前说过，迭代器的行为类似指针，而实际上 string 迭代器的底层很有可能就是指针，因此在这里使用 < 是可能被允许的，但最好不要这么做。这是因为，其他类底层的迭代器不一定是靠纯指针实现的，例如：链表，因此使用 < 会有风险。

我们甚至可以使用反向迭代器来快速反向遍历。

//使用反向迭代器遍历字符串对象
#include <iostream>
#include <string>

int main()
{
    std::string str("abcd");
    std::string::reverse_iterator rit = str.rbegin();
    while (rit != str.rend())
    {
        std::cout << *rit << " ";
        (*rit)++;
        rit++;//注意依旧是++，而不是--
    }
    std::cout << std::endl << str << std::endl;
    return 0;
}

迭代器不仅仅是 string 类使用，基本所有库里的类都可以使用迭代器遍历，迭代器有几种便利之处：

通用，在一些不支持 [] 重载的结构也可使用迭代器
是无需考虑差 1 问题
屏蔽底层实现细节，可以直接使用

针对不同的对象和使用方法，迭代器可以分为下面几类迭代器。

类别	可读可修改	可读不可修改
正向迭代器	`iterator begin() noexcept;` `iterator end() noexcept;`	`const_iterator begin() const noexcept;` `const_iterator end() const noexcept;`
反向迭代器	`reverse_iterator rbegin() noexcept;` `reverse_iterator rend() noexcept;`	`const_reverse_iterator rbegin() const noexcept;` `const_reverse_iterator rend() const noexcept;`

类别	函数声明
普通对象的常量迭代器	`const_iterator cbegin() const noexcept;` `const_iterator cend() const noexcept;`
常量对象的常量迭代器	`const_reverse_iterator crbegin() const noexcept;` `const_reverse_iterator crend() const noexcept;`

其中 iterator 是一个类模板：

//iterator 类
template <class T>
class iterator 
{ 
    //1.正向迭代器
    //迭代器的成员函数和操作符重载定义
    iterator begin() noexcept;
    const_iterator begin() const noexcept;
    iterator end() noexcept;
    const_iterator end() const noexcept; 
    
    //2.反向迭代器
    reverse_iterator rbegin() noexcept;
    const_reverse_iterator rbegin() const noexcept;
    reverse_iterator rend() noexcept;
    const_reverse_iterator rend() const noexcept;
    //...
};

2.1.6.3.范围 for 遍历

//使用范围 for 遍历字符串对象
#include <iostream>
#include <string>

int main() 
{
    std::string str = "Hello";
    //使用范围 for 循环遍历字符串中的每个字符
    for (auto ch : str) 
    {
        std::cout << ch << " ";
    }
    return 0;
}

实际上范围 for 的底层代码就是迭代器，只不过范围 for 封装得更加厉害罢了（在一些编译器内部调试的时候可以切到范围 for 的汇编语句来检验两者关系）。因此，只要对应的容器类支持迭代器，就自动支持范围 for 的使用。

2.1.7.容积

类别	函数声明
获取长度和容量	`size_t size() const noexcept;` `size_t length() const noexcept;` `size_t capacity() const noexcept;`
改变长度和容量	`void resize (size_t n);` `void resize (size_t n, char c);` `void reserve (size_t n = 0);`
获取环境最大容量数值	`size_t max_size() const noexcept;`
清理对象内容	`void clear() noexcept;`
判空	`bool empty() const noexcept;`

2.1.7.1.size() 和 length() 和 capacity()

首先我们有一个问题，为什么会有两个计算长度的成员函数呢？这是因为 string 类得诞生比 STL 早，最开始设计的是 length()，后来出现了 STL 才开始使用了（并且建议使用）size()。两个都可以计算 string 对象内部字符串长度，在功能上是等效的。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdefghijklmn");
    cout << str.size() << endl;
    cout << str.length() << endl;
    return 0;
}

补充：如果查看其他的结构会发现都是使用 size()，而没有 length()

而另外一个函数 capacity() 则可以求得当前 string 对象的容量。

2.1.7.2.clear()

这个成员函数会对 string 对象的字符串作清理，但是不会释放空间，同时原 string 的容量不变，对象的 siez() 值置为空。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdefghijklmn");
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl;
    str.clear();
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.length() << endl;

    return 0;
}

2.1.7.3.max_size()

这个成员函数用于告知用户该环境下的 string 对象最多可以申请到多少字节的空间。

但是这个值是写死的，只是在不同环境下会不同罢了，因此基本没怎么用到这个。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str1("abcdefghijklmn");
    cout << str1.max_size() << endl;
    string str2("abcdefghijklmn");
    cout << str2.max_size() << endl;
    return 0;
}

2.1.7.4.resize()

resize() 可以将 string 对象的大小（即：size() 的值），有以下集几种情况：

如果 n 小于 string 对象原有的大小，则删除 n 个字符以外的字符（这个操作比较危险）
如果 n 大于 string 对象原有的大小，则尾插任意字符（默认表现为空字符 \0）扩充到当前的内容，使得大小达到 n。因此如果有填充某个字符串的需求，也是可以用这个函数的
如果指定了字符 c，则多开辟的空间初始化为字符 c

#include <iostream>
#include <string>
using namespace std;
int main()
{
    //1.创建一个 "abcde" 为内容的 string 对象，size()值为 5，capacity()默认值为 15
    string str("abcde");
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;
    //2.调整 size()值为 20 的 string 对象，而在 VS2022 中表现为填充 15 个空字符'\0'，而空字符是没有打印出来的
    str.resize(20);
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    //3.调整 size()值为 30 的 string 对象，因此：前 5 个字符为 "abcde"，后面有 15 个'\0'，并且还有 10 个'c'字符，整体打印出 "abcdecccccccccc"
    str.resize(30, 'c');
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    //4.缩小 size()值，并且可以看到，扩容的空间依旧有效
    str.resize(5);
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    return 0;
}

2.1.7.5.reserve()

这个函数请求改变容量，要求将 string 对象调整到大小为 n 的容量（或者比 n 更大）这个函数对字符串的长度没有影响，也不能改变它的内容。

如果使用者做了缩小容量的操作，并且影响到原有 string 对象字符串的存储时，该函数会拒绝请求（即：non-binding 无约束力的）。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    //1.创建一个 "abcde" 为内容的 string 对象，size()值为 5，capacity()默认值为 15
    string str("abcde");
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;
    //2.调整 capacity()值为 20 的 string 对象
    str.reserve(20);
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    //3.调整 capacity()值为 30 的 string 对象
    str.reserve(30);
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    //4.减小 capacity()值为 4 的 string 对象（失败请求）
    str.reserve(4);
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    //4.增大 capacity()值为 10 的 string 对象
    str.reserve(10);
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    return 0;
}

注意：函数名 reserve 有时候会和单词 reversal（反转）搞混，这个注意一下就行……

2.1.7.6.empty()

该函数返回字符串是否为空（即：size() 是否为 0)，这个函数不会以任何方式修改字符串的值。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdef");
    cout << str.empty() << endl;
    str.resize(0);//为空返回 true
    cout << str.empty() << endl;
    return 0;
}

2.1.7.7.shrink_to_fit()

shrink to fit 这个词的意思就是“缩小以适应”，该函数请求减少容量 capacity 来适应大小 size，当然这只是一种请求（即：non-binding 无约束力的）。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcde");
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;
    str.reserve(1000);//明明大小只有 5，容量确高达 1000 多，浪费资源
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    str.shrink_to_fit();//使用该函数请求缩小容量（非强制）
    cout << str << endl;
    cout << str.size() << endl;
    cout << str.capacity() << endl << endl;

    return 0;
}

2.1.8.运算符

2.2.非成员函数重载

这里有两个比较麻烦的函数...

9.字符串运算（String operations）

9.1.c_str()和 data()

//函数声明
const char* c_str() const noexcept;
const char* data() const noexcept;

有时候为了和 C 兼容，需要使用 C 风格的字符串（以 \0）。而 c_str() 可以使得 string 对象返回底层的 C 风格的字符串成员变量，从这里我们也可以看到，虽然 string 不以 \0 作为结尾标志，但是依旧会在构造 string 对象的时候存储 \0，以保证能与 C 兼容。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("abcdef");
    cout << "str" << " = " << str << " 类型：" << typeid(str).name() << endl << endl;
    char* ch;
    cout << typeid(ch).name() << endl;
    cout << "str.c_str()" << " = " << str.c_str() << " 类型：" << typeid(str.c_str()).name() << endl;
    cout << "str.data()" << " = " << str.data() << " 类型：" << typeid(str.data()).name() << endl;
    return 0;
}

而 data() 和 s_str() 的使用是等价的，没有太大区别。

#include <iostream>
#include <string>
#include <cstring>
using namespace std;
int main()
{
    int length;
    string str = "Test string";
    const char* cstr = "Test string";

    if (str.length() == strlen(cstr))
    {
        cout << "1" << endl;

        if (memcmp(cstr, str.data(), str.length()) == 0)
        {
            cout << "1" << endl;
        }
    }
    return 0;
}

9.2.cpoy()

//函数声明
size_t copy (char* s, size_t len, size_t pos = 0) const;

将 string 对象当前值，从 pos 位置开始的长度为 len 子串拷贝到 s 指向的数组中。

#include <iostream>
#include <string>
int main()
{
    std::string str("abcd");
    char s[3] = { 0 };//如果这里没有加入初始化{ 0 }，那么后续打印 s 数组就会出现乱码，原因就是找不到空字符
    str.copy(s, 2);
    std::cout << s << std::endl;
    return 0;
}

9.3.find()和 rfind()

size_t find (const string& str, size_t pos = 0) const noexcept;

size_t find (const char* s, size_t pos = 0) const;

size_t find (const char* s, size_t pos, size_type n) const;

size_t find (char c, size_t pos = 0) const noexcept;

该函数可以寻找字符串或者 string 对象中的内容，。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    //1.寻找 string 子串
    string str1("abcd-abcd-abcd");
    string str2("abcd");
    cout << str1.find(str2, 0) << endl;
    cout << str1.find(str2, 1) << endl;
    cout << str1.find(str2, 6) << endl << endl;
    //2.寻找 C 风格子串
    string str3("ABCD-ABCD-ABCD");
    const char* str4 = "ABCD";
    cout << str3.find(str4, 0) << endl;
    cout << str3.find(str4, 1) << endl;
    cout << str3.find(str4, 6) << endl << endl;

    //3.在范围内寻找 C 风格子串
    string str5("ABCD-AB-ABCD-AB-ABCD");
    const char* str6 = "ABCD";
    cout << str5.find(str6, 0, 2) << endl;
    cout << str5.find(str6, 1, 2) << endl;
    cout << str5.find(str6, 6, 2) << endl;
    cout << str5.find(str6, 9, 2) << endl;
    cout << str5.find(str6, 14, 2) << endl << endl;

    //4.寻找字符
    string str7("xxxxxxx_xxxx_xxxxxxxxxx_");
    char ch = '_';
    int j = -1;
    for (int i = 0; i < str7.size(); i++)
    {
        int z = str7.find(ch, i);
        if (j != z)
        {
            cout << z << " ";
            j = z;
        }
    }
    return 0;
}

而 rfind() 就是逆向查找，具体定义看下文档就行，使用和 find() 类似。

//重载声明
size_t rfind (const string& str, size_t pos = npos) const noexcept; 

size_t rfind (const char* s, size_t pos = npos) const; 

size_t rfind (const char* s, size_t pos, size_t n) const;

size_t rfind (char c, size_t pos = npos) const noexcept;

9.4.substr()

//函数声明
string substr (size_t pos = 0, size_t len = npos) const;

这个函数可以用来生成一个新的子串对象（从源 string 对象的 pos 开始的长度为 len 的子字符串）。

#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str("awwwwf");
    cout << str.substr(1, 4) << endl;
    return 0;
}

9.5.compare()

//重载函数
int compare (const string& str) const noexcept;//比较两个 string 对象

int compare (size_t pos, size_t len, const string& str) const;//在源 string 对象的子串和 str 对象比较

int compare (size_t pos, size_t len, const string& str, size_t subpos, size_t sublen) const;//在源 string 对象的子串内和 str 对象的子串比较

int compare (const char* s) const;//和 C 风格的字符串比较

int compare (size_t pos, size_t len, const char* s) const;//string 对象的子串和 C 风格的字符串比较

int compare (size_t pos, size_t len, const char* s, size_t n) const;//string 对象的子串和 C 风格的字符串的子串比较

该函数主要是比较字符大小，和 C 语言的 strcmp() 类似返回指定的字符序列。

#include <iostream>
#include <string>
using namespace std;
// comparing apples with apples
#include <iostream>
#include <string>
int main()
{
    string str1("green apple");
    string str2("red apple");
    //g < r <=> g - r < 0
    //"green apple" < " red apple "
    if (str1.compare(str2) < 0)
        cout << str1 << " is not " << str2 << endl;

    //"apple" == "apple"
    if (str1.compare(6, 5, "apple") == 0)
        cout << "still, " << str1 << " is an apple" << endl;

    //"apple" == "apple"
    if (str2.compare(str2.size() - 5, 5, "apple") == 0)
        cout << "and " << str2 << " is also an apple" << endl;

    //"apple" == "apple" 
    if (str1.compare(6, 5, str2, 4, 5) == 0)
        cout << "therefore, both are apples" << endl;

    const char* str3 = "green apple";
    if (str1.compare(str3) == 0)
        cout << "one and the same" << endl;

    return 0;
}

不过这个函数不怎么使用，原因是后面有很多的判断符号重载，这些重载更加方便。

9.6.find_first_of()

//重载函数
size_t find_first_of (const string& str, size_t pos = 0) const noexcept;

size_t find_first_of (const char* s, size_t pos = 0) const;

size_t find_first_of (const char* s, size_t pos, size_t n) const;

size_t find_first_of (char c, size_t pos = 0) const noexcept;

在字符串中查找第一次出现指定字符集合中任何一个字符的位置。

int main()
{
    string str1 = "xxx_big_apple_xxx";
    string str2 = "be";
    cout << str1.find_first_of(str2, 0) << endl;//输出 4
    cout << str1.find_first_of(str2, 5) << endl;//输出 12
    return 0;
}

吐槽：这个函数的名字真不好理解……

9.7.find_last_of()

//重载函数
size_t find_last_of (const string& str, size_t pos = npos) const noexcept;

size_t find_last_of (const char* s, size_t pos = npos) const;

size_t find_last_of (const char* s, size_t pos, size_t n) const;

size_t find_last_of (char c, size_t pos = npos) const noexcept;

在字符串中查找最后一次出现指定字符集合中任何一个字符的位置。

int main()
{
    string str1 = "xxx_big_apple_xxx";
    string str2 = "be";
    cout << str1.find_last_of(str2, str1.size() - 1) << endl;//输出 12
    cout << str1.find_last_of(str2, 11) << endl;//输出 4
    return 0;
}

9.8.find_first_not_of()

//重载函数
size_t find_first_not_of (const string& str, size_t pos = 0) const noexcept;

size_t find_first_not_of (const char* s, size_t pos = 0) const;

size_t find_first_not_of (const char* s, size_t pos, size_t n) const;

size_t find_first_not_of (char c, size_t pos = 0) const noexcept;

这个函数可以查找不是指定字符集合中的任何一个字符的位置。

下面是一个例子（制作屏蔽词语）：

int main()
{
    string str("saldgcaslucdgleakjslaejcsfkajsasducgiasfg");
    size_t index = str.find_first_not_of("abcdef", 0);
    cout << str << endl;
    while (index != string::npos)
    {
        str[index] = '*';
        index = str.find_first_not_of("abcdef", index + 1);
    }
    cout << str << endl;
    return 0;
}

9.9.find_last_not_of()

//重载函数
size_t find_last_not_of (const string& str, size_t pos = npos) const noexcept;

size_t find_last_not_of (const char* s, size_t pos = npos) const;

size_t find_last_not_of (const char* s, size_t pos, size_t n) const;

size_t find_last_not_of (char c, size_t pos = npos) const noexcept;

上面这个函数就是从后面开始查找，这里就不再举例了……

9.10.get_allocator()

//函数声明
allocator_type get_allocator() const noexcept;

这个函数就比较复杂了，您可以先去查看一下文档，或者以后我们有机会再来详谈……

9.11.assign()

//重载函数
string& assign (const string& str);

string& assign (const string& str, size_t subpos, size_t sublen);

string& assign (const char* s);

string& assign (const char* s, size_t n);

string& assign (size_t n, char c);

template <class InputIterator>
   string& assign (InputIterator first, InputIterator last);

string& assign (initializer_list<char> il);

string& assign (string&& str) noexcept;

这个函数也不经常使用，实际在一种赋值的工作：

int main()
{
    string str1 = "abcdef";
    string str2 = "xxxxxx";
    str1.assign(str2);
    cout << str1 << endl;
    cout << str2 << endl;
    return 0;
}

10.常量成员（Member constants）

string 类的常量成员，也就是前面介绍的 npos，之前已经简单提到过，这里就放一下 npos 的定义使得本文章足够完整：

//常量定义
static const size_t npos = -1;

11.非成员函数重载（Non-member function overloads）

11.1.operator+()

string operator+ (const string& lhs, const string& rhs);

string operator+ (const string& lhs, const char* rhs);
string operator+ (const char* lhs, const string& rhs);

string operator+ (const string& lhs, char          rhs);
string operator+ (char lhs, const string& rhs);

+ 的代价要比 += 大，因此非必要不要轻易使用 +，这里就不演示 + 的使用了，您玩一下就行。

11.2.relational operators

一些比较运算符重载：

//(1)
bool operator== (const string& lhs, const string& rhs);
bool operator== (const char*   lhs, const string& rhs);
bool operator== (const string& lhs, const char*   rhs);

//(2)    
bool operator!= (const string& lhs, const string& rhs);
bool operator!= (const char*   lhs, const string& rhs);
bool operator!= (const string& lhs, const char*   rhs);

//(3)    
bool operator<  (const string& lhs, const string& rhs);
bool operator<  (const char*   lhs, const string& rhs);
bool operator<  (const string& lhs, const char*   rhs);

//(4)    
bool operator<= (const string& lhs, const string& rhs);
bool operator<= (const char*   lhs, const string& rhs);
bool operator<= (const string& lhs, const char*   rhs);

//(5)    
bool operator>  (const string& lhs, const string& rhs);
bool operator>  (const char*   lhs, const string& rhs);
bool operator>  (const string& lhs, const char*   rhs);

//(6)    
bool operator>= (const string& lhs, const string& rhs);
bool operator>= (const char*   lhs, const string& rhs);
bool operator>= (const string& lhs, const char*   rhs);

内部使用 string::compare() 函数来进行比较。

11.3.swap()

//重载声明
void swap (string& str);

全局交换函数，内部调用了成员函数 swap()，这个重载是为了防止调用到低效的全局交换函数。

11.4.operator >>()和 operator <<()

istream& operator>> (istream& is, string& str);
ostream& operator<< (ostream& os, const string& str);

这个就是辅助 string 进行流插入和流输出的重载，也是看下就行，IO 类的函数我们以后还会深入讲解。

11.5.getline()

istream& getline (istream& is, string& str, char delim);
istream& getline (istream& is, string& str);

这个函数使用的频率也比较高，主要是从流 is 中获取一行输入 string 对象，直到遇到文件结束符 EOF 分隔符，由于返回的是流 is 的引用，因此可以重复读取。

//方式 1
string str1;
cin >> str1;
cout << "str1:" << str1 << endl;

//方式 2
string str2;
getline(cin, str2);
cout << "str2:" << str2 << endl;

//方式 3
string str3;
getline(cin, str3, '$');//遇到 $就停止读取，抛弃掉$，不再进入后续的输入
cout << "str3:" << str3 << endl;

方式 1 不可以输入带有空格的，方式 2 可以，方式 3 可以指定结束符号。

12.string 相关题目练习

12.1.字符串中的第一个唯一字符

class Solution
{
public:
    int firstUniqChar(string s)
    {
        //1.开始计数，生成计数表
        int arr[26] = { 0 };//先定义一个数组存储每个字符的出现个数
        for (auto ch : s)
        {
            arr[ch - 'a']++;//开始计数
        }

        //2.根据 string 对象和计数表寻找索引
        int i = 0;
        for (i = 0; i < s.size(); i++)
        {
            if (arr[s[i] - 'a'] == 1)
            {
                return i;
            }
        }
        return -1;
    }
};

12.2.将网址分割为：协议、域名、资源名

int mian()
{
    //1.输入网址 url
    string url;//https->://baidu.com->/limou/file/text
    cin >> url;
    //2.寻找并且拆解 url
    string protocol;//协议
    string domain;//域名
    string resource;//资源
    //2.1.寻找协议
    int index1 = url.find(":");//5
    if (index1 == string::npos)
    {
        cout << "没有协议，退出程序" << endl;
        exit(-1);
    }
    //2.2.寻找域名
    int index2 = url.find("/", index1 + 3);//17
    if (index2 == string::npos)
    {
        index2 = url.size();
    }
    //2.3.赋值三个对象
    protocol = url.substr(0, index1);
    domain = url.substr(index1 + 3, index2 - (index1 + 3));//17-5-3 = 12-3 = 9
    if (index2 != url.size())
    {
        resource = url.substr(index2 + 1);
    }

    //3.输出结果
    cout << "protocol = " << protocol << endl;
    if (domain.size() != 0)
    {
        cout << "domain = " << domain << endl;
    }
    else
    {
        cout << "没有域名" << endl;
    }
    if(resource.size() != 0)
    { 
        cout << "resource = " << resource << endl;
    }
    else
    {
        cout << "没有资源" << endl;
    }
    return 0;
}

12.3.仅仅反转字母

class Solution 
{
public:
    bool IsLetter(const char& ch)
    {
        if ((ch >= 'a' && ch <= 'z')
            || (ch >= 'A' && ch <= 'Z'))
        {
            return true;
        }
        return false;
    }
    string reverseOnlyLetters(string str)
    {
        auto begin = 0;
        auto end = str.size() - 1;
        while (begin < end)
        {
            while (!IsLetter(str[begin]) && begin < end)
            {
                begin++;
            }
            while (!IsLetter(str[end]) && begin < end)
            {
                end--;
            }
            swap(str[begin], str[end]);
            begin++;
            end--;
        }
        return str;
    }
};

12.4.字符串最后一个单词的长度

#include <iostream>
using namespace std;
int main()
{
    //1.输入字符句子
    string str;
    getline(cin, str);//hello nowcode
    //2.逆向寻找
    size_t index = str.rfind(" ");//得到 5
    cout << str.size() - index - 1 << endl;//13-5 = 8  
}

但是这么做有些不严谨，再严谨点就是下面代码：

#include <iostream>
using namespace std;

int main()
{
    //1.输入字符句子
    string str;
    getline(cin, str);//hello nowcode
    //2.逆向寻找
    size_t index = str.rfind(" ");//得到 5
    if (index != string::npos)
    {
        cout << str.size() - index - 1 << endl;//13-5 = 8   
    }
    else
    {
        cout << str.size();
    }
}

12.5.验证回文串

class Solution
{
public:
    bool IsLetterAndNumber(const char& ch)//判断是否为字符或者数字
    {
        return((ch >= 'a' && ch <= 'z')
            || (ch >= 'A' && ch <= 'Z')
            || (ch >= '0' && ch <= '9'));
    }
    bool IsOk(char x, char y)//判断是否构成回文的特征
    {
        if (x == y)
        {
            return true;
        }
        if ((x >= 'a' && x <= 'z') && (y >= 'A' && y <= 'Z'))
        {
            if (x - 'a' + 'A' == y)
            {
                return true;
            }
        }
        else if ((y >= 'a' && x <= 'z') && (x >= 'A' && x <= 'Z'))
        {
            if (y - 'a' + 'A' == x)
            {
                return true;
            }
        }
        return false;
    }
    bool isPalindrome(string str)
    {
        //1.取得首尾索引
        //str = "A man, a plan, a canal: Panama"
        int begin = 0;//s [begin] = 'A' 
        int end = str.size() - 1;//str [end] = 'a'
        //2.循环判断回文
        while (begin < end)
        {
            while (begin < end && !IsLetterAndNumber(str[begin]))
            {
                begin++;
            }
            while (begin < end && !IsLetterAndNumber(str[end]))
            {
                end--;
            }
            if (!IsOk(str[begin], str[end]))//如果判断出是相同的就返回 true
            {
                return false;
            }
            begin++;
            end--;
        }
        return true;
    }
};

#pragma once
#define _CRT_SECURE_NO_WARNINGS 1
#include <iostream>
#include <string>
#include <cassert>
namespace limou//标准命名空间
{
    class string
    {
    public:
        typedef char* iterator;//迭代器的类型，迭代器实际上是重命名来的
        typedef const char* const_iterator;//常量迭代器
    public:
        //1.成员函数
        //1.1.构造函数
        string(const char* s = "")//缺省参数传递空字符
        {
            std::cout << "string(const char* s = "")" << std::endl;
            _size = strlen(s);//str 的长度，不计入'\0'
            _capacity = 15;//str 的默认容量，不计入'\0'
            _s = new char[_capacity + 1];//默认申请 16 个空间其中一个存储'\0'
            while (_size > _capacity)//如果容量不够了，就会进行扩容
            {
                delete[] _s;
                _s = new char[_capacity * 2];
                _capacity *= 2;
            }
            strcpy(_s, s);
        }
        //1.2.析构函数
        ~string()
        {
            std::cout << "~string()" << std::endl;
            delete[] _s;
        }
        //1.3.拷贝构造+赋值重载函数
        string(const string& str)
            : _s(nullptr)//这里的初始化列表就可以避免后续交换的时候，析构 cache 出现奔溃的现象（delete 了随机值）
            , _size(0)
            , _capacity(0)
        {
            //1.传统写法
            //std:: cout << " string(const string& str)" << std:: endl;
            //_size = str._size;
            //_capacity = str._capacity;
            //_s = new char [_capacity + 1];
            //strcpy(_s, str._s);
            //2.现代写法
            std::cout << "string(const string& str)" << std::endl;
            string cache(str._s);//构造一个
            std::swap(_s, cache._s);
            std::swap(_size, cache._size);
            std::swap(_capacity, cache._capacity);
        }
        //string& operator =(const string& str)
        //{
        //    //1.传统写法
        //    //if (this != &str)
        //    //{
        //    //    std:: cout << " string operator =(const string& str)" << std:: endl;
        //    //    char* cache = new char [str._capacity + 1];
        //    //    strcpy(cache, str._s);
        //    //    delete [] _s;
        //    //    _s = cache;
        //    //    _size = str._size;
        //    //    _capacity = str._capacity;
        //    //}
        //    //return *this;
        //    //2.现代写法（实际上和传统写法的效率差别不大）
        //    //string str1("abcd");
        //    //string str2;
        //    //str2 = str1;
        //    std:: cout << " string operator =(const string& str)" << std:: endl;
        //    if (this != &str)
        //    {
        //        string cache(str);//这里是调用了拷贝构造
        //        std:: swap(_s, cache._s);
        //        std:: swap(_size, cache._size);
        //        std:: swap(_capacity, cache._capacity);
        //    }
        //    return *this;
        //}
        //如果写得再现代一点，还能吧拷贝的步骤省略
        string& operator=(string cache)//这里是调用了拷贝构造
        {
            std::cout << "string operator=(const string& str)" << std::endl;
            std::swap(_s, cache._s);
            std::swap(_size, cache._size);
            std::swap(_capacity, cache._capacity);
            return *this;
        }
        //2.迭代器
        iterator begin()
        {
            return _s;
        }
        iterator end()
        {
            return _s + _size;//若 "abcd", size = 4,_s-> a,_s+4->'\0'
        }
        const_iterator begin() const
        {
            return _s;
        }
        const_iterator end() const
        {
            return _s + _size;
        }
        //3.容积
        const char* c_str() const
        {
            return _s;
        }
        const size_t size() const
        {
            return _size;
        }
        const size_t capacity() const
        {
            return _capacity;
        }
        void reserve(size_t n)//改变容量
        {
            if (n > _capacity)
            {
                _capacity = n;//改变容量
                char* cache = new char[n + 1];//创建对应容量的空间
                strcpy(cache, _s);//拷贝源字符串数据，会拷贝'\0'
                delete[] _s;//销毁源字符串
                _s = cache;//
            }
            else
            {
                std::cout << "reserve操作只能扩容" << std::endl;
            }
        }
        void resize(size_t n, char ch = '\0')
        {
            if (n <= _size)
            {
                _s[n] = '\0';
                _size = n;
            }
            else
            {
                reserve(n);
                while (_size < n)
                {
                    _s[_size] = ch;
                    _size++;
                }
                _s[_size] = '\0';
            }
        }
        void clear()
        {
            erase(0);
        }
        //3.访问
        char& operator[](size_t pos)
        {
            assert(pos <= _size);//这里设计的和库是一样的，可以访问到'\0'
            return _s[pos];
        }
        //4.修改器
        void push_back(char ch)//push_back 单字符
        {
            //若 "abcd",_size = 4,_capacity = 15, 空间足够, 无需扩容
            //若 "aaaaabbbbbccccc",_size = 15,_capacity = 15, 空间不够, 需要扩容
            if (_size == _capacity)
            {
                reserve(_capacity * 2);//改变容量为原来的 2 倍
            }
            _s[_size++] = ch;
            _s[_size] = '\0';
        }
        void append(const char* s)//append 字符串
        {
            size_t len = _size + strlen(s);
            if (len > _capacity)
            {
                reserve(len);//改变容量
            }
            strcpy(_s + _size, s);
            _size += strlen(s);
        }
        string& operator+=(char ch)
        {
            push_back(ch);
            return *this;
        }
        string& operator+=(const char* str)
        {
            append(str);
            return *this;
        }
        void inster(size_t pos, char ch)
        {
            //个人看法：顺序表挪动数据从越界位置的索引开始比较好（也就是下面 end =_size + 1, 从越界的 end 开始）
            //"abcdef\0", pos = 2, pos-> c, s[_size]='\0', "ab cdef"
            //"\0", pos = 0, pos->'\0', s[_size]='\0', " \0"
            assert(pos <= _size);
            if (_size == _capacity)
            {
                reserve(_capacity * 2);
            }
            size_t end = _size + 1;
            while (end > pos)
            {
                _s[end] = _s[end - 1];
                end--;
            }
            _s[pos] = ch;
            _size++;
        }
        void inster(size_t pos, const char* s)
        {
            //"abcde\0", pos = 1, pos-> b, s = "xxx", "abcde\0"-> "a   bcde\0"
            assert(pos <= _size);
            size_t len = strlen(s);
            if (len + _size > _capacity)//容量不够
            {
                reserve(len + _size);//开辟好足够的空间
            }
            size_t end = _size + len;//end = 5+3 = 8
            while (end > pos)//8 > 1
            {
                _s[end] = _s[end - len];//_s [8] = _s [5]
                end--;
            }
            strncpy(_s + pos, s, len);
            _size += len;
        }
        void erase(size_t pos, size_t len = npos)//len 没有指定就会全部删除
        {
            //擦除字符串的一部分，并且减少_size
            //"bacdef\0", pos = 1, len = 3, "b   ef\0"-> "bef\0"
            assert(pos <= _size);
            if (_size == 0)//如果字符串大小为 0 就直接返回，没必要清理了
                return;
            if (len == npos || pos + len >= _size)//如果没有指定 len 就直接删除 pos 之后的所有字符串（或者超出范围）
            {
                _s[pos] = '\0';
                _size = pos;
                return;
            }

            //对于指定 pos 就如下面处理
            size_t begin = pos + len;//begin = 1+3 = 4,_s [4] ='e'
            while (begin <= _size)//begin < 6
            {
                _s[begin - len] = _s[begin];//_s [1] =_s [4],'a'->'e'
                begin++;
            }
            _size -= len;
        }
        //5.比较（库里的写成全局是因为支持其他的类实例化）
        bool operator<(const string& str)
        {
            return (strcmp(_s, str._s) < 0);
        }
        bool operator==(const string& str)
        {
            return (strcmp(_s, str._s) == 0);
        }
        bool operator<=(const string& str)
        {
            return *this < str || *this == str;
        }
        bool operator>(const string& str)
        {
            return !(*this <= str);
        }
        bool operator!=(const string& str)
        {
            return !(*this == str);
        }
        bool operator>=(const string& str)
        {
            return !(*this < str);
        }
        //6.字符串运算
        size_t find(char ch, size_t pos = 0)
        {
            //"abcd\0", pos = 1, pos-> b, ch = d
            for (size_t i = pos; i < _size; i++)
            {
                if (_s[i] == ch)
                {
                    return i;
                }
            }
            return npos;
        }
        size_t find(const char* sub, size_t pos = 0)
        {
            //这里也可以使用 kmp 算法，但是对于新手来说比较复杂，并且在实践中的效率一般（哪怕 STL 中也没有使用这个算法）字符串匹配更好的方法是 BM 算法
            //这里我们就直接使用 C 语言提供的接口 strstr()
            const char* p = strstr(_s + pos, sub);
            if (p != NULL)
            {
                return p - _s;
            }
            return npos;
        }
        string substr(size_t pos, size_t len = npos)
        {
            //"abcdef\0", pos = 1, pos-> b, len = 4, substr = "bcde"
            //1.写法 1（冗余一些）
            //string str;
            //if (len == npos || pos + len >= _size)
            //{
            //    len = _size - pos;//重新计算长度
            //    str.reserve(len);//提前开好空间
            //    for (size_t i = pos; i < _size; i++)
            //    {
            //        str += _s [i];
            //    }
            //}
            //else
            //{
            //    str.reserve(len);//提前开好空间
            //    for (size_t i = pos; i < pos + len; i++)
            //    {
            //        str += _s [i];
            //    }
            //}
            //return str;
            //2.写法 2（精简一些）
            string str;
            size_t end = pos + len;
            if (len == npos || pos + len >= _size)
            {
                len = _size - pos;//重新计算长度
                end = _size;
            }
            str.reserve(len);//提前开好空间
            for (size_t i = pos; i < end; i++)
            {
                str += _s[i];
            }
            return str;
        }
    private:
        char* _s;
        size_t _size;
        size_t _capacity;
    public:
        static const size_t npos = -1;
        /*
        吐槽：单纯的静态成员不可以在这里定义（也没有办法传给初始化列表），
        但是 C++又搞了一个特例，const static 的整形变量可以直接在这里定义。
        这是一个比较离谱的设计，记着就行，一般也不会这么写，容易坑人……因为这个特例会破坏语法规则
        */
    };
    //const size_t string:: npos = -1;
    //7.流重载（并不一定要写成友元）
    std::ostream& operator<<(std::ostream& out, const string& str)
    {
        for (auto ch : str)
            out << ch;
        return out;
    }
    std::istream& operator>>(std::istream& in, string& str)
    {
        //1.这里注意下面这样的代码是解决不了问题的，因为 cin 遇到空格就会自动跳过
        //char ch;
        //in >> ch;
        //while (ch != ' ' && ch != '\n')
        //{
        //    str += ch;
        //    in >> ch;
        //}
        //return in;
        //2.更正写法如下：
        //str.clear();//为了避免变成尾插，还需要清楚原有的数据
        //char ch;
        //ch = in.get();//这是 istream 类提供的一个函数
        //while (ch != ' ' && ch != '\n')
        //{
        //    str += ch;
        //    ch = in.get();
        //}
        //return in;
        //3.不过上述扩容有些频繁，还可以在进行改进
        str.clear();//为了避免变成尾插，还需要清楚原有的数据

        char buff[128 + 1];//缓存区
        size_t i = 0;

        char ch;
        ch = in.get();//这是 istream 类提供的一个函数
        while (ch != ' ' && ch != '\n')
        {
            buff[i++] = ch;
            if (i == 128)
            {
                buff[i] = '\0';
                str += buff;//加载数据
                i = 0;//置空缓存区
            }
            ch = in.get();
        }
        if (i != 0)
        {
            buff[i] = '\0';
            str += buff;//加载数据
        }
        return in;
    }

    //8.测试
    //构造测试
    void test_1()
    {
        //1.库中的默认构造实现（测试）
        std::string str;
        std::cout << str.c_str() << std::endl;
        std::cout << "str.size = " << str.size() << std::endl;
        std::cout << "str.capacity = " << str.capacity() << std::endl;
        std::cout << "----" << std::endl << std::endl;

        //2.我们自己的构造实现（测试）
        //string()
        string str0;
        const char* s0 = str0.c_str();
        std::cout << s0 << std::endl;
        std::cout << "str0.size = " << str0.size() << std::endl;
        std::cout << "str0.capacity = " << str0.capacity() << std::endl;
        //string(const char* s)
        string str1("Hello, My name is Limou3434.");
        const char* s1 = str1.c_str();
        std::cout << s1 << std::endl;
        //string(const string& str)
        string str2(str1);
        const char* s2 = str2.c_str();
        std::cout << s2 << std::endl;
        //string& operator =(const string& str)
        string str3("aaaa");
        string str4("bbbb");
        string str5;
        str5 = (str4 = str3);
        const char* s3 = str3.c_str();
        const char* s4 = str4.c_str();
        const char* s5 = str5.c_str();
        std::cout << s3 << " | " << s4 << " | " << s5 << std::endl;
    }
    //迭代器测试
    void test_2()
    {
        //1.使用 [] 遍历访问
        string str1("Hello");
        for (int i = 0; str1[i]; i++)
        {
            std::cout << str1[i]++ << " ";
        }
        std::cout << std::endl;
        for (int i = 0; i < str1.size(); i++)
        {
            std::cout << str1[i] << " ";
        }
        std::cout << std::endl;

        //2.使用迭代器遍历
        string str2("Iamlimou");
        limou::string::iterator it = str2.begin();
        while(it < str2.end())
        {
            std::cout << (*it)++ << " ";
            it++;
        }
        std::cout << std::endl;
        it = str2.begin();
        while(it < str2.end())
        {
            std::cout << *it << " ";
            it++;
        }
        std::cout << std::endl;

        //3.使用范围 for，本质为迭代器（测试）
        string str3;
        str3 = "WordWordWord";
        for (auto& ch : str3)
        {
            std::cout << ch++ << " ";
        }
        std::cout << std::endl;
        for (auto& ch : str3)
        {
            std::cout << ch << " ";
        }
        std::cout << std::endl;
    }
    //修改器测试
    void test_3()
    {
        //1.push_back()
        string str1 = "abcd";
        str1.push_back('e');
        std::cout << str1.c_str() << std::endl;

        for (int i = 0; i < 50; i++)
        {
            str1.push_back('x');
            std::cout << str1.c_str() << std::endl;
        }
        std::cout << std::endl;

        //2.append()
        str1.append("________________________________");
        std::cout << str1.c_str() << std::endl;

        //3.operator+=()
        string str2;
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        str2 += '#';
        std::cout << str2.c_str() << std::endl;
        str2 += "######################################################";
        std::cout << str2.c_str() << std::endl;

        //4.inster()
        string str4;
        for (int i = 0; i < 10; i++)
        {
            str4.inster(0, '#');
            std::cout << str4.c_str() << std::endl;
        }
        str4.inster(1, '%');
        std::cout << str4.c_str() << std::endl;
        str4.inster(str4.size(), '%');
        std::cout << str4.c_str() << std::endl;

        string str5;
        str5.inster(0, "xxx");
        std::cout << str5.c_str() << std::endl;
        str5.inster(3, "aaaaaaaaaaaaaaa");
        std::cout << str5.c_str() << std::endl;
        str5.inster(5, "*");
        std::cout << str5.c_str() << std::endl;

        //5.erase()
        string str6("abcdef");
        str6.erase(2, 3);
        std::cout << str6.c_str() << std::endl;
        str6.erase(0);
        std::cout << str6.c_str() << std::endl;

        str6.push_back('c');
        std::cout << str6.c_str() << std::endl;

        string str7("xxcdef");
        str7.erase(2, 100);
        std::cout << str7.c_str() << std::endl;
    }
    //比较测试
    void test_4()
    {
        string str1 = "a";
        string str2 = "a";
        string str3 = "b";
        string str4 = "c";
        if (str1 < str3)
        {
            std::cout << "true" << std::endl;
        }        
        if (str1 == str2)
        {
            std::cout << "true" << std::endl;
        }        
        if (str1 <= str3)
        {
            std::cout << "true" << std::endl;
        }
        if (str1 <= str2)
        {
            std::cout << "true" << std::endl;
        }
        if (str4 > str3)
        {
            std::cout << "true" << std::endl;
        }        
        if (str1 != str3)
        {
            std::cout << "true" << std::endl;
        }        
        if (str4 >= str3)
        {
            std::cout << "true" << std::endl;
        }
        if (str4 >= str4)
        {
            std::cout << "true" << std::endl;
        }
    }
    //流测试
    void test_5()
    {
        string str1("abc");
        string str2("efg");
        std::cin >> str1;
        std::cin >> str2;
        std::cout << str1 << std::endl;
        std::cout << str2 << std::endl;
    }
    //容积测试
    void test_6()
    {
        string str1("abcdefehi");
        str1.resize(3);
        std::cout << str1.c_str() << std::endl;
        str1.resize(10, 'x');
        std::cout << str1.c_str() << std::endl;

        string str2;
        str2.resize(10, 'x');
        std::cout << str2.c_str() << std::endl;
        std::cout << str2.size() << std::endl;
        std::cout << str2.capacity() << std::endl;

        str2.resize(101, '?');
        std::cout << str2.c_str() << std::endl;
        std::cout << str2.size() << std::endl;
        std::cout << str2.capacity() << std::endl;

        str2 += "abcd";
        std::cout << str2.c_str() << std::endl;
        std::cout << str2.size() << std::endl;
        std::cout << str2.capacity() << std::endl;
    }
    //运算测试
    void test_7()
    {
        string str("limou3434");
        std::cout << str.find('i') << std::endl;

        std::cout << str.find("mou") << std::endl;

        string substr = str.substr(1, 3);
        std::cout << substr.c_str() << std::endl;

        substr = str.substr(4);
        std::cout << substr.c_str() << std::endl;
    }
    //sizeof 测试
    void test_8()
    {
        std::string str1;
        std::string str2 = "abcdabcdabcdabcdabcdabcd";
        std::cout << sizeof(str1) << std::endl;
        std::cout << sizeof(str2) << std::endl;
        //在库里面，会多出一个_buff [16]
        //如果字符数小于 16，就会存储到 buff 里
        //如果字符数大于 16，就会存储到_s 指向的空间里
        //_buff [16] 为 16 字节
        //_str 为 8 字节  
        //_size 为 8 字节
        //_capacity 为 8 字节
        //因此体现为 40 字节大小的空间
        //在 VS 的调试监视窗口中的原始视图里可以找到 buff 数组
    }

}
/*
注意我们写的只是 string 类，而不是 basic_string 类模板
而不是一个类模板，因此有些地方上面的代码对于其他的实例化可能不够适用
*/

/*
另外这里再提及一个浅拷贝的知识：
深拷贝的代价太大了，那么也没有其他办法解决浅拷贝的问题呢？
    1.首先浅拷贝的主要问题如下：
        1.1.析构了两次
        1.2.一次修改影响多个对象
    2.针对上述问题有一些解决方案：
        2.1.解决问题一：这个时候我们引入一个方法：“引用计数”，
                    “引用计数”可以记录有多少个对象指向同一块空间，
                    析构的时候就减少一次计数，除非减到 0 才可以释放。
                    （这里只是简单提及，以后使用智能指针的时候还会再提）
        2.2.解决问题二：上述“引用计数”解决了两次析构的问题，
                    但是没有解决一次修改多次改变的问题，
                    因此又引入了“写时拷贝”，只有写的时候才进行拷贝。
                    当“引用计数”不为 1 时，在写入空间的时候先进行深度拷贝再写入。
                    当“引用计数”为 1 就直接写入空间进行修改。
                    也就是说“写时拷贝”是一种‘拖延’的方案，最终还是要进行深拷贝，
                    只不过只在写的时候进行深拷贝而已
    3.在拷贝的时候如果使用了上述方案，如果没有进行“对空间写入”，就会提高一些效率
当然上述只是简单提及一下，为以后的知识进行铺垫罢了。

另外补充一下：在有些 VS 版本下没有采用上述的方案，是直接使用深度拷贝的，这点可以通过下面代码来证明。
#include <iostream>
#include <string>
using namespace std;
int main()
{
    string str1("abcd");
    string str2(str1);
    cout << (void*)str1.c_str() << endl;
    cout << (void*)str2.c_str() << endl;
    str2 += 'e';
    cout << (void*)str1.c_str() << endl;
    cout << (void*)str2.c_str() << endl;
    return 0;

}
而 G++下的 string 类是通过写时拷贝来实现的，string 对象总共占去 4 个字节（32 位），
内部只包含一个指针，该指针指向堆空间，内部包含:
    (1)空间总大小
    (2)字符串有效长度
    (3)引用计数
指向堆空间的指针（用来存储字符串）”
*/

补充：什么是 string::pos 呢？string::pos 实际上是 string 内部的常量成员，下面是其定义：

static const size_t npos = -1;

我们知道 size_t 是无符号整数类型的别名，因此 npos 取得了有符号的一个最大值，这基本不可能是 str 的长度，也可能是字符串的索引值。

len 默认值为 npos 的目的是：如果没有给予 len 的值，将会一直读取到字符串的末尾，这一点细节我们实现一个简易的 string 类再来细细探究。

补充：std::initializer_list 是一个模板类，它允许创建一个包含任意数量元素的列表。它类似于数组，可以通过下标访问其中的元素，但与数组不同的是，std::initializer_list 是不可变的，不能进行元素的添加或删除操作。（但是这不是初始化列表的意思！）

更新日志

2026/1/25 16:36

查看所有更新日志

403d4-修改 Github 工作流的配置，以方便未来支持前后端拓展于 2026/1/25

版权所有

版权归属：limou3434

许可证：署名 4.0 国际 (CC-BY-4.0)

String

本站公告